2017 © Pedro Peláez
 

library tld-extract

TLDExtract, library for extracting parts of domain, e.q. domain parser

image

layershifter/tld-extract

TLDExtract, library for extracting parts of domain, e.q. domain parser

  • Tuesday, June 19, 2018
  • by LayerShifter
  • Repository
  • 8 Watchers
  • 127 Stars
  • 302,519 Installations
  • PHP
  • 12 Dependents
  • 0 Suggesters
  • 10 Forks
  • 1 Open issues
  • 17 Versions
  • 21 % Grown

The README.md

DEPRECATED

Consider to use https://github.com/jeremykendall/php-domain-parser as maintained alternative., (*1)

TLDExtract

TLDExtract accurately separates the gTLD or ccTLD (generic or country code top-level domain) from the registered domain and subdomains of a URL, e.g. domain parser. For example, say you want just the 'google' part of 'http://www.google.com'., (*2)

![Latest Version on Packagist][ico-version] Software License ![Build Status][ico-travis] Coverage Status ![Total Downloads][ico-downloads], (*3)


Everybody gets this wrong. Splitting on the '.' and taking the last 2 elements goes a long way only if you're thinking of simple e.g. .com domains. Think parsing http://forums.bbc.co.uk for example: the naive splitting method above will give you 'co' as the domain and 'uk' as the TLD, instead of 'bbc' and 'co.uk' respectively., (*4)

TLDExtract on the other hand knows what all gTLDs and ccTLDs look like by looking up the currently living ones according to the Public Suffix List. So, given a URL, it knows its subdomain from its domain, and its domain from its country code., (*5)

$result = tld_extract('http://forums.news.cnn.com/');
var_dump($result);

object(LayerShifter\TLDExtract\Result)#34 (3) {
  ["subdomain":"LayerShifter\TLDExtract\Result":private]=>
  string(11) "forums.news"
  ["hostname":"LayerShifter\TLDExtract\Result":private]=>
  string(3) "cnn"
  ["suffix":"LayerShifter\TLDExtract\Result":private]=>
  string(3) "com"
}

Result implements ArrayAccess interface, so you simple can access to its result., (*6)

var_dump($result['subdomain']);
string(11) "forums.news"
var_dump($result['hostname']);
string(3) "cnn"
var_dump($result['suffix']);
string(3) "com"

Also you can simply convert result to JSON., (*7)

var_dump($result->toJson());
string(54) "{"subdomain":"forums.news","hostname":"cnn","suffix":"com"}"

This package is compliant with PSR-1, PSR-2, PSR-4. If you notice compliance oversights, please send a patch via pull request., (*8)

Does TLDExtract make requests to Public Suffix List website?

No. TLDExtract uses database from TLDDatabase that generated from Public Suffix List and updated regularly. It does not make any HTTP requests to parse or validate a domain., (*9)

Requirements

The following versions of PHP are supported., (*10)

  • PHP 5.5
  • PHP 5.6
  • PHP 7.0
  • PHP 7.1
  • PHP 7.2
  • PHP 7.3
  • HHVM

Install

Via Composer, (*11)

``` bash $ composer require layershifter/tld-extract, (*12)

## Additional result methods

Class `LayerShifter\TLDExtract\Result` has some usable methods:
```php
$extract = new LayerShifter\TLDExtract\Extract();

# For domain 'shop.github.com'

$result = $extract->parse('shop.github.com');
$result->getFullHost(); // will return (string) 'shop.github.com'
$result->getRegistrableDomain(); // will return (string) 'github.com'
$result->isValidDomain(); // will return (bool) true
$result->isIp(); // will return (bool) false

# For IP '192.168.0.1'

$result = $extract->parse('192.168.0.1');
$result->getFullHost(); // will return (string) '192.168.0.1'
$result->getRegistrableDomain(); // will return null
$result->isValidDomain(); // will return (bool) false
$result->isIp(); // will return (bool) true

Custom database

By default package is using database from TLDDatabase package, but you can override this behaviour simply:, (*13)

new LayerShifter\TLDExtract\Extract(__DIR__ . '/cache/mydatabase.php');

For more details and how keep database updated TLDDatabase., (*14)

Implement own result

By default after parse you will receive object of LayerShifter\TLDExtract\Result class, but sometime you need own methods or additional functionality., (*15)

You can create own class that implements LayerShifter\TLDExtract\ResultInterface and use it as parse result., (*16)

class CustomResult implements LayerShifter\TLDExtract\ResultInterface {}

new LayerShifter\TLDExtract\Extract(null, CustomResult::class);

Parsing modes

Package has three modes of parsing: * allow ICANN suffixes (domains are those delegated by ICANN or part of the IANA root zone database); * allow private domains (domains are amendments submitted to Public Suffix List by the domain holder, as an expression of how they operate their domain security policy); * allow custom (domains that are not in list, but can be usable, for example: example, mycompany, etc)., (*17)

For keeping compatibility with Public Suffix List ideas package runs in all these modes by default, but you can easily change this behavior:, (*18)

use LayerShifter\TLDExtract\Extract;

new Extract(null, null, Extract::MODE_ALLOW_ICANN);
new Extract(null, null, Extract::MODE_ALLOW_PRIVATE);
new Extract(null, null, Extract::MODE_ALLOW_NOT_EXISTING_SUFFIXES);
new Extract(null, null, Extract::MODE_ALLOW_ICANN | Extract::MODE_ALLOW_PRIVATE);

Change log

Please see CHANGELOG for more information what has changed recently., (*19)

Testing

bash $ composer test, (*20)

Contributing

Please see CONTRIBUTING and CONDUCT for details., (*21)

License

This library is released under the Apache 2.0 license. Please see License File for more information., (*22)

The Versions

19/06 2018

dev-master

9999999-dev

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

19/06 2018

1.2.5

1.2.5.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

14/04 2018

1.2.4

1.2.4.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

14/04 2018

dev-fix/parser-lengths

dev-fix/parser-lengths

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

18/11 2017

1.2.3

1.2.3.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

17/10 2017

1.2.2

1.2.2.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

17/04 2017

1.2.1

1.2.1.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

17/11 2016

1.2.0

1.2.0.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

03/08 2016

1.1.1

1.1.1.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

29/06 2016

1.1.0

1.1.0.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

20/06 2016

1.0.0

1.0.0.0

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

20/06 2016

dev-dev

dev-dev

TLDExtract, library for extracting parts of domain, e.q. domain parser

  Sources   Download

Apache-2.0

The Requires

 

The Development Requires

by Alexander Fedyashov

domain parser tldextract

10/01 2016

0.2.0

0.2.0.0

TLDExtract package

  Sources   Download

MIT

The Requires

 

The Development Requires

by Alexander Fedyashov

24/11 2015

0.1.3

0.1.3.0

TLDExtract package

  Sources   Download

MIT

The Requires

 

The Development Requires

by Alexander Fedyashov

03/11 2015

0.1.2

0.1.2.0

TLDExtract package

  Sources   Download

MIT

The Requires

 

The Development Requires

by Alexander Fedyashov

25/10 2015

0.1.1

0.1.1.0

TLDExtract package

  Sources   Download

MIT

The Requires

 

The Development Requires

by Alexander Fedyashov

23/10 2015

0.1

0.1.0.0

TLDExtract package

  Sources   Download

MIT

The Requires

 

The Development Requires

by Alexander Fedyashov