GitHub - theelous3/sansio-tld-parser: A top level domain parser with no builtin io.

A sansio tld parser.

Given a tld rule list and a domain, parse the tld from the domain.

Yes, it's more complicated than domain.split(".") ;)
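To see why a plain split falls short, here is a minimal, self-contained sketch. It does not use this library: `TOY_RULES`, `naive_tld` and `longest_suffix` are made-up names for illustration, and the toy rule set ignores the wildcard and exception rules found in the real list.

```python
# Toy rule set for illustration only; the real public suffix list is far larger
# and also has wildcard (*.) and exception (!) rules that this sketch ignores.
TOY_RULES = {"com", "uk", "co.uk"}


def naive_tld(domain):
    """Take whatever follows the last dot."""
    return domain.split(".")[-1]


def longest_suffix(domain, rules=TOY_RULES):
    """Return the longest suffix of `domain` present in `rules`, or None."""
    labels = domain.lower().split(".")
    # Try candidates from longest to shortest so "co.uk" beats "uk".
    for start in range(len(labels)):
        candidate = ".".join(labels[start:])
        if candidate in rules:
            return candidate
    return None


print(naive_tld("some_subdomain.domain.co.uk"))       # uk
print(longest_suffix("some_subdomain.domain.co.uk"))  # co.uk
```

The naive version stops at "uk", while the rule-aware version finds "co.uk", and it can only do that because it has a rule list to consult.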

Why is this sansio? What has io got to do with it?

Parsing TLDs requires actually knowing all of the TLDs. These are maintained in a list online.

For some reason, all of the TLD parsers out there at the moment like to handle lookups to these lists internally, making them awkward to couple with whatever your flavour of application is.

This is a get-the-list-yourself situation.

Installation

pip install tld-parser :)

Use

You'll need access to the public suffix list:

Canonical: https://publicsuffix.org/list/public_suffix_list.dat

Git hosted: https://raw.githubusercontent.com/publicsuffix/list/master/public_suffix_list.dat

>>> from tld_parser import parse_rule_list, parse_domain
>>>
>>> # Any HTTP client will do; some_http_client is a placeholder.
>>> from some_http_client import get
>>>
>>> list_url = "https://publicsuffix.org/list/public_suffix_list.dat"
>>> suffix_list = get(list_url).content
>>> # The parser expects a Sequence of rules in the same format as the public suffix list.
>>> suffix_list = suffix_list.decode().splitlines()
>>>
>>> tld_rules = parse_rule_list(suffix_list)
>>>
>>> parse_domain(tld_rules, "some_subdomain.domain.co.uk")
Result(registrable_part='some_subdomain.domain', tld='co.uk')

And out pops a Result object :)
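Since the parser does no io itself, the rule lines can just as well come from a copy of the list saved on disk. A minimal sketch, assuming the file was downloaded beforehand; `read_rule_lines` is a hypothetical helper, not part of tld_parser.

```python
from pathlib import Path


def read_rule_lines(path):
    """Read a saved copy of the public suffix list into a list of lines."""
    # The list is published as UTF-8 text, one rule (or comment) per line.
    return Path(path).read_text(encoding="utf-8").splitlines()
```

The returned list should be usable in place of the downloaded `suffix_list` above, e.g. `parse_rule_list(read_rule_lines("public_suffix_list.dat"))`, where the file name is whatever you saved the list as.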

Breaking changelog

Raises TLDParserError when parsing fails, rather than returning None, lol. Raises NoRegisterablePart when the TLD is valid but the domain given was just a TLD. Raises NoTLDMatch when no TLD could be found.
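If your code relied on the old None return value, a small wrapper can bring it back. This is only a sketch: `parse_or_none` is a hypothetical helper, not part of tld_parser, and it takes the parser function and the exception type as arguments so that nothing is assumed about where the exception classes are importable from.

```python
def parse_or_none(parse, rules, domain, error_type):
    """Call parse(rules, domain); return None if it raises error_type."""
    try:
        return parse(rules, domain)
    except error_type:
        # Mimic the pre-change behaviour of returning None on failure.
        return None
```

With the library this would presumably look like `parse_or_none(parse_domain, tld_rules, "co.uk", TLDParserError)`, assuming TLDParserError can be imported from tld_parser and that NoRegisterablePart and NoTLDMatch derive from it. Neither is confirmed here, so check the package source first.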
