``), - lists (``
`` text, and links (````, if ``links=True``). + lists (````, ``
foo bar -``), ``
`` text, ``
`` line breaks, and links (````, if ``links=True``). Extraction of particular elements and attributes such as links, alt texts, or form fields can be configured individually by setting the corresponding parameter to ``True``. diff --git a/tests/resiliparse/extract/test_html2text.py b/tests/resiliparse/extract/test_html2text.py index 813fb9ba..0258ea1e 100644 --- a/tests/resiliparse/extract/test_html2text.py +++ b/tests/resiliparse/extract/test_html2text.py @@ -98,7 +98,7 @@ def test_basic_extraction():baz +
baz
Copyright (C) 2021 Foo Bar""" @@ -118,7 +118,7 @@ def test_basic_extraction():
bar
baz +
baz
bar
baz +
baz
bar
Hello World
-Hello
World
!
Hello
World
Hello
World
Hello World
-Hello\nWorld\n\n\n\n!
+Hello
+World
Hello
+World