aboutsummaryrefslogtreecommitdiffstatshomepage
path: root/Lib/html
Commit message (Collapse)AuthorAge
* gh-97669: Create Tools/build/ directory (#97963)Victor Stinner2022-10-17
| | | | | | | | | | | | | | | | | | | | | | | Create Tools/build/ directory. Move the following scripts from Tools/scripts/ to Tools/build/: * check_extension_modules.py * deepfreeze.py * freeze_modules.py * generate_global_objects.py * generate_levenshtein_examples.py * generate_opcode_h.py * generate_re_casefix.py * generate_sre_constants.py * generate_stdlib_module_names.py * generate_token.py * parse_html5_entities.py * smelly.py * stable_abi.py * umarshal.py * update_file.py * verify_ensurepip_wheels.py Update references to these scripts.
* gh-95813: Improve HTMLParser from the view of inheritance (#95874)Dong-hee Na2022-08-18
| | | | | | | * gh-95813: Improve HTMLParser from the view of inheritance * gh-95813: Add unittest * Address code review
* gh-82927: Update files related to HTML entities. (GH-92504)Ezio Melotti2022-06-21
|
* Add source for character mappings (#92014)slateny2022-05-06
|
* bpo-45421: Remove dead code from html.parser (GH-28847)Alberto Mardegan2021-10-12
| | | | | Support for HtmlParserError was removed back in 2014 with commit 73a4359eb0eb624c588c5d52083ea4944f9787ea, however this small block was missed.
* Fix typos in the Lib directory (GH-28775)Christian Clauss2021-10-06
| | | | | Fix typos in the Lib directory as identified by codespell. Co-authored-by: Terry Jan Reedy <tjreedy@udel.edu>
* bpo-41748: Handles unquoted attributes with commas (#24072)Karl Dubost2021-02-01
| | | | | | | | | | | | | | | | | | * bpo-41748: Adds tests for unquoted attributes with comma * bpo-41748: Handles unquoted attributes with comma * bpo-41748: Addresses review comments * bpo-41748: Addresses review comments * Adds more test cases * Simplifies the regex for handling spaces * bpo-41748: Moves attributes tests under the right class * bpo-41748: Addresses review about duplicate attributes * bpo-41748: Adds NEWS.d entry for this patch
* bpo-37328: remove deprecated HTMLParser.unescape (GH-14186)Inada Naoki2019-08-27
| | | It is deprecated since Python 3.4.
* bpo-30629: Remove second call of str.lower() in html.parser.parse_endtag. ↵Motoki Naruse2017-06-16
| | | | | | (#2099) elem is the result of .lower() 6 lines above the handle_endtag call. Patch by Motoki Naruse
* Revert "Fixed a typo in the HTMLParser.feed docstrings" (#1771)Serhiy Storchaka2017-05-24
| | | | | * Revert "Fixed a typo in the HTMLParser.feed docstrings. The docstring started with an 'r', like a The docstring was correct. I read the patch in opposite direction, as *adding* the "r" prefix. This reverts commit 5ba185039f1bd465d3f82531324fd3fe1ee42f0c.
* Fixed a typo in the HTMLParser.feed docstrings. The docstring started with ↵Jani Šumak2017-05-23
| | | | an 'r', like a rawstring. (#1759)
* #27364: fix "incorrect" uses of escape character in the stdlib.R David Murray2016-09-08
| | | | | | | And most of the tools. Patch by Emanual Barry, reviewed by me, Serhiy Storchaka, and Martin Panter.
* Issue #27076: Doc, comment and tests spelling fixesMartin Panter2016-05-26
| | | | Most fixes to Doc/ and Lib/ directories by Ville Skyttä.
* Merge spelling fixes from 3.4 into 3.5Martin Panter2015-10-31
|\
| * Fix some spelling errors in documentation and code commentsMartin Panter2015-10-31
| |
* | #23144: merge with 3.4.Ezio Melotti2015-09-06
|\|
| * #23144: Make sure that HTMLParser.feed() returns all the data, even when ↵Ezio Melotti2015-09-06
| | | | | | | | convert_charrefs is True.
* | Issue #23181: More "codepoint" -> "code point".Serhiy Storchaka2015-01-18
|\|
| * Issue #23181: More "codepoint" -> "code point".Serhiy Storchaka2015-01-18
| |
* | #21047: set the default value for the *convert_charrefs* argument of ↵Ezio Melotti2014-08-02
| | | | | | | | HTMLParser to True. Patch by Berker Peksag.
* | Add an __all__ to html.entities.Ezio Melotti2014-08-02
| |
* | #15114: the strict mode and argument of HTMLParser, HTMLParser.error, and ↵Ezio Melotti2014-08-02
|/ | | | the HTMLParserError exception have been removed.
* #20288: merge with 3.3.Ezio Melotti2014-02-01
|\
| * #20288: fix handling of invalid numeric charrefs in HTMLParser.Ezio Melotti2014-02-01
| |
* | #13633: Added a new convert_charrefs keyword arg to HTMLParser that, when ↵Ezio Melotti2013-11-23
| | | | | | | | True, automatically converts all character references.
* | #19688: add back and deprecate the internal HTMLParser.unescape() method.Ezio Melotti2013-11-22
| |
* | #2927: Added the unescape() function to the html module.Ezio Melotti2013-11-19
| |
* | #19480: merge with 3.3.Ezio Melotti2013-11-07
|\|
| * #19480: HTMLParser now accepts all valid start-tag names as defined by the ↵Ezio Melotti2013-11-07
| | | | | | | | HTML5 standard.
* | #15114: The html.parser module now raises a DeprecationWarning when the ↵Ezio Melotti2013-11-02
| | | | | | | | strict argument of HTMLParser or the HTMLParser.error method are used.
* | #18020: improve html.escape speed by an order of magnitude. Patch by Matt ↵Ezio Melotti2013-07-07
| | | | | | | | Bryant.
* | #17802: merge with 3.3.Ezio Melotti2013-05-01
|\|
| * #17802: Fix an UnboundLocalError in html.parser. Initial tests by Thomas ↵Ezio Melotti2013-05-01
| | | | | | | | Barlow.
* | #14679: add an __all__ (that contains only HTMLParser) to html.parser.Ezio Melotti2013-05-01
|/
* #16245: Fix the value of a few entities in html.entities.html5.Ezio Melotti2012-10-23
|
* Reorder html.entities.html5 entities to make updates easier. Patch by ↵Ezio Melotti2012-10-23
| | | | Iuliia Proskurnia.
* #15156: HTMLParser now uses the new "html.entities.html5" dictionary.Ezio Melotti2012-06-24
|
* #11113: add a new "html5" dictionary containing the named character ↵Ezio Melotti2012-06-24
| | | | references defined by the HTML5 standard and the equivalent Unicode character(s) to the html.entities module.
* #15114: the strict mode of HTMLParser and the HTMLParseError exception are ↵Ezio Melotti2012-06-23
| | | | deprecated now that the parser is able to parse invalid markup.
* #14538: HTMLParser can now parse correctly start tags that contain a bare /.Ezio Melotti2012-04-18
|
* HTMLParser is now able to handle slashes in the start tag.Ezio Melotti2012-02-21
|
* Fix an index and clean up comments.Ezio Melotti2012-02-13
|
* Improve handling of declarations in HTMLParser.Ezio Melotti2012-02-13
|
* #13993: HTMLParser is now able to handle broken end tags when strict=False.Ezio Melotti2012-02-13
|
* #13960: HTMLParser is now able to handle broken comments when strict=False.Ezio Melotti2012-02-10
|
* #13358: HTMLParser now calls handle_data only once for each CDATA.Ezio Melotti2011-11-18
|
* #1745761, #755670, #13357, #12629, #1200313: improve attribute handling in ↵Ezio Melotti2011-11-14
| | | | HTMLParser.
* #670664: Fix HTMLParser to correctly handle the content of ↵Ezio Melotti2011-11-01
| | | | ``<script>...</script>`` and ``<style>...</style>``.
* #13273: fix a bug that prevented HTMLParser to properly detect some tags ↵Ezio Melotti2011-10-28
| | | | when strict=False.
* Fix issue12938 - Update the docstring of html.escape. Include the ↵Senthil Kumaran2011-09-13
| | | | information on single quote.