aboutsummaryrefslogtreecommitdiffstatshomepage
path: root/Lib/test/test_unicode.py
Commit message (Collapse)AuthorAge
...
| * #8271: the utf-8 decoder now outputs the correct number of U+FFFD ↵Ezio Melotti2012-11-04
| | | | | | | | characters when used with the "replace" error handler on invalid utf-8 sequences. Patch by Serhiy Storchaka, tests by Ezio Melotti.
* | Issue #14700: merge tests from 3.3.Mark Dickinson2012-10-28
|\|
| * Issue #14700: merge tests from 3.2.Mark Dickinson2012-10-28
| |\
| | * Issue #14700: Fix buggy overflow checks for large precision and width in ↵Mark Dickinson2012-10-28
| | | | | | | | | | | | new-style and old-style formatting.
* | | Issue #16147: PyUnicode_FromFormatV() doesn't need anymore to allocate a bufferVictor Stinner2012-10-06
| | | | | | | | | | | | on the heap to format numbers.
* | | Issue #16147: Rewrite PyUnicode_FromFormatV() to use _PyUnicodeWriter APIVictor Stinner2012-10-06
|/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Simplify the code: replace 4 steps with one unique step using the _PyUnicodeWriter API. PyUnicode_Format() has the same design. It avoids to store intermediate results which require to allocate an array of pointers on the heap. * Use the _PyUnicodeWriter API for speed (and its convinient API): overallocate the buffer to reduce the number of "realloc()" * Implement "width" and "precision" in Python, don't rely on sprintf(). It avoids to need of a temporary buffer allocated on the heap: only use a small buffer allocated in the stack. * Add _PyUnicodeWriter_WriteCstr() function * Split PyUnicode_FromFormatV() into two functions: add unicode_fromformat_arg(). * Inline parse_format_flags(): the format of an argument is now only parsed once, it's no more needed to have a subfunction. * Optimize PyUnicode_FromFormatV() for characters between two "%" arguments: search the next "%" and copy the substring in one chunk, instead of copying character per character.
* | add another testcaseBenjamin Peterson2012-08-05
| |
* | Remove a now worthless test.Brett Cannon2012-05-12
| |
* | unicode_writer_finish() checks string consistencyVictor Stinner2012-05-09
| |
* | Close #14648: Compute correctly maxchar in str.format() for substrinVictor Stinner2012-04-23
| |
* | inherit maxchar of field value where needed (closes #14648)Benjamin Peterson2012-04-23
| |
* | str.format_map tests don't do what they say: fix to actually implement the ↵Eric V. Smith2012-03-12
|\| | | | | | | intent of the test. Closes #13450. Patch by Akira Li.
| * str.format_map tests don't do what they say: fix to actually implement the ↵Eric V. Smith2012-03-12
| | | | | | | | intent of the test. Closes #13450.
| * Issue #13093: Fix error handling on PyUnicode_EncodeDecimal()Victor Stinner2011-11-22
| | | | | | | | | | * Add tests for PyUnicode_EncodeDecimal() and PyUnicode_TransformDecimalToASCII() * Remove the unused "e" variable in replace()
* | add str.casefold() (closes #13752)Benjamin Peterson2012-01-14
| |
* | use full unicode mappings for upper/lower/title case (#12736)Benjamin Peterson2012-01-11
| | | | | | | | Also broaden the category of characters that count as lowercase/uppercase.
* | Close #13093: PyUnicode_EncodeDecimal() doesn't support error handlersVictor Stinner2011-11-25
| | | | | | | | | | different than "strict" anymore. The caller was unable to compute the size of the output buffer: it depends on the error handler.
* | (Merge 3.2) Issue #13093: Fix error handling on PyUnicode_EncodeDecimal()Victor Stinner2011-11-22
| |
* | Add a test on str.__getnewargs__()Victor Stinner2011-11-22
| | | | | | | | It tests indirectly PyUnicode_Copy(): ensure that the string is a copy.
* | Rewrite PyUnicode_EncodeDecimal() to use the new Unicode APIVictor Stinner2011-11-21
| | | | | | | | | | Add tests for PyUnicode_EncodeDecimal() and PyUnicode_TransformDecimalToASCII().
* | "unicode_internal" codec has been deprecated: fix related testsVictor Stinner2011-11-15
| |
* | Issue #13333: The UTF-7 decoder now accepts lone surrogatesAntoine Pitrou2011-11-15
|\| | | | | | | (the encoder already accepts them).
| * Issue #13333: The UTF-7 decoder now accepts lone surrogatesAntoine Pitrou2011-11-15
| | | | | | | | (the encoder already accepts them).
* | Fix range in test.Ezio Melotti2011-11-11
| |
* | Make test more inclusiveAntoine Pitrou2011-11-11
| |
* | Enable commented out testAntoine Pitrou2011-11-11
| |
* | Issue #13134: optimize finding single-character strings using memchrAntoine Pitrou2011-10-11
| |
* | test_unicode was forgetting to run the common string tests for str.find()Antoine Pitrou2011-10-08
|\|
| * test_unicode was forgetting to run the common string tests for str.find()Antoine Pitrou2011-10-08
| |
* | Mark 'abc'.expandtab() optimization as specific to CPythonVictor Stinner2011-10-07
| | | | | | | | Improve also str.replace(a, a) test
* | str.replace(a, a) is now returning str unchanged if a is aVictor Stinner2011-10-07
| |
* | #13054: fix usage of sys.maxunicode after PEP-393.Ezio Melotti2011-10-04
| |
* | When expandtabs() would be a no-op, don't create a duplicate stringAntoine Pitrou2011-10-04
| |
* | Optimize unicode_subtype_new(): don't encode to wchar_t and decode from wchar_tVictor Stinner2011-10-01
| | | | | | | | Rewrite unicode_subtype_new(): allocate directly the right type.
* | remove "fast-path" for (i)adding stringsBenjamin Peterson2011-09-30
| | | | | | | | | | | | These were just an artifact of the old unicode concatenation hack and likely just penalized other kinds of adding. Also, this fixes __(i)add__ on string subclasses.
* | Fix struct sizes. Drop -1, since the resulting string was actually the ↵Martin v. Löwis2011-09-28
| | | | | | | | | | | | largest one that could be allocated.
* | Implement PEP 393.Martin v. Löwis2011-09-28
| |
* | Merge indentation fix and skip decorator with 3.2.Ezio Melotti2011-08-23
|\|
| * Fix indentation and add a skip decorator.Ezio Melotti2011-08-23
| |
* | #9200: merge with 3.2.Ezio Melotti2011-08-22
|\|
| * #9200: The str.is* methods now work with strings that contain non-BMP ↵Ezio Melotti2011-08-22
| | | | | | | | characters even in narrow Unicode builds.
* | merge 3.2 (#12732)Benjamin Peterson2011-08-12
|\|
| * in narrow builds, make sure to test codepoints as identifier characters ↵Benjamin Peterson2011-08-12
| | | | | | | | | | | | (closes #12732) This fixes the use of Unicode identifiers outside the BMP in narrow builds.
* | Merge from 3.2.Eric V. Smith2011-07-18
|\|
| * Closes #12579. Positional fields with str.format_map() now raise a ↵Eric V. Smith2011-07-18
| | | | | | | | ValueError instead of SystemError.
* | merge from 3.2Senthil Kumaran2011-07-03
|\|
| * Fix closes issue12471 - wrong TypeError message when '%i' format spec was used.Senthil Kumaran2011-07-03
| |
* | #6780: merge with 3.2.Ezio Melotti2011-04-26
|\|
| * #6780: merge with 3.1.Ezio Melotti2011-04-26
| |\
| | * #6780: fix starts/endswith error message to mention that tuples are accepted ↵Ezio Melotti2011-04-26
| | | | | | | | | | | | too.