GitHub - rwarren/py2casefold: casefold for python 2.

`py2casefold`

Python 3 has str.casefold(). Python 2 doesn't. py2casefold brings casefolding support to Python 2.

Installation

pip install py2casefold

Usage

>>> from py2casefold import casefold
>>> print casefold(u"tschüß")
tschüss
>>> casefold(u"ΣίσυφοςﬁÆ") == casefold(u"ΣΊΣΥΦΟσFIæ") == u"σίσυφοσfiæ"
True

Note that casefold does not normalize the string. Casefolding and normalization are different operations. For more info see http://www.w3.org/International/wiki/Case_folding, and http://www.w3.org/TR/charmod-norm/.

If you are looking for string similarity you will also probably want to consider one of the unicode normalization options (NFC, NFKC, NFD, NFKD) that are available with Python's built in unicodedata.normalize().

Speed

At the moment, this pure Python casefold implementation is significantly (> 20x) slower than the optimized py3 C implementation. This can be improved later, but it is currently more than sufficient for basic case folding. As a rough estimate, case folding 100 characters clocks in at ~25μs on an old developer laptop.

Tests

To run the tests on all supported Python version simple use tox.

tox

You will need to have Python 2.7, Python 3.4, Python 3.5 and Python 3.6 installed.

License

BSD and the Unicode license agreement. This module includes data from the Unicode consortium which should include the appropriate notice (see http://unicode.org/copyright.html).

See LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
py2casefold		py2casefold
tests		tests
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.rst		README.rst
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

`py2casefold`

Installation

Usage

Speed

Tests

License

About

Releases

Packages

Contributors 3

Languages

License

rwarren/py2casefold

Folders and files

Latest commit

History

Repository files navigation

py2casefold

Installation

Usage

Speed

Tests

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

`py2casefold`

Packages