Sophie

distrib > Altlinux > 4.1 > i586 > media > core > by-pkgid > 93e313ad94738d7ed06ce5d9d33c65c9

pear-I18N_UnicodeNormalizer-1.0.0-alt2.noarch.rpm

Description:

"...Unicode's normalization is the concept of character composition and
decomposition.
Character composition is the process of combining simpler characters into
fewer precomposed characters, such as the n character and the combining ~
character into the single ñ character. Decomposition is the opposite
process, breaking precomposed characters back into their component
pieces...
...Normalization is important when comparing text strings for searching
and sorting (collation)..." [Wikipedia]
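
As a minimal sketch of the composition and decomposition described in the quote, the snippet below uses Python's standard unicodedata module, not this package's own PHP API, which is not shown here. The variable names are illustrative only. It shows why normalization matters for comparison: the two spellings of the same letter differ as raw strings and match only after normalization.

```python
import unicodedata

# Two ways to write the same visible letter (n with tilde).
decomposed = "n\u0303"   # 'n' followed by U+0303 COMBINING TILDE
precomposed = "\u00f1"   # U+00F1 LATIN SMALL LETTER N WITH TILDE

# Composition: combine the base letter and the mark into one code point.
composed = unicodedata.normalize("NFC", decomposed)

# Decomposition: break the precomposed letter back into its pieces.
split = unicodedata.normalize("NFD", precomposed)

# A raw comparison sees different code point sequences.
raw_equal = decomposed == precomposed

# Comparing after normalizing both sides to the same form succeeds.
normalized_equal = composed == unicodedata.normalize("NFC", precomposed)

print(len(decomposed), len(composed))   # code point counts before and after NFC
print(raw_equal, normalized_equal)
```

Either form works for comparison as long as both sides use the same one.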
Performs the 4 normalizations:
NFD: Canonical Decomposition
NFC: Canonical Decomposition, followed by Canonical Composition
NFKD: Compatibility Decomposition
NFKC: Compatibility Decomposition, followed by Canonical Composition
Complies with the official Unicode.org regression test.
Uses UTF-8 binary strings natively but can normalize a string in any UTF
format.
Fully tested with PHPUnit. Code coverage is close to 100%.
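
The four forms and the "any UTF format" point can be illustrated with a short sketch. It uses Python's standard unicodedata module as a stand-in, not the package's PHP interface. The helper names all_forms and normalize_encoded are hypothetical, and decoding before normalizing is an assumption about one reasonable way to handle non-UTF-8 input, not a description of how the package does it.

```python
import unicodedata

FORMS = ("NFD", "NFC", "NFKD", "NFKC")

def all_forms(text):
    """Return the text normalized under each of the four Unicode forms."""
    return {form: unicodedata.normalize(form, text) for form in FORMS}

def normalize_encoded(data, encoding, form="NFC"):
    """Normalize bytes in a given UTF encoding (hypothetical helper).

    Decodes to a string, normalizes, and re-encodes in the same encoding.
    """
    text = data.decode(encoding)
    return unicodedata.normalize(form, text).encode(encoding)

# U+FB01 is the 'fi' ligature (compatibility character);
# U+00E9 is precomposed 'e' with acute accent.
sample = "\ufb01\u00e9"
results = all_forms(sample)

for form, value in results.items():
    # Show each result as a list of code points.
    print(form, [f"U+{ord(ch):04X}" for ch in value])

# UTF-16 input: 'n' plus combining tilde, composed under NFC.
utf16_in = "n\u0303".encode("utf-16-le")
utf16_out = normalize_encoded(utf16_in, "utf-16-le", "NFC")
```

The canonical forms (NFD, NFC) leave the ligature intact and only affect the accented letter, while the compatibility forms (NFKD, NFKC) also replace the ligature with the plain letters f and i.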
