Sophie

python-guess-language-0.2-3.mga4.src.rpm

Description:

Attempts to determine the natural language of a sample of Unicode (UTF-8)
text.

Based on guesslanguage.cpp by Jacob R Rideout for KDE, which is itself based on
Language::Guess by Maciej Ceglowski.

Detects over 60 languages: every language listed in the trigrams directory, plus
Japanese, Chinese, Korean and Greek.

guess_language uses heuristics based on the character set and on the trigrams
(three-character sequences) in a sample text to detect the language. It works
better with longer samples, and it can give wrong results if the sample text
includes markup such as HTML tags.
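
The trigram idea can be illustrated with a small, self-contained sketch. This
is not the package's own code and does not call its API: the function names
(strip_markup, trigram_ranks, distance, guess), the three tiny training
sentences, and the rank-difference scoring are all assumptions chosen for
illustration. The real package ships precomputed trigram data and also applies
character-set heuristics, which this sketch leaves out.

```python
import re
from collections import Counter

# Hypothetical, tiny training samples; real trigram models are built
# from far larger corpora.
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and then "
          "the dog runs after the fox through the woods",
    "de": "der schnelle braune fuchs springt über den faulen hund und dann "
          "läuft der hund dem fuchs durch den wald nach",
    "fr": "le renard brun rapide saute par-dessus le chien paresseux et puis "
          "le chien court après le renard dans la forêt",
}

TAG_RE = re.compile(r"<[^>]+>")


def strip_markup(text):
    """Crudely replace HTML-like tags with spaces so they do not skew counts."""
    return TAG_RE.sub(" ", text)


def trigram_ranks(text, top=300):
    """Map the most frequent character trigrams to their frequency rank."""
    words = re.findall(r"[^\W\d_]+", text.lower())  # runs of letters only
    padded = " " + " ".join(words) + " "
    counts = Counter(padded[i:i + 3] for i in range(len(padded) - 2))
    return {tri: rank for rank, (tri, _) in enumerate(counts.most_common(top))}


MODELS = {lang: trigram_ranks(sample) for lang, sample in SAMPLES.items()}


def distance(sample_ranks, model_ranks, max_penalty=300):
    """Sum of rank differences; trigrams unknown to the model cost max_penalty."""
    return sum(
        abs(rank - model_ranks[tri]) if tri in model_ranks else max_penalty
        for tri, rank in sample_ranks.items()
    )


def guess(text):
    """Return the code of the closest model, or "UNKNOWN" for empty input."""
    ranks = trigram_ranks(strip_markup(text))
    if not ranks:
        return "UNKNOWN"
    return min(MODELS, key=lambda lang: distance(ranks, MODELS[lang]))


if __name__ == "__main__":
    print(guess("<p>the dog runs after the fox</p>"))
```

Stripping tags before counting reflects the caveat about markup: tag names
such as "div" or "span" would otherwise add trigrams that belong to no natural
language. With models this small the sketch is only reliable on text close to
its training sentences, which also hints at why longer samples work better.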
