Sophie

python-guess-language-0.2-8.mga6.noarch.rpm

Description:

Attempts to determine the natural language of a selection of Unicode (UTF-8)
text.

Based on guesslanguage.cpp by Jacob R Rideout for KDE, which is itself based on
Language::Guess by Maciej Ceglowski.

Detects over 60 languages: all languages listed in the trigrams directory, plus
Japanese, Chinese, Korean and Greek.

guess_language uses heuristics based on the character set and trigrams in a
sample text to detect the language. It works better with longer samples and
will be confused if the sample text includes markup such as HTML tags.
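
The trigram heuristic described above can be illustrated with a small,
self-contained sketch. This is not the package's own code: the function names
(strip_markup, trigram_profile, distance, guess), the regular expressions, and
the rank-difference scoring are assumptions chosen for illustration, and the
toy models would be built from whatever reference texts the caller supplies.

```python
import re
from collections import Counter


def strip_markup(text):
    """Crudely remove HTML-like tags so they do not pollute the trigrams.

    Assumption: a simple regex is enough for illustration; real markup
    may need a proper parser.
    """
    return re.sub(r"<[^>]+>", " ", text)


def trigram_profile(text, top=300):
    """Return the most frequent character trigrams, most common first."""
    # Lowercase, replace punctuation and digits with spaces, collapse runs
    # of whitespace, and pad so word boundaries produce trigrams too.
    cleaned = re.sub(r"[^\w\s]|\d", " ", text.lower())
    cleaned = " " + " ".join(cleaned.split()) + " "
    counts = Counter(cleaned[i:i + 3] for i in range(len(cleaned) - 2))
    return [trigram for trigram, _ in counts.most_common(top)]


def distance(sample_profile, model_profile):
    """Sum of rank differences; trigrams absent from the model cost the most."""
    model_rank = {trigram: rank for rank, trigram in enumerate(model_profile)}
    max_penalty = len(model_profile)
    return sum(
        abs(rank - model_rank[trigram]) if trigram in model_rank else max_penalty
        for rank, trigram in enumerate(sample_profile)
    )


def guess(text, models):
    """Return the key of the model whose profile is closest to the text."""
    sample = trigram_profile(strip_markup(text))
    return min(models, key=lambda lang: distance(sample, models[lang]))
```

Here models is a hypothetical dict mapping a language code to a profile built
with trigram_profile from reference text in that language. Short samples yield
few trigrams, so their ranking is noisy, which is consistent with the note that
longer samples work better.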
