textcat - Written language identification
Website: | http://www.let.rug.nl/~vannoord/TextCat/ |
---|---|
License: | LGPLv2+ |
Vendor: | Fedora Project |
- Description:
TextCat is an implementation of the text categorization algorithm presented in Cavnar, W. B. and J. M. Trenkle, "N-Gram-Based Text Categorization". TextCat uses this the technique to implement a written language identification. At the moment, it knows about 69 natural languages (counting Esperanto as a natural language).
Packages
textcat-1.10-1.el7.noarch [199 KiB] |
Changelog
by Björn Esser (2014-03-12):
- initial rpm release (#1075662) |