Letter T

textcat - Written language identification

Website: http://www.let.rug.nl/~vannoord/TextCat/
License: LGPLv2+
Vendor: Fedora Project
Description:
TextCat is an implementation of the text categorization algorithm
presented in Cavnar, W. B. and J. M. Trenkle, "N-Gram-Based Text
Categorization".  TextCat uses this technique to implement written
language identification.  At the moment, it knows about 69 natural
languages (counting Esperanto as a natural language).
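
The N-gram approach named above can be illustrated with a short Python
sketch. This is not TextCat's own code: the function names, the
underscore padding, and the max_n and top_k defaults are assumptions
chosen for illustration. The idea is to rank the character n-grams of a
text by frequency and to compare that ranking against one ranking per
language using an "out-of-place" distance; the language with the
smallest distance is the guess.

```python
from collections import Counter


def ngram_profile(text, max_n=3, top_k=300):
    """Return the top_k most frequent character n-grams, most frequent first."""
    counts = Counter()
    for word in text.lower().split():
        padded = "_" + word + "_"  # mark word boundaries (assumed padding)
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts, key=lambda g: (-counts[g], g))
    return ranked[:top_k]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams missing from lang_profile get a fixed penalty."""
    rank = {g: r for r, g in enumerate(lang_profile)}
    penalty = len(lang_profile)
    return sum(
        abs(r - rank[g]) if g in rank else penalty
        for r, g in enumerate(doc_profile)
    )


def identify(text, lang_profiles):
    """Return the key of the language profile closest to the text."""
    doc = ngram_profile(text)
    return min(lang_profiles, key=lambda lang: out_of_place(doc, lang_profiles[lang]))


# Toy training samples (hypothetical; real profiles need far more text).
samples = {
    "english": "the quick brown fox jumps over the lazy dog and the cat",
    "dutch": "de snelle bruine vos springt over de luie hond en de kat",
}
profiles = {lang: ngram_profile(sample) for lang, sample in samples.items()}
guess = identify(samples["english"], profiles)
```

Profiles built from a single sentence are only good enough to show the
mechanics; a usable profile needs a much larger sample of each language.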

Packages

textcat-1.10-1.el7.noarch [199 KiB]
Changelog by Björn Esser (2014-03-12):
- initial rpm release (#1075662)
