Description
DKPro Similarity is an open source and completely free Java framework that can be used for text similarity: between two terms, between two lists of strings that represent entire documents, and between two texts based on a UIMA JCas representation.
DKPro Similarity's main goal is to provide a complete repository of text similarity measures that can be implemented using standardized interfaces. It is designed as an add-on for the DKPro Core software.
The application is comprised of various measures ranging from ones based on common subsequences and simple n-grams, to more complex ones, such as high-dimensional vector comparisons.
User Reviews for DKPro Similarity For Linux 2
-
DKPro Similarity For Linux offers a diverse range of text similarity measures. Ideal for text analysis projects, especially with DKPro Core integration.
-
DKPro Similarity For Linux offers a comprehensive set of text similarity measures. Great for comparing documents. User-friendly and free.