Katoa
Katoa is a Java-based toolkit for concept-based text processing. It draws on external knowledge bases and provides: 1) methods for representing natural-language text by the concepts it mentions rather than by its words; 2) similarity measures that take the semantic relatedness among concepts into account; and 3) enhanced clustering methods that use this concept relatedness information. Katoa currently supports two knowledge bases: Wikipedia and WordNet.
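
To make the idea of a relatedness-aware similarity measure concrete, here is a minimal sketch. It is not Katoa's API: the class `ConceptSimilaritySketch`, its methods, the concept names, and the relatedness score of 0.8 are all hypothetical and chosen for illustration only. Each document is a map from concept to weight, and the similarity is a cosine generalized so that every pair of concepts contributes in proportion to its relatedness (often called a soft cosine). This is one common way to build such a measure, not necessarily the one Katoa uses.

```java
import java.util.Map;
import java.util.function.ToDoubleBiFunction;

// Illustrative sketch only; none of these names come from Katoa.
class ConceptSimilaritySketch {

    // Generalized dot product: every concept pair contributes
    // weight(a) * weight(b) * relatedness(a, b).
    static double relatedDot(Map<String, Double> a, Map<String, Double> b,
                             ToDoubleBiFunction<String, String> relatedness) {
        double sum = 0.0;
        for (Map.Entry<String, Double> x : a.entrySet()) {
            for (Map.Entry<String, Double> y : b.entrySet()) {
                sum += x.getValue() * y.getValue()
                        * relatedness.applyAsDouble(x.getKey(), y.getKey());
            }
        }
        return sum;
    }

    // Cosine-style similarity normalized by the generalized norms.
    // Returns 0.0 when either document has a zero norm.
    static double similarity(Map<String, Double> a, Map<String, Double> b,
                             ToDoubleBiFunction<String, String> relatedness) {
        double norm = Math.sqrt(relatedDot(a, a, relatedness)
                * relatedDot(b, b, relatedness));
        return norm == 0.0 ? 0.0 : relatedDot(a, b, relatedness) / norm;
    }

    // Assumed toy relatedness; a real score would be derived from a
    // knowledge base such as Wikipedia or WordNet.
    static double toyRelatedness(String c1, String c2) {
        if (c1.equals(c2)) {
            return 1.0;
        }
        boolean carPair =
                (c1.equals("Jaguar (car)") && c2.equals("Automobile"))
                || (c1.equals("Automobile") && c2.equals("Jaguar (car)"));
        return carPair ? 0.8 : 0.0;
    }

    public static void main(String[] args) {
        Map<String, Double> doc1 = Map.of("Jaguar (car)", 1.0);
        Map<String, Double> doc2 = Map.of("Automobile", 1.0);

        // Exact concept matching only: the documents share no concept.
        double exact = similarity(doc1, doc2, (x, y) -> x.equals(y) ? 1.0 : 0.0);
        // Relatedness-aware: the two concepts are treated as related.
        double related = similarity(doc1, doc2, ConceptSimilaritySketch::toyRelatedness);

        System.out.println("exact match only: " + exact);
        System.out.println("with relatedness: " + related);
    }
}
```

With exact matching the two documents score 0.0 because they share no concept. With the toy relatedness function they score 0.8, which is the effect a relatedness-aware measure is meant to capture.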