The University of Waikato - Te Whare Wānanga o Waikato
Centre for Open Software Innovation


Katoa is a Java-based toolkit for concept-based text processing. It draws on external knowledge bases and provides (1) methods for representing natural-language text by the concepts it mentions rather than by its words; (2) similarity measures that take the semantic relatedness among concepts into account; and (3) enhanced clustering methods that use this concept relatedness information. Katoa currently supports two knowledge bases: Wikipedia and WordNet.
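To make the idea of a relatedness-aware similarity measure concrete, here is a minimal Java sketch. It is not Katoa's API: the class `ConceptSimilaritySketch`, its methods, and the relatedness values are all hypothetical, and the measure shown is a generic soft cosine over concept weights, chosen only for illustration. A document is assumed to be a map from concept name to weight, and relatedness between two concepts is assumed to be a number in [0, 1] that a real system would derive from a knowledge base such as Wikipedia or WordNet.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative only: a soft-cosine similarity over concept vectors.
// None of these names come from Katoa; they are assumptions for this sketch.
class ConceptSimilaritySketch {
    // Hypothetical relatedness table, keyed by an unordered concept pair.
    private final Map<String, Double> relatedness = new HashMap<>();

    // Build an order-independent key so that (a, b) and (b, a) match.
    private static String key(String a, String b) {
        return a.compareTo(b) <= 0 ? a + "|" + b : b + "|" + a;
    }

    void setRelatedness(String a, String b, double value) {
        relatedness.put(key(a, b), value);
    }

    // Identical concepts are fully related; unknown pairs are unrelated.
    double relatedness(String a, String b) {
        if (a.equals(b)) {
            return 1.0;
        }
        return relatedness.getOrDefault(key(a, b), 0.0);
    }

    // Dot product in which every pair of concepts contributes,
    // scaled by how related the two concepts are.
    private double weightedDot(Map<String, Double> x, Map<String, Double> y) {
        double sum = 0.0;
        for (Map.Entry<String, Double> ex : x.entrySet()) {
            for (Map.Entry<String, Double> ey : y.entrySet()) {
                sum += ex.getValue() * ey.getValue()
                        * relatedness(ex.getKey(), ey.getKey());
            }
        }
        return sum;
    }

    // Soft cosine: the weighted dot product normalised by both documents.
    double similarity(Map<String, Double> x, Map<String, Double> y) {
        double denom = Math.sqrt(weightedDot(x, x)) * Math.sqrt(weightedDot(y, y));
        return denom == 0.0 ? 0.0 : weightedDot(x, y) / denom;
    }
}
```

With this sketch, a document containing only the concept "Car" and another containing only "Automobile" have similarity 0 when compared by exact concept match, but similarity 0.8 once the pair is given a (made-up) relatedness of 0.8. This is the kind of effect a relatedness-aware measure is meant to capture.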

Digital Library Group and Machine Learning Group

Project homepage


