The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at http://knime.com.
This node assigns a part-of-speech (POS) tag to each term of a document. The tag set depends on the language of the tagger model: the Penn Treebank tag set is used with models for English texts (for details see http://www.cis.upenn.edu/~treebank), and the STTS tag set is used with models for German texts (for details see http://www.ims.uni-stuttgart.de/projekte/CQPDemos/Bundestag/help-tagset.html). The underlying tagger model, which decides which tag to assign to each term, is a model of the Stanford framework (see http://nlp.stanford.edu/software/tagger.shtml for details).
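To make the difference between the two tag sets concrete, the following minimal Python sketch pairs a few terms with tags and looks up what each tag means. It is an illustration only, not the node's implementation and not the Stanford tagger's API: the tags are assigned by hand, the dictionaries hold only a small selection of tags from each set, and the names `TAG_SETS` and `describe` are hypothetical.

```python
# Illustrative only: a small selection of tags, not the full inventories.
PENN_TREEBANK = {
    "DT": "determiner",
    "NN": "noun, singular or mass",
    "VBZ": "verb, 3rd person singular present",
    "JJ": "adjective",
}
STTS = {
    "ART": "article",
    "NN": "common noun",
    "VVFIN": "finite full verb",
    "ADJA": "attributive adjective",
}

# The tag set follows the language of the tagger model (hypothetical keys).
TAG_SETS = {"en": PENN_TREEBANK, "de": STTS}


def describe(tagged_terms, language):
    """Return (term, tag, description) triples for already tagged terms."""
    tag_set = TAG_SETS[language]
    return [
        (term, tag, tag_set.get(tag, "unknown tag"))
        for term, tag in tagged_terms
    ]


# Hand-assigned tags for illustration, not actual tagger output.
english = [("The", "DT"), ("dog", "NN"), ("barks", "VBZ")]
german = [("Der", "ART"), ("Hund", "NN"), ("bellt", "VVFIN")]

for term, tag, meaning in describe(english, "en") + describe(german, "de"):
    print(f"{term}\t{tag}\t{meaning}")
```

Note that the same tag string can carry a different definition in each set (for example `NN`), so a tag is only meaningful together with the tag set it comes from.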
Input port 0 | The input table containing the documents to tag. |
Output port 0 | An output table containing the tagged documents. |