The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at
http://knime.com.
POS Filter
Filters all terms that don't have the specified POS tags assigned.
The terms are filtered out of the bag of words and the documents as well
if deep preprocessing is checked.
A term is kept if at least one of the specified POS tags is assigned.
If strict filtering is set, all specified POS tags have to be
assigned to a term.
Dialog Options
Preprocessing options
- Strict filtering
-
If strict filtering is checked, all specified POS tags have to be applied
to the terms. Otherwise at least one of the specified tags has to be
applied to the terms.
- Filter matching
-
If checked, terms which have the specified tags are filtered,
otherwise they are kept. If strict filtering is checked, all tags
must match to filter or keep a term.
- Tags
-
The tags to filter for.
Deep preprocessing options
- Deep preprocessing
-
If deep preprocessing is checked, the terms contained inside
the documents are preprocessed too, this means that the documents
themselves are changed too, which is more time consuming.
- Document column
-
Specifies the column containing the documents to preprocess.
- Append unchanged documents
-
If checked, the documents contained in the specified "Original
Document column" are appended unchanged even if deep preprocessing
is checked. This helps to keep the original documents in the
output data table without the agonizing pain of joining.
- Original Document column
-
Specifies the column containing the original documents which
can be attached unchanged.
- Ignore unmodifiable tag
-
If checked unmodifiable terms will be preprocessed too.
Ports
Input Ports
0 |
The input table which contains the terms to filter. |
Output Ports
0 |
The output table which contains the preprocessed terms.
|
This node is contained in KNIME Textprocessing Plug-in
provided by KNIME GmbH, Konstanz, Germany.