The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at
http://knime.com.
N Chars Filter
Filters all terms consisting of words with altogether less than
the specified number N characters.
Dialog Options
Preprocessing options
- N Chars
-
Specifies the number of minimum characters of a term.
Deep preprocessing options
- Deep preprocessing
-
If deep preprocessing is checked, the terms contained inside
the documents are preprocessed too, this means that the documents
themselves are changed too, which is more time consuming.
- Document column
-
Specifies the column containing the documents to preprocess.
- Append unchanged documents
-
If checked, the documents contained in the specified "Original
Document column" are appended unchanged even if deep preprocessing
is checked. This helps to keep the original documents in the
output data table without the agonizing pain of joining.
- Original Document column
-
Specifies the column containing the original documents which
can be attached unchanged.
- Ignore unmodifiable tag
-
If checked unmodifiable terms will be preprocessed too.
Ports
Input Ports
0 |
The input table which contains the terms to convert. |
Output Ports
0 |
The output table which contains the preprocessed terms.
|
This node is contained in KNIME Textprocessing Plug-in
provided by KNIME GmbH, Konstanz, Germany.