The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at http://knime.com.

Strings To Document

Converts the specified strings to documents. For each row, a document is created and attached to that row. The strings in the specified columns are used as title, authors, and full text. In addition, the defined category, source, type, and date are set.
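The row-by-row conversion described above can be pictured with a short Python sketch. This is not the node's implementation or any KNIME API. The Document class, the strings_to_documents function, the "Document" output column name, and the whitespace trimming of author names are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Document:
    """Hypothetical stand-in for a document cell; not a KNIME class."""
    title: str
    authors: List[str]
    full_text: str
    category: str
    source: str
    doc_type: str
    pub_date: str


def strings_to_documents(rows: List[Dict[str, str]], title_col: str,
                         authors_col: str, text_col: str,
                         author_separator: str, category: str, source: str,
                         doc_type: str, pub_date: str) -> List[Dict[str, object]]:
    """Create one document per row and attach it in an extra column."""
    result = []
    for row in rows:
        # Split the author string at the separator.
        # Assumption: surrounding whitespace is trimmed and empty names dropped.
        authors = [name.strip()
                   for name in row[authors_col].split(author_separator)
                   if name.strip()]
        doc = Document(title=row[title_col], authors=authors,
                       full_text=row[text_col], category=category,
                       source=source, doc_type=doc_type, pub_date=pub_date)
        # Keep the original string columns and add the document.
        result.append({**row, "Document": doc})
    return result


rows = [{"Title": "A title", "Authors": "Ann Lee; Bob Ray", "Text": "Some text."}]
table = strings_to_documents(rows, "Title", "Authors", "Text",
                             author_separator=";", category="news",
                             source="web", doc_type="Book",
                             pub_date="21-03-2013")
print(table[0]["Document"].authors)
```

Every row receives the same category, source, type, and date in this sketch, while title, authors, and full text come from that row's own columns.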

Dialog Options

Title
The column containing the string which is used as title.
Full text
The column containing the string which is used as text.
Authors
The column containing the string which is split at the author name separator and used as author names.
Author name separator
The string separating the author names.
Document source
The source which is set to all documents (if "Use sources from column" is not checked).
Use sources from column
If checked, the string values of the specified column will be used as document sources.
Document source column
The column containing the string used as source. No source is set for missing values.
Document category
The category which is set to all documents (if "Use categories from column" is not checked).
Use categories from column
If checked, the string values of the specified column will be used as document categories.
Document category column
The column containing the string used as category. No category is set for missing values.
Document type
The type which is set to all documents.
Publication date
The publication date which is set to all documents. The date has to be formatted like "dd-mm-yyyy": two digits to specify the day, two to specify the month, and four to specify the year. The specified date has to be a valid date.
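Two of the rules in the options above lend themselves to a small Python sketch: the choice between a fixed value and a column value for source and category, and the check of the publication date. The function names resolve_setting and parse_publication_date are hypothetical, missing values are modeled as None by assumption, and the node itself may validate differently.

```python
import re
from datetime import date, datetime
from typing import Dict, Optional

# Two digits for the day, two for the month, four for the year.
DATE_PATTERN = re.compile(r"\d{2}-\d{2}-\d{4}")


def parse_publication_date(text: str) -> date:
    """Parse a "dd-mm-yyyy" string; raise ValueError if it is not valid."""
    if not DATE_PATTERN.fullmatch(text):
        raise ValueError(f"expected dd-mm-yyyy, got {text!r}")
    # strptime rejects impossible dates such as 31-02-2013.
    return datetime.strptime(text, "%d-%m-%Y").date()


def resolve_setting(row: Dict[str, Optional[str]], use_column: bool,
                    column: str, fixed_value: str) -> Optional[str]:
    """Pick the source or category for one row.

    If use_column is False, the fixed value applies to every document.
    Otherwise the row's column value is used; a missing value (None here,
    by assumption) means nothing is set for that document.
    """
    if not use_column:
        return fixed_value
    return row.get(column)


row = {"Source": "Reuters", "Category": None}
print(resolve_setting(row, False, "Source", "web"))
print(resolve_setting(row, True, "Source", "web"))
print(resolve_setting(row, True, "Category", "news"))
print(parse_publication_date("21-03-2013"))
```

The regular expression is there because strptime on its own also accepts single-digit days and months, which would not match the two-digit format stated above.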

Ports

Input Ports
0 An input data table containing string cells.
Output Ports
0 An output table containing the strings of the input table as well as the created documents in an additional column.
This node is contained in the KNIME Textprocessing Plug-in provided by KNIME GmbH, Konstanz, Germany.