The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at http://knime.com.

Random Label Assigner (Data)

Assigns the labels based on the probabilities to the rows. For each row on inPort 1 a new StringCell is created randomly. Distribution for this new class column (respectively the name of the class and the probabilities) is read from the file on InPort 0

Dialog Options

Column name
the name of the new column
Seed
the random seed to produce a deterministic result
Name of category
The column of the first table containing the names of the categories
Probability
The column of the first table containing the probabilities to the categories

Ports

Input Ports
0 Should contain the class names (StringCell) and probabilities for each class (DoubleCell)
1 Original Data
Output Ports
0 The rows from InPort1 with one additional column, containing the new classes
This node is contained in KNIME Datageneration provided by KNIME GmbH, Konstanz, Germany.