The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at http://knime.com.

Auto-Binner

This node allows to group numeric data in intervals - called bins. There are two naming options for the bins and two methods which define the number and the range of values that fall in a bin. Please use the "Numeric Binner" node if you want to define custom bins.

Dialog Options

Column Selection:
Columns in the include list are processed separately. The columns in the exclude list are omitted by the node.
Binning Method:
Use Fixed number of bins for equal spaced intervals over the domain range and Sample quantiles to produces bins corresponding to the given list of probabilities. The smallest element corresponds to a probability of 0 and the largest do probability of 1. The applied estimation method is Type 7 which is the default method in R, S and Excel.
Bin Naming:
Use Numbered for bins labeled by an integer with prefix "Bin" or Borders for labels using '"(a,b]"' interval notation.
Replace target column(s):
If set the columns in the include list are replaced by the binned columns otherwise columns named with suffix '[binned]' are appended.

Ports

Input Ports
0 Data to be categorized
Output Ports
0 Data with bins defined
1 The PMML Model fragment containing information how to bin
This node is contained in KNIME Base Nodes provided by KNIME GmbH, Konstanz, Germany.