The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at http://knime.com.

Statistics

This node calculates statistical moments such as minimum, maximum, mean, standard deviation, variance, median, overall sum, number of missing values and row count across all numeric columns, and counts all nominal values together with their occurrences. The dialog offers two options for choosing the median and/or nominal values calculations:

Dialog Options

Compute median values
Select this option if for all numeric columns the medians are computed. Note, this computation might be expensive, since it requires to sort all column independently to find the values that divides the distribution into two halves of the same number of values.
Column filter
Filter columns for counting all possible values.
Nominal values
Adjusts the number of counts for both, top number of frequent and infrequent occurrences of categorical values per column (displayed in the node view!).
Nominal values in output
Adjusts the maximum number of possible values per column in the nominal output table.

Ports

Input Ports
0 Table from which to compute statistics.
Output Ports
0 Table with numeric values.
1 Table with all nominal values and their counts.

Views

Statistics View
Displays all statistic moments (for all numeric columns) and nominal values (for all categorical columns).
This node is contained in KNIME Base Nodes provided by KNIME GmbH, Konstanz, Germany.