The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at http://knime.com.
Computes the inverse category frequency (icf) of each term according to
the given set of documents, categories of documents respectively,
and adds a column containing the icf value.
The value for term t is computed as follows:
icf(t) = log(1 + (f(C) / f(c, t)),
where f(C) is the number of all categories and f(c, t) is the number of
categories containing term t.
0 | The input table which contains terms and documents. |
0 | The output table which contains terms documents and a corresponding frequency value. |