The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at http://knime.com.
Reads a well formed XML document. An XPath query can be specified to read only a portion of the file. In this case the output will be the nodes in the document that match the XPath query.
Only nodes of the document, which match this XPath query, will be read. Each matching node is read in a single data cell.
Note, that XPath requires to explicitly denote namespaces. E.g. to
read only the body of a XHTML document you can use the XPath query:
/dns:html/dns:body
where dns is the prefix of the namespace defined in the
Namespaces table.
A limited XPath syntax is supported. Only absolute paths to
nodes can be defined. Among the XPath operators the |-Operator
is supported. It can be used to read e.g. the head and the body of
an XHTML document in single cells:
/dns:html/dns:head | /dns:html/dns:body
This option is useful when you do not have the default namespace of your document at hand.
If checked, the namespace of the root element is added to the Namespaces table during runtime. Please define a prefix for this namespace in Prefix of root's namespace.
For the example of XHTML documents the namespace of the root element is http://www.w3.org/1999/xhtml so that with the root's prefix of dns you can leave the Namespaces table empty.
0 | The complete XML document in a single data cell or the nodes matching the XPath query if XPath filtering is checked. |