The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at http://knime.com.

Document Grabber

Downloads document from a certain database which can be specified in the dialog, i.e.: PubMed. After sending the specified query to the database and downloading the resulting documents, the documents will be parsed and deleted if it is specified in the dialog.

Dialog Options

Query
The query which is send to the specified database.
Number of results
After a click at the button, the number of results related to the specified query will be shown.
Database
The database to send the query to and receive the resulting documents from, i.e.: PubMed.
Maximal results
The number of maximal resulting documents to download and parse.
Documents directory
The directory to save the documents to. The specified directory must exist, be writable and empty.
Delete after parsing
If checked, the files containing the documents will be deleted after parsing.
Document category
The category of the documents.
Document type
The type of the documents.
Extract meta information if provided by database
If checked, meta information is extracted if provided by database. In case of PubMed the meta information consists of PubMed ID, the chemical list, and the mesh heading list assigned to the article. The meta information is stored as a regular section in the documents, annotated as meta information section.

Ports

Output Ports
0 An output table which contains the parsed document data.
This node is contained in KNIME Textprocessing Plug-in provided by KNIME GmbH, Konstanz, Germany.