The following node is available in the Open Source KNIME predictive analytics and data mining platform version 2.7.1. Discover over 1000 other nodes, as well as enterprise functionality at
http://knime.com.
Document Grabber
Downloads document from a certain database which can be specified in
the dialog, i.e.: PubMed. After sending the specified query to the
database and downloading the resulting documents, the documents
will be parsed and deleted if it is specified in the dialog.
Dialog Options
- Query
-
The query which is send to the specified database.
- Number of results
-
After a click at the button, the number of results related to the
specified query will be shown.
- Database
-
The database to send the query to and receive the resulting
documents from, i.e.: PubMed.
- Maximal results
-
The number of maximal resulting documents to download and parse.
- Documents directory
-
The directory to save the documents to. The specified directory
must exist, be writable and empty.
- Delete after parsing
-
If checked, the files containing the documents will be deleted
after parsing.
- Document category
-
The category of the documents.
- Document type
-
The type of the documents.
- Extract meta information if provided by database
-
If checked, meta information is extracted if provided by database.
In case of PubMed the meta information consists of PubMed ID, the
chemical list, and the mesh heading list assigned to the article.
The meta information is stored as a regular section in the
documents, annotated as meta information section.
Ports
Output Ports
0 |
An output table which contains the parsed document data.
|
This node is contained in KNIME Textprocessing Plug-in
provided by KNIME GmbH, Konstanz, Germany.