Workflow : Information Retrieval and Extraction Basic

From Anote2Wiki
Jump to: navigation, search

Pre-Configuration

The Information Retrieval and Extraction Basic workflow is a specific workflow constructed for less user interaction. Unlike the other workflows, here you cannot configure the Corpus, NER and RE processes during the wizard configuration. Changes in these configurations need to be done in the default settings, using the Preferences option in the Settings Menu and selecting Workflow -> Basics.


Workflow Basics Preconfiguration.png


Corpus

Corpus Name: Corpus Name

Corpus Type

Abstract : Only publications with abstracts will be considered.

Full Text : Only publications with full Text / PDF will be considered.

Hybrid : Publications with either full text or abstract will be considered.

Retrieve PDF : Only publications with full Text / PDF will be considered, and a Journal Retrieval Process will be launched to collect all selected documents


Workflow Basics Preconfiguration.png


NER

Using the combo box in Workflow -> Basics -> NER, you can select the NER process configuration that is more suitable.


Workflow Settings NER Selection.png


Using Workflow -> Basics -> NER -> [NER Process] you can configure the NER default options

Example:


Workflow Settings NER Selection Example.png


RE

Using the combo box in Workflow -> Basics -> RE, you can select the RE process configuration that is more suitable.


Workflow Settings RE Selection.png


Using Workflow -> Basics -> RE -> [RE Process], you can configure the RE default options

Example:


Workflow Settings RE Selection Example.png

Operation

The basic workflow for Information Retrieval and Extraction allows you to set up some tasks in @Note, including the Journal Retrieval, Corpus creation, NER and RE Processes with a minimal set of configuration steps.

To run the workflow, you must select the option Workflow -> Information Retrieval and Extraction Basic on the Menu Bar.


Workflow Basics1.png


Step 1: Pubmed Search

in the first panel, you can select a basic query or an advanced query to submit to Pubmed Search.

Basic search

In the Basic search tab, you can only select keywords and organism.


Workflow Basic 2a.png


Advanced Search

In Advanced search, you can restrict the search to a specific organism or keywords and can also select the name of an author, a journal, the type of article, if the article is present in PubMed Central or Medline, if full text is available or select a publication date range.


Workflow Basic 2b.png


Step 2: Dictionaries Selection

In the next panel, select dictionaries, dictionaries can be added to the set that will be used in the NER process. When all dictionaries are selected, press Next.


Workflow Basic 3.png


Step 3: Class(es) Selection

In the next panel, for all dictionaries previously selected you can filter for classes. Pressing Next, the Workflow is started.


Workflow Basic 4.png


Processing

As the process proceeds you can check the progress status looking at the bar.


Workflow7.png


Workflow Report

As a result of the process, a Workflow_report is shown.