Difference between revisions of "Workflow : Information Retrieval and Extraction"

Revision as of 17:38, 16 April 2013

Operation

The Information Retrieval and Extraction Workflow lets you set up several tasks in @Note: Journal Retrieval (mandatory) and, optionally, Journal Crawling, Corpus Creation, and NER and RE processes.

To run the workflow, select the option Workflow -> Information Retrieval and Extraction on the Menu Bar.

Workflow1.png


Select Steps

The next step is to choose the tasks that the workflow will execute. The PubMed Search is mandatory; Corpus Creation and the NER and RE processes are optional and can also be applied to the resulting corpus in subsequent operations. After selecting the tasks, press Next to continue.
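The selection rule above can be illustrated with a minimal Python sketch. It is not part of @Note: the `Step` enum, the `MANDATORY` set and the `plan_workflow` function are hypothetical names introduced here, and the execution order is assumed to follow the order in which this guide presents the steps.

```python
from enum import Enum


class Step(Enum):
    # Hypothetical step identifiers; values encode the order used in this guide.
    PUBMED_SEARCH = 1
    CORPUS_CREATION = 2
    NER = 3
    RE = 4


# Assumption taken from the text: only the PubMed Search is mandatory.
MANDATORY = {Step.PUBMED_SEARCH}


def plan_workflow(selected):
    """Return the chosen steps in guide order, always including mandatory ones."""
    chosen = set(selected) | MANDATORY
    return sorted(chosen, key=lambda step: step.value)


if __name__ == "__main__":
    # The user ticks only the two optional steps; the search is added automatically.
    for step in plan_workflow([Step.RE, Step.CORPUS_CREATION]):
        print(step.name)
```

Treating the mandatory step as a set union keeps the function safe to call with any selection, including an empty one.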

Workflow 2.png


PubMed Search

Create Corpus

The next step is to configure the corpus creation, where you have to select the name of the Corpus and its type.

Corpus Type

Abstract : Only publications with abstracts will be considered.

Full Text : Only publications with full text / PDF will be considered.

Retrieve PDF : Only publications with full text / PDF will be considered, and a Journal Retrieval Process will be launched to collect all selected documents.

Workflow4.png
Workflow4b.png

If you have selected only the task of corpus creation, the processing of your data will start after clicking OK. When this process finishes, you can see the results in the Workflow Report.

Otherwise, if other steps are included, press Next to proceed to NER configuration.
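The OK / Next rule described above can be sketched as a small Python function. This is an illustration only, not @Note code: the page names and the `closing_button` function are hypothetical, and the sketch assumes that configuration pages appear in the order corpus, NER, RE and that the last selected page is the one that starts processing.

```python
# Hypothetical page names; the order mirrors the walkthrough in this guide.
PAGE_ORDER = ["corpus", "ner", "re"]


def closing_button(page, selected):
    """Return "ok" if `page` is the last selected configuration page, else "next"."""
    pages = [p for p in PAGE_ORDER if p in selected]
    if page not in pages:
        raise ValueError(f"page {page!r} was not selected")
    return "ok" if page == pages[-1] else "next"


if __name__ == "__main__":
    # Only corpus creation selected: OK starts the processing.
    print(closing_button("corpus", {"corpus"}))
    # Corpus creation plus NER: Next leads to the NER configuration.
    print(closing_button("corpus", {"corpus", "ner"}))
```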

Select NER Process

(Optional)

The next step is to configure a NER process. Use the combo box to select the NER process that best suits your needs.

NER - Based in Lexical Resources

You can select a NER process based on lexical resources. The configuration has two panels, Basic and Advanced:

Basic Option

Here you must select one or more dictionaries to run the NER process.

Workflow5.png

Advanced Option

Expert users can configure some advanced options. These options are based on NER - Lexical Resources.

Workflow5b.png

If you have selected only the NER process, the workflow starts after clicking OK. When the process finishes, you can see the results in the Workflow Report.

Otherwise, press Next to proceed to RE configuration.

Select RE Process

(Optional)

The next step is to configure an RE process. Use the combo box to select the RE process that best suits your needs.

RE Based in POS-Tagging

You can select an RE process based on natural language processing. The configuration has two panels, Basic and Advanced.

Basic Option

Here you must select the relation model.

Workflow6.png

Advanced Option

Expert users can configure some advanced options. These options are based on RE Relation Extraction.

Workflow6b.png

Processing

As the process runs, you can follow its status on the progress bar.

Workflow7.png

Workflow Report

When the process finishes, a Workflow Report is displayed.