Difference between revisions of "Workflow : Information Extraction From Query"

From Anote2Wiki
Line 5: Line 5:
  
 
The Information Extraction From Query Workflow allows you to set up some tasks over the results of a query, including the Corpus creation (mandatory) and NER Process and RE Processes (optionally).  
 
The Information Extraction From Query Workflow allows you to set up some tasks over the results of a query, including the Corpus creation (mandatory) and NER Process and RE Processes (optionally).  
To run the workflow, you must select a Query object on the Clipboard and right click '''Workflow ->  Information Extraction (From a query)''' or click on the '''Workflow Information Extraction''' button in the Query View.
+
To run the workflow, you must select a Query object on the Clipboard and right click '''Workflow ->  Information Extraction (From query)''' or click on the '''Workflow Information Extraction''' button in the Query View.
 +
 
  
 
[[File:Workflow_Syngenta1.png||1500px|center]]
 
[[File:Workflow_Syngenta1.png||1500px|center]]
 +
  
 
== Select Query Publications ==
 
== Select Query Publications ==
  
The first step of the workflow is to select candidate publications to create a Corpus. If the user has already selected publications using Query View this information already be present.
+
The first step of the workflow is to select candidate publications to create a Corpus. If you have already selected publications using the Query View this information will already be present.
After selecting publication you must press '''next Button'''
+
After selecting the desired publications, press the '''Next'''
 +
 
  
 
[[File:Workflow_Syngenta2.png|800px|center]]
 
[[File:Workflow_Syngenta2.png|800px|center]]
 +
  
 
== Select Steps ==
 
== Select Steps ==
  
The next step is to determine how far will the Workflow. The Corpus Creation are mandatory but NER Process and RE process applied to Corpus could be performed.   
+
The next step is to determine the steps that will be executed by the workflow. The Corpus creation is mandatory but NER and RE processes can also be applied to this Corpus in subsequent operations.   
You must select how far you go in workflow and press '''next button'''
+
After selecting the steps,  press '''next ''' to continue.
 +
 
  
 
[[File:Workflow_Syngenta3.png|800px|center]]
 
[[File:Workflow_Syngenta3.png|800px|center]]
 +
  
 
== Create Corpus ==
 
== Create Corpus ==
Line 27: Line 33:
 
(Mandatory)
 
(Mandatory)
  
The next step are to configure the corpus creation, here you have to select the name of the Corpus and select the type of corpus.
+
The next step is to configure the corpus creation, where you have to select the name of the Corpus and its type.
  
 
=== Corpus Type ===
 
=== Corpus Type ===
Line 35: Line 41:
 
''' Full Text :''' Only publications with full Text / PDF will be considered.
 
''' Full Text :''' Only publications with full Text / PDF will be considered.
  
''' Retrieve PDF :''' Only publications with full Text / PDF will be considered, and a Journal Retrieval Process to all select document are launched (after configuration steps)
+
''' Retrieve PDF :''' Only publications with full Text / PDF will be considered, and a Journal Retrieval Process will be launched to collect all selected documents
 +
 
  
 
[[File:Workflow_Syngenta4.png|center|800px]]
 
[[File:Workflow_Syngenta4.png|center|800px]]
 +
  
 
[[File:Workflow_Syngenta4b.png|center|800px]]
 
[[File:Workflow_Syngenta4b.png|center|800px]]
  
If you just select corpus creation your workflows process starts after clinking '''ok button'''. When process over you can see results in [[Workflow_:_Information_Extraction_From_Query#Workflow_Report|Workflow_Report]]
 
  
Otherwise press '''next button ''' to proceed to NER configuration.
+
If you just have selected corpus creation, the processing of your data will start after clicking '''ok'''.
 +
When this process finishes, you can see the results in the [[Workflow_:_Information_Extraction_From_Query#Workflow_Report|Workflow_Report]]
 +
 
 +
Otherwise, if other steps are included, press '''next''' to proceed to NER configuration.
  
 
== Select NER Process ==
 
== Select NER Process ==
Line 49: Line 59:
 
(Optional)
 
(Optional)
  
The next step is configuration a NER Process. Using combo box the must select NER process that servers your efforts.  
+
The next step is the configuration of an NER Process. Using the combo box, select the NER process configuration that is more suitable.  
  
 
=== NER - Based in Lexical Resources ===
 
=== NER - Based in Lexical Resources ===
  
You can select a NER Based in lexical resources. The configuration have two panel Basics an Advanced:
+
You can select a NER Based in lexical resources. The configuration has two panels: Basics an Advanced:
  
==== Basic Option ====
+
==== Basic ====
 +
 
 +
Here you can select one or more dictionaries to use as resources in the NER.
  
Here you must select one or many dictionaries to run NER.
 
  
 
[[File:Workflow_Syngenta5.png|center|800px]]
 
[[File:Workflow_Syngenta5.png|center|800px]]
  
==== Advance Option ====
 
  
Expert User can configure some advance options. This options are based in NER - Lexical Resource [[Corpus_Create_Annotation_Schema_By_NER_Lexical_Resources|NER - Lexical Resources]]
+
==== Advanced Option ====
 +
 
 +
Expert users can configure some advanced options. These options are based in NER - Lexical Resource: check details in [[Corpus_Create_Annotation_Schema_By_NER_Lexical_Resources|NER - Lexical Resources]]
 +
 
  
 
[[File:Workflow_Syngenta5b.png|center|800px]]
 
[[File:Workflow_Syngenta5b.png|center|800px]]
  
If you just select NER PRocess, workflows process starts after clinking '''ok button'''. When process over you can see results in Workflow_Report
 
  
Otherwise press '''next button''' to proceed to RE configuration.
+
If your workflow terminates in the NER process, the data processing will start after clicking '''ok'''.
 +
When the process finishes the processing, you can see results the in the [[Workflow_:_Information_Extraction_From_Query#Workflow_Report|Workflow_Report]]
 +
 
 +
Otherwise, if an RE process will be conducted, press '''next''' to proceed to its configuration.
  
 
== Select RE Process ==
 
== Select RE Process ==
Line 75: Line 90:
 
(Optional)
 
(Optional)
  
The next step is configuration a REProcess. Using combo box the must select Re process that servers your efforts.  
+
The next step is the configuration of an REProcess. Using the combo box select the RE process that best serves your needs.  
  
 
=== RE Based in POS-Tagging ===
 
=== RE Based in POS-Tagging ===
  
You can select a RE Based Natural Language processing. The configuration have two panel Basics an Advanced.  
+
You can select an RE Based in Natural Language processing. The configuration has two panels: Basic an Advanced.  
  
==== Basic Option ====
+
==== Basic ====
 +
 
 +
Here you can select the relation model
  
Here you must select the relation model
 
  
 
[[File:Workflow_Syngenta6.png|center|800px]]
 
[[File:Workflow_Syngenta6.png|center|800px]]
  
==== Advance Option ====
 
  
Expert User can configure some advance options. This options are based in RE [[Corpus_Relation_Extraction|Relation Extraction]]
+
==== Advanced ====
 +
 
 +
Expert users can configure some advanced options. These options are based in the RE operations detailed in: [[Corpus_Relation_Extraction|Relation Extraction]]
 +
 
  
 
[[File:Workflow_Syngenta6b.png|center|800px]]
 
[[File:Workflow_Syngenta6b.png|center|800px]]
 +
  
 
== Processing ==
 
== Processing ==
  
As the process proceeds the user can see the status of activity looking to progress bar
+
As the process proceeds you can check the status of activity looking at the progress bar
 +
 
  
 
[[File:Workflow_Syngenta7.png|center]]
 
[[File:Workflow_Syngenta7.png|center]]
 +
  
 
== Workflow Report ==
 
== Workflow Report ==
  
As a result of the process appears a [[Workflow_report]]
+
As a result of the process, a [[Workflow_report]] is shown.

Revision as of 13:14, 16 April 2013

Operation

The Information Extraction From Query Workflow lets you set up a sequence of tasks over the results of a query: Corpus creation (mandatory) and, optionally, NER and RE processes. To run the workflow, select a Query object on the Clipboard, right-click it and choose Workflow -> Information Extraction (From query), or click the Workflow Information Extraction button in the Query View.


Workflow Syngenta1.png


Select Query Publications

The first step of the workflow is to select candidate publications to create a Corpus. If you have already selected publications using the Query View, this information will already be present. After selecting the desired publications, press the Next button.


Workflow Syngenta2.png


Select Steps

The next step is to choose which steps the workflow will execute. Corpus creation is mandatory, but NER and RE processes can also be applied to this Corpus in subsequent operations. After selecting the steps, press next to continue.


Workflow Syngenta3.png
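
The step selection can be pictured as choosing the last step to run. The sketch below is only an illustration of that idea, not part of the tool: the names Step, planned_steps and button_for are hypothetical, and it assumes the steps always run in the order Corpus, NER, RE, so that choosing RE implies NER.

```python
from enum import IntEnum


class Step(IntEnum):
    """Hypothetical labels for the three workflow steps, in execution order."""
    CORPUS = 1  # mandatory
    NER = 2     # optional
    RE = 3      # optional; assumed here to be configured after NER


def planned_steps(last_step: Step) -> list:
    """Return every step executed when the workflow stops at last_step."""
    return [step for step in Step if step <= last_step]


def button_for(current: Step, last_step: Step) -> str:
    """On the last selected step 'ok' starts processing; otherwise 'next' moves on."""
    return "ok" if current == last_step else "next"
```

For example, planned_steps(Step.NER) yields the Corpus and NER steps, and button_for(Step.CORPUS, Step.NER) is "next" because the NER configuration still follows.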


Create Corpus

(Mandatory)

The next step is to configure the corpus creation, where you have to select the name of the Corpus and its type.

Corpus Type

Abstract : Only publications with abstracts will be considered.

Full Text : Only publications with full Text / PDF will be considered.

Retrieve PDF : Only publications with full Text / PDF will be considered, and a Journal Retrieval Process will be launched to collect all selected documents.
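
As a rough illustration of how the three corpus types differ, the sketch below filters a list of publications by type. Everything in it is an assumption made for the example: the Publication fields, the select_for_corpus function and the returned retrieval flag are hypothetical and do not describe how the tool is implemented.

```python
from dataclasses import dataclass


@dataclass
class Publication:
    """Hypothetical minimal publication record used only for this example."""
    title: str
    abstract: str = ""
    has_full_text: bool = False


def select_for_corpus(publications, corpus_type):
    """Return (publications kept, whether a retrieval process would be launched).

    Follows the descriptions above: Abstract keeps publications with an
    abstract; Full Text and Retrieve PDF keep publications with full text,
    and only Retrieve PDF also triggers a retrieval process.
    """
    if corpus_type == "Abstract":
        return [p for p in publications if p.abstract], False
    if corpus_type == "Full Text":
        return [p for p in publications if p.has_full_text], False
    if corpus_type == "Retrieve PDF":
        return [p for p in publications if p.has_full_text], True
    raise ValueError(f"Unknown corpus type: {corpus_type}")
```

With two publications, one having only an abstract and one having full text, the Abstract type keeps the first and the Full Text type keeps the second.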


Workflow Syngenta4.png


Workflow Syngenta4b.png


If you have selected only corpus creation, the processing of your data will start after clicking ok. When this process finishes, you can see the results in the Workflow_Report.

Otherwise, if other steps are included, press next to proceed to NER configuration.

Select NER Process

(Optional)

The next step is the configuration of an NER Process. Using the combo box, select the NER process configuration that is most suitable.

NER - Based in Lexical Resources

You can select an NER based on lexical resources. The configuration has two panels: Basic and Advanced.

Basic

Here you can select one or more dictionaries to use as resources in the NER.


Workflow Syngenta5.png


Advanced Option

Expert users can configure some advanced options. These options are the same as in the lexical-resource NER; for details, see NER - Lexical Resources.


Workflow Syngenta5b.png


If your workflow ends with the NER process, the data processing will start after clicking ok. When the processing finishes, you can see the results in the Workflow_Report.

Otherwise, if an RE process is also to be run, press next to proceed to its configuration.

Select RE Process

(Optional)

The next step is the configuration of an RE Process. Using the combo box, select the RE process that best serves your needs.

RE Based in POS-Tagging

You can select an RE based on Natural Language Processing. The configuration has two panels: Basic and Advanced.

Basic

Here you can select the relation model.


Workflow Syngenta6.png


Advanced

Expert users can configure some advanced options. These options are based on the RE operations detailed in Relation Extraction.


Workflow Syngenta6b.png


Processing

As the process proceeds, you can check its status by looking at the progress bar.


Workflow Syngenta7.png


Workflow Report

As a result of the process, a Workflow_report is shown.