Difference between revisions of "Corpus Relation Extraction"

Revision as of 19:59, 16 January 2013


To perform a Relation Extraction (RE) process based on Natural Language Processing algorithms, right-click a Corpus data-type object and select the option Corpus -> RE -> Relation Extraction. If the Corpus is not in the Clipboard, begin by loading the Corpus to the Clipboard.


Corpus Process RE.png


A new wizard opens that lets you configure the RE process. The first panel lets you select the process that contains the annotated entities and view some of its statistics and properties. After selecting the process, press Next to continue.


RE1.png


The next panel is for POS-Tagger selection. Here, you select which POS-Tagger will be used; some information about its origin and properties is then presented. After choosing the desired POS-Tagger, press Next to continue.


RE2.png


The following panel is for Relation Extraction Model selection. Select the most suitable model (the panel below shows the expected type of results). After selecting the model, press Next.


RE3.png


The next panel lets you choose whether a Verb List will be used to filter the results and, if so, which list. You can select a list of verbs (a Lexical Words object) that will not be used to create relations. Typically, this is used to avoid relations based on common English verbs (e.g. be, do).


RE4.png
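The effect of the verb filter described above can be sketched in a few lines of Python. This is only an illustration of the idea, not @Note's implementation: the `Relation` class, the `filter_relations` function, and the sample entity names are all hypothetical, and the sketch assumes each candidate relation carries the lemma of its verb.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Relation:
    """Hypothetical candidate relation: two entities linked by a verb lemma."""
    left_entity: str
    verb: str
    right_entity: str


def filter_relations(relations, excluded_verbs):
    """Drop every candidate whose verb appears in the excluded verb list."""
    excluded = {v.lower() for v in excluded_verbs}
    return [r for r in relations if r.verb.lower() not in excluded]


# Made-up candidates; "be" and "do" play the role of the filter Verb List.
candidates = [
    Relation("geneA", "be", "proteinB"),
    Relation("geneA", "activate", "proteinB"),
    Relation("proteinC", "do", "geneD"),
]
kept = filter_relations(candidates, ["be", "do"])
print([r.verb for r in kept])  # prints ['activate']
```

Comparing lowercased lemmas is a design choice of this sketch; it keeps a list entry such as "Be" from missing the verb "be".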


The next panel lets you choose an additional Verb List. You can select a list of verbs (a Lexical Words object) to add as relation clues, i.e. they complement the verb list used internally by @Note. Once this option is set, press Ok.


RE5.png
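As a rough illustration of how an additional Verb List complements a built-in one, the sketch below merges two lists into a single set of relation clues and looks for them in a lemmatized sentence. It is a sketch under stated assumptions: the contents of @Note's internal verb list are not given here, so `internal_verbs` is made-up sample data, and `build_clue_verbs` and `find_clues` are hypothetical helper names.

```python
def build_clue_verbs(internal_verbs, additional_verbs=()):
    """Merge the built-in verbs with the user-supplied ones (case-insensitive)."""
    merged = set()
    for verb in list(internal_verbs) + list(additional_verbs):
        verb = verb.strip().lower()
        if verb:  # ignore empty entries
            merged.add(verb)
    return merged


def find_clues(lemmas, clue_verbs):
    """Return (position, lemma) pairs for every lemma that is a relation clue."""
    return [(i, lemma) for i, lemma in enumerate(lemmas) if lemma.lower() in clue_verbs]


# Made-up data: the real internal list used by the tool is not known here.
internal_verbs = ["activate", "inhibit"]
additional_verbs = ["Phosphorylate", "bind "]

clues = build_clue_verbs(internal_verbs, additional_verbs)
sentence_lemmas = ["kinaseA", "phosphorylate", "proteinB"]
print(find_clues(sentence_lemmas, clues))  # prints [(1, 'phosphorylate')]
```

Without the additional list, the same sentence yields no clue, which is the situation the extra Verb List is meant to cover.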


The RE operation then starts, and a progress window shows its execution. Depending on corpus size, the operation can take from a few minutes to several hours.

When the process finishes, a new RE Process object will be added to the Corpus Process View.