Difference between revisions of "Corpus Create Annotation Schema By NER Lexical Resources"


Revision as of 13:06, 18 June 2012


After loading a Corpus to the Clipboard, the user can run a new NER (named entity recognition) process based on Lexical Resources, optionally reusing the settings of a NER process that was already performed (same resources and same options).

With the Corpus selected, the user must press the right mouse button and select Corpus -> NER -> Lexical Resources.

Corpus Process NER ANote.png

A wizard will be presented. The first step offers two options: create a new process (New Configuration) or Load Configuration from a process that was already performed. The user must select New Configuration and press the Next button.

NER ANote Wizard1.png


In the next step, the lexical resources for the NER must be selected. Here, dictionaries, lookup tables, rule sets and ontologies can be imported for the process. One important restriction applies: the user can select only one resource of each type. To use more than one, the user must first merge the resources in the Resources Plug-in. After selecting the lexical resources, continue by pressing the Next button.


NER ANote Wizard1a.png


NER ANote Wizard1b.png

In the next step, for each lexical resource the user must select the classes to be used for entity annotation.
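
As a rough illustration of what class selection amounts to, the sketch below reduces a dictionary to the chosen classes before any matching takes place. It is not code from the tool: the dictionary layout (term mapped to class name), the class names and the `select_classes` helper are all assumptions made for this example.

```python
# Hypothetical dictionary resource: term -> class name.
# This layout is an assumption for illustration, not the tool's data model.
dictionary = {
    "glucose": "Compound",
    "lactate": "Compound",
    "hexokinase": "Enzyme",
    "Escherichia coli": "Organism",
}


def select_classes(resource, wanted):
    """Keep only the terms whose class is one of the selected classes."""
    return {term: cls for term, cls in resource.items() if cls in wanted}


# Only terms of the selected classes remain available for annotation.
selected = select_classes(dictionary, {"Compound", "Enzyme"})
print(sorted(selected))
```

Terms belonging to classes that were not selected are simply never offered to the matching step, so they cannot produce annotations.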

NER ANote Wizard2.png

Next, the Stop Words GUI appears. Here the user can select a list of stop words (Lexical Words Set - Lexical Resources) to be used by the NER algorithm. Stop words prevent the algorithm from annotating common English words as entities, which removes false positive annotations.
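
The effect of a stop word list on dictionary-based matching can be sketched as follows. This is a minimal illustration under stated assumptions, not the tool's actual NER algorithm: the `annotate` function, the single-token matching and the example resources are hypothetical.

```python
import re

# Hypothetical resources for illustration only.
dictionary = {"lead": "Compound", "glucose": "Compound"}
stop_words = {"can", "lead", "the", "to"}


def annotate(text, dictionary, stop_words=frozenset()):
    """Return (term, class, start offset) for each dictionary hit that is not a stop word."""
    annotations = []
    for match in re.finditer(r"[A-Za-z]+", text):
        token = match.group().lower()
        if token in stop_words:
            continue  # skip common words even if a dictionary lists them
        if token in dictionary:
            annotations.append((match.group(), dictionary[token], match.start()))
    return annotations


text = "Stress can lead to glucose accumulation."
print(annotate(text, dictionary))              # "lead" is annotated as a Compound
print(annotate(text, dictionary, stop_words))  # only "glucose" is annotated
```

Without the stop word list, the verb "lead" is matched against the dictionary entry for the metal and becomes a false positive; with the list, it is skipped before the lookup.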

NER ANote Wizard3.png

After all the configurations have been made, press the Ok button. The NER operation then starts and a small window appears, indicating that the operation is running. The NER operation may take from a few minutes to several hours.

When the process is finished, a new Process object will be added to the Corpus Process View.