NER Model Creation

From Anote2Wiki
Revision as of 16:30, 8 July 2015 by RRodrigues


Select Option

To create a NER model with the BioTML Framework, start by loading a NER Process to the Clipboard.
Then right-click the NER Process and choose NER Process -> BioTML Tagger -> Create Model.


Create NER Model By BioTML Tagger.png


New Configuration or Load Configuration

A wizard is presented to configure the model creation. The first step offers two options: Create New BioTML Model Configuration or Load BioTML Model Configuration. To start a new model configuration, select Create New BioTML Model Configuration and press the Next button.


Select New or Load Configuration By BioTML Tagger.png


Select NLP Tokenizer System

A GUI is presented to select one of the NLP systems integrated in the BioTML framework. The selected system is used to perform the tokenization of all documents in order to create a data matrix for the machine learning algorithms.
The available NLP systems are ClearNLP, Stanford CoreNLP and OpenNLP. A description of each system is shown in this GUI.
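
To illustrate what the tokenization step produces, here is a minimal sketch in Python. It does not use the BioTML API or any of the three NLP systems. The regular expression tokenizer and the function names tokenize and to_rows are assumptions made only for this example, and a real NLP system will usually split text differently (for instance, it may keep "IL-2" as one token).

```python
import re

# Hypothetical stand-in for an NLP tokenizer. BioTML delegates this step to
# ClearNLP, Stanford CoreNLP or OpenNLP, whose output will differ from this.
TOKEN_PATTERN = re.compile(r"\w+|[^\w\s]")


def tokenize(text):
    """Return (token, start, end) tuples with character offsets."""
    return [(m.group(), m.start(), m.end()) for m in TOKEN_PATTERN.finditer(text)]


def to_rows(documents):
    """Flatten documents into one row per token: (doc_id, token, start, end)."""
    rows = []
    for doc_id, text in documents.items():
        for token, start, end in tokenize(text):
            rows.append((doc_id, token, start, end))
    return rows


docs = {"doc1": "IL-2 activates T cells."}
rows = to_rows(docs)
for row in rows:
    print(row)
```

Each row corresponds to one token and becomes one line of the data matrix. The character offsets are kept so that predicted annotations can later be mapped back to the original document text.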


Select NLP System By BioTML.png


BioTML Features Selection

The features used to build the machine learning data matrix are selected in this GUI. Depending on the number and type of features, the resulting model may fit the data more or less closely. This selection has a large impact on the prediction capability, recall and accuracy of the model during NER annotation.

Attention: A large number of features, and some feature types in particular, can dramatically increase memory and CPU usage!
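
The following Python sketch shows why the feature selection affects resource usage. It is not the BioTML feature set. The function token_features, the feature names and the window parameter are hypothetical and were chosen only to show how context-based features multiply the number of columns per token.

```python
def token_features(tokens, i, window=1):
    """Build a feature dict for tokens[i], including neighbours within `window`."""
    features = {}
    for offset in range(-window, window + 1):
        j = i + offset
        if 0 <= j < len(tokens):
            tok = tokens[j]
            # Lexical features generate a new column for every distinct value.
            features[f"word[{offset}]={tok.lower()}"] = 1
            features[f"suffix3[{offset}]={tok[-3:].lower()}"] = 1
            # Orthographic features add a fixed number of columns per offset.
            features[f"is_capitalized[{offset}]"] = int(tok[0].isupper())
            features[f"has_digit[{offset}]"] = int(any(c.isdigit() for c in tok))
    return features


tokens = ["IL", "-", "2", "activates", "T", "cells", "."]
narrow = token_features(tokens, 3, window=0)  # features of the token itself
wide = token_features(tokens, 3, window=2)    # plus two neighbours on each side
print(len(narrow), len(wide))
```

In this sketch, widening the context from the token alone to two neighbours on each side multiplies the active features per token by five. Lexical features such as the word form and suffix also add a new matrix column for every distinct value seen in the corpus, which is one way memory use can grow quickly.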


Select Features For BioTML.png


BioTML Model Algorithm Configuration

Select Algorithm Settings For BioTML.png