Difference between revisions of "Corpora"

From Anote2Wiki
Jump to: navigation, search
(Views:)
(User HowTOs)
 
(21 intermediate revisions by 2 users not shown)
Line 3: Line 3:
 
== About Corpora Plug-in ==
 
== About Corpora Plug-in ==
  
Central @Note2 plug-in that Define Corpora (Corpus Set) perspective and possibility the integration of extraction processes ( like entity extraction or relation extraction processes).
+
Plug-in that defines central data-types for Corpora (Corpus Set). All information extraction processes are applied over a Corpus in @Note2. A Corpus is a set of documents that could be annotated with entities/events in IEProcesses. In this plug-in is already some standard View for Annotated documents.
Which Corpus, a set of document, could be composed for processes. Which Corpus Process have Annotation for a document ( entity/event annotations).
 
In this plug-in is already some standard View for Annotated documents.
 
  
 
== User HowTOs ==
 
== User HowTOs ==
 +
 +
[[Corpora Load Corpus]]
 +
 +
[[Corpora Remove Corpus]]
 +
 +
[[Corpus Load Process]]
 +
 +
[[Corpus Remove Process]]
 +
 +
[[Annotated Document Default View]]
 +
 +
[[Process Entity Details View]]
 +
 +
[[Process Load Annotated Document]]
 +
 +
[[Create Corpus By Publication Manager]]
 +
 +
[[Process Relations view]]
 +
 +
[[Process Relations Resume Stats]]
 +
 +
[[Merging NER Schemas]]
 +
 +
[[REProcess Export To XGMML File]]
 +
 +
[[Change Class Colors]]
  
 
== MVC AIBench Model: ==
 
== MVC AIBench Model: ==
Line 13: Line 37:
 
=== '''''Data-types:''''' ===
 
=== '''''Data-types:''''' ===
  
'''''Corpora''''': Represents Corpora and contains a Corpus Set. Contains methods for Corpus database management.  
+
'''''Corpora''''': Represents Corpora and contains a Corpus Set. Contains methods for Corpus database management.
  
'''''Corpus''''':  Represent a set of publications. Contain information about Corpus properties, name, description, database id and list of IEPRocess applied to corpus.  
+
'''''Corpus''''':  Represents a set of publications. Contains information about Corpus properties, name, description, database id and lists of IEPRocess applied to corpus.
  
'''''NERDocumentAnnotation''''':  Contains information about document entities annotation resulted by NER processes.  
+
'''''NERDocumentAnnotation''''':  Contains information about document entities annotations resulting from NER processes..  
  
'''''REDocumentAnnotation''''':  Contains information about document entities and event annotation resulted by RE processes.  
+
'''''REDocumentAnnotation''''':  Contains information about document entities and event annotation resulting from RE processes.  
  
'''''NERProcess''''':  Represent a NER Process and contain a set of NERDocumentAnnotation.  
+
'''''NERProcess''''':  Represents a NER Process and contains a set of ''NERDocumentAnnotation''.
  
'''''REProcess''''':  Represent a RE Process and contain a set of REDocumentAnnotation.
+
'''''REProcess''''':  Represents a RE Process and contains a set of ''REDocumentAnnotations''.
  
 
=== '''''Operations:''''' ===
 
=== '''''Operations:''''' ===
Line 30: Line 54:
  
 
'''''CreateCorpusOperationByPublicationManager''''': Operation that permits corpus creation deriving for Queries of Publication Manager.
 
'''''CreateCorpusOperationByPublicationManager''''': Operation that permits corpus creation deriving for Queries of Publication Manager.
 +
 +
'''''NERAnnotationsMergeOperation''''': Operation for merging NER Schemas from Corpus.
  
 
'''''ExitOperation''''': Plug-in exit Operation.
 
'''''ExitOperation''''': Plug-in exit Operation.
  
'''''InitProject''''': Plug-in start operation.  
+
'''''InitProject''''': Plug-in start operation.
 +
 
 +
'''''LoadCorpusStatus:''''' Load Corpora Session.
 +
 
 +
'''''SaveCorpusStatus:''''' Save Corpora Session.
  
 
=== '''''Views:''''' ===
 
=== '''''Views:''''' ===
  
'''''CorporaView''''': Corpora view for Corpus visualization. Permit creation of Corpus data-type on clipboard.
+
'''''CorporaView''''': Allows the visualization of Corpus data-types on the clipboard.  
 +
 
 +
'''''CorpusDocumentsView''''': Allows the visualization of the documents belonging to each corpus.
 +
 
 +
'''''CorpusProcessesView''''': Allows the visualization of the processes applied to each corpus.
 +
 
 +
'''''NERAnnotatedDocumentView''''': NERDocumentAnnotation View; allows checking the document entity annotations.
  
'''''CorpusDocumentsView''''': Corpus View for documents viewer.
+
'''''NERProcessAnnotationDocumentsView''''': NERProcess View of all document and the creation of NERDocumentAnnotation.
  
'''''CorpusProcessesView''''': Corpus View for processes viewer. Permits the selection off Processes and put it in clipboard.
+
'''''NERStatisticsView''''': NERProcess View that contains statistics for entities in the corpus.
  
'''''NERAnnotatedDocumentView''''': NERDocumentAnnotation View for document entity annotations visualization.  
+
'''''REAnnotatedDocumentView''''': REDocumentAnnotation View for document entity and event annotations.
  
'''''NERProcessAnnotationDocumentsView''''': NERProcess View of all document and the creation of NERDocumentAnnotation.
+
'''''REProcessAnnotationDocumentsView''''': REProcess View that contains statistics for entities in REProcess.
 
'''''NerStatisticsView''''': NERProcess View that contains statistics for NERProcess.
 
  
'''''REAnnotatedDocumentView''''': REDocumentAnnotation View for document entity and event annotations visualization.
+
'''''RERelationsViewer''''': REProcess view of all Relation present in the process.
  
'''''REProcessAnnotationDocumentsView''''': REProcess View of all document and the creation of REDocumentAnnotation.
+
'''''REProcessRelationsResumeStats''''' REProcess Relations main statistics.

Latest revision as of 16:12, 27 February 2013

About Corpora Plug-in

Plug-in that defines central data-types for Corpora (Corpus Set). All information extraction processes are applied over a Corpus in @Note2. A Corpus is a set of documents that could be annotated with entities/events in IEProcesses. In this plug-in is already some standard View for Annotated documents.

User HowTOs

Corpora Load Corpus

Corpora Remove Corpus

Corpus Load Process

Corpus Remove Process

Annotated Document Default View

Process Entity Details View

Process Load Annotated Document

Create Corpus By Publication Manager

Process Relations view

Process Relations Resume Stats

Merging NER Schemas

REProcess Export To XGMML File

Change Class Colors

MVC AIBench Model:

Data-types:

Corpora: Represents Corpora and contains a Corpus Set. Contains methods for Corpus database management.

Corpus: Represents a set of publications. Contains information about Corpus properties, name, description, database id and lists of IEPRocess applied to corpus.

NERDocumentAnnotation: Contains information about document entities annotations resulting from NER processes..

REDocumentAnnotation: Contains information about document entities and event annotation resulting from RE processes.

NERProcess: Represents a NER Process and contains a set of NERDocumentAnnotation.

REProcess: Represents a RE Process and contains a set of REDocumentAnnotations.

Operations:

ChangeClassColor: Operation for changing class color. The color serves to view multi-colors.

CreateCorpusOperationByPublicationManager: Operation that permits corpus creation deriving for Queries of Publication Manager.

NERAnnotationsMergeOperation: Operation for merging NER Schemas from Corpus.

ExitOperation: Plug-in exit Operation.

InitProject: Plug-in start operation.

LoadCorpusStatus: Load Corpora Session.

SaveCorpusStatus: Save Corpora Session.

Views:

CorporaView: Allows the visualization of Corpus data-types on the clipboard.

CorpusDocumentsView: Allows the visualization of the documents belonging to each corpus.

CorpusProcessesView: Allows the visualization of the processes applied to each corpus.

NERAnnotatedDocumentView: NERDocumentAnnotation View; allows checking the document entity annotations.

NERProcessAnnotationDocumentsView: NERProcess View of all document and the creation of NERDocumentAnnotation.

NERStatisticsView: NERProcess View that contains statistics for entities in the corpus.

REAnnotatedDocumentView: REDocumentAnnotation View for document entity and event annotations.

REProcessAnnotationDocumentsView: REProcess View that contains statistics for entities in REProcess.

RERelationsViewer: REProcess view of all Relation present in the process.

REProcessRelationsResumeStats REProcess Relations main statistics.