Entity Extraction in Biochemical Text using Multiobjective Optimization

In this paper we propose a multiobjective modified differential evolution based feature selection and classifier ensemble approach for biochemical entity extraction. The algorithm performs in two layers. The first layer concerns with determining an appropriate set of features for the task within the framework of a supervised statistical classifier, namely, Conditional Random Field (CRF). This produces a set of solutions, a subset of which is used to construct an ensemble in the second layer. The proposed approach is evaluated for entity extraction in chemical texts, which involves identification of IUPAC and IUPAC-like names and classification of them into some predefined categories. Experiments that were carried out on a benchmark dataset show the recall, precision and F-measure values of 86.15%, 91.29% and 88.64%, respectively.

Saved in:
Bibliographic Details
Main Authors: Sikdar,Utpal Kumar, Ekbal,Asif, Saha,Sriparna
Format: Digital revista
Language:English
Published: Instituto Politécnico Nacional, Centro de Investigación en Computación 2014
Online Access:http://www.scielo.org.mx/scielo.php?script=sci_arttext&pid=S1405-55462014000300013
Tags: Add Tag
No Tags, Be the first to tag this record!