PADI-web: ASF corpora

Both corpora (news articles) have been manually collected using the query "african swine fever outbreak" with Google. These corpora in English have been semi-automatically normalized. They can be used as (a) input of BioTex tool in order to extract terminology, (b) input of Weka tool for data-mining tasks. Description: (1) ASFcorpus_epidemio.txt: 69 news about epidemiology aspects. The news contain a principal information of suspicion or confirmation of ASF, unknown disease or unexplained clinical signs in animals of the pig species, with a description of the event, such as place, time, number and species affected and clinical signs place, time, number and species affected and clinical signs (period: 2012-2013). (2) ASFcorpus_eco.txt: 69 news about socio-economic impact of an ASF outbreak to a country or a region, and a secondary information about the event (period: 2012-2014). (3) ASF_corpus_weka_final.arff: corpus (epidemio + socio-economic data) based on Weka format (ARFF file) for data mining tasks, e.g. classification.

Saved in:
Bibliographic Details
Main Authors: Roche, Mathieu, Arsevska, Elena
Language:English
Published: CIRAD Dataverse 2018
Subjects:Agricultural Sciences, Computer and Information Science, Medicine, Health and Life Sciences, Text Mining, Fouille de texte, Disease control, Controle de maladies,
Online Access:http://dx.doi.org/10.18167/DVN1/POIZMA
Tags: Add Tag
No Tags, Be the first to tag this record!