Discovering the spatial coverage of the documents through the SpatialCIM Methodology.
The main focus of this paper is to present the SpatialCIM methodology to identify the spatial coverage of the documents in the Brazilian geographic area. This methodology uses a linguistic tool to assist in the entity recognition process. The linguistic tool classifies the recognized entities as person, organization, time and localization, among others. The localization entities are checked using a geographic information system (GIS) in order to extract the Brazilian entity geographic paths. If there are multiple geographic paths for a single entity, the disambiguation process is carried out. This process attempts to locate the best geographic path for an entity considering all the geographic entities in the text. Another important objective of this paper is to show that the disambiguation process improves the geographic classification of the documents considering the obtained geographic paths. The validation process considers a set of news previously labeled by an expert and compared with the results of the disambiguated and non-disambiguated geographic paths. The results showed that the disambiguation process improves the classification compared with the classification without disambiguation. Keywords: Ambiguity problem resolution, spatial coverage identification, toponym resolution.
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Anais e Proceedings de eventos biblioteca |
Language: | English eng |
Published: |
2013-02-06
|
Subjects: | Cobertura espacial, Ambiguidade, Ferramenta lingüística, Spatial coverage identification, Ambiguity problem resolution, Toponym resolution, |
Online Access: | http://www.alice.cnptia.embrapa.br/alice/handle/doc/948445 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
dig-alice-doc-948445 |
---|---|
record_format |
koha |
spelling |
dig-alice-doc-9484452017-08-15T23:40:31Z Discovering the spatial coverage of the documents through the SpatialCIM Methodology. VARGAS, R. N. P. REZENDE, S. de O. MOURA, M. F. SPERANZA, E. A. RODRIGUEZ, E. ROSA NATHALIE PORTUGAL VARGAS, ICMC/USP; SOLANGE DE OLIVEIRA REZENDE, ICMC/USP; MARIA FERNANDA MOURA, CNPTIA; EDUARDO ANTONIO SPERANZA, CNPTIA; ERCILIA RODRIGUEZ. Cobertura espacial Ambiguidade Ferramenta lingüística Spatial coverage identification Ambiguity problem resolution Toponym resolution The main focus of this paper is to present the SpatialCIM methodology to identify the spatial coverage of the documents in the Brazilian geographic area. This methodology uses a linguistic tool to assist in the entity recognition process. The linguistic tool classifies the recognized entities as person, organization, time and localization, among others. The localization entities are checked using a geographic information system (GIS) in order to extract the Brazilian entity geographic paths. If there are multiple geographic paths for a single entity, the disambiguation process is carried out. This process attempts to locate the best geographic path for an entity considering all the geographic entities in the text. Another important objective of this paper is to show that the disambiguation process improves the geographic classification of the documents considering the obtained geographic paths. The validation process considers a set of news previously labeled by an expert and compared with the results of the disambiguated and non-disambiguated geographic paths. The results showed that the disambiguation process improves the classification compared with the classification without disambiguation. Keywords: Ambiguity problem resolution, spatial coverage identification, toponym resolution. 2013-02-06T11:11:11Z 2013-02-06T11:11:11Z 2013-02-06 2012 2020-01-22T11:11:11Z Anais e Proceedings de eventos In: AGILE INTERNATIONAL CONFERENCE ON GEOGRAPHIC INFORMATION SCIENCE, 15., 2012, Avignon. Bridging the geographic information sciences: proceedings. [S.l.: s.n.], 2012. 978-90-816960-0-5 http://www.alice.cnptia.embrapa.br/alice/handle/doc/948445 en eng openAccess p. 181-186. |
institution |
EMBRAPA |
collection |
DSpace |
country |
Brasil |
countrycode |
BR |
component |
Bibliográfico |
access |
En linea |
databasecode |
dig-alice |
tag |
biblioteca |
region |
America del Sur |
libraryname |
Sistema de bibliotecas de EMBRAPA |
language |
English eng |
topic |
Cobertura espacial Ambiguidade Ferramenta lingüística Spatial coverage identification Ambiguity problem resolution Toponym resolution Cobertura espacial Ambiguidade Ferramenta lingüística Spatial coverage identification Ambiguity problem resolution Toponym resolution |
spellingShingle |
Cobertura espacial Ambiguidade Ferramenta lingüística Spatial coverage identification Ambiguity problem resolution Toponym resolution Cobertura espacial Ambiguidade Ferramenta lingüística Spatial coverage identification Ambiguity problem resolution Toponym resolution VARGAS, R. N. P. REZENDE, S. de O. MOURA, M. F. SPERANZA, E. A. RODRIGUEZ, E. Discovering the spatial coverage of the documents through the SpatialCIM Methodology. |
description |
The main focus of this paper is to present the SpatialCIM methodology to identify the spatial coverage of the documents in the Brazilian geographic area. This methodology uses a linguistic tool to assist in the entity recognition process. The linguistic tool classifies the recognized entities as person, organization, time and localization, among others. The localization entities are checked using a geographic information system (GIS) in order to extract the Brazilian entity geographic paths. If there are multiple geographic paths for a single entity, the disambiguation process is carried out. This process attempts to locate the best geographic path for an entity considering all the geographic entities in the text. Another important objective of this paper is to show that the disambiguation process improves the geographic classification of the documents considering the obtained geographic paths. The validation process considers a set of news previously labeled by an expert and compared with the results of the disambiguated and non-disambiguated geographic paths. The results showed that the disambiguation process improves the classification compared with the classification without disambiguation. Keywords: Ambiguity problem resolution, spatial coverage identification, toponym resolution. |
author2 |
ROSA NATHALIE PORTUGAL VARGAS, ICMC/USP; SOLANGE DE OLIVEIRA REZENDE, ICMC/USP; MARIA FERNANDA MOURA, CNPTIA; EDUARDO ANTONIO SPERANZA, CNPTIA; ERCILIA RODRIGUEZ. |
author_facet |
ROSA NATHALIE PORTUGAL VARGAS, ICMC/USP; SOLANGE DE OLIVEIRA REZENDE, ICMC/USP; MARIA FERNANDA MOURA, CNPTIA; EDUARDO ANTONIO SPERANZA, CNPTIA; ERCILIA RODRIGUEZ. VARGAS, R. N. P. REZENDE, S. de O. MOURA, M. F. SPERANZA, E. A. RODRIGUEZ, E. |
format |
Anais e Proceedings de eventos |
topic_facet |
Cobertura espacial Ambiguidade Ferramenta lingüística Spatial coverage identification Ambiguity problem resolution Toponym resolution |
author |
VARGAS, R. N. P. REZENDE, S. de O. MOURA, M. F. SPERANZA, E. A. RODRIGUEZ, E. |
author_sort |
VARGAS, R. N. P. |
title |
Discovering the spatial coverage of the documents through the SpatialCIM Methodology. |
title_short |
Discovering the spatial coverage of the documents through the SpatialCIM Methodology. |
title_full |
Discovering the spatial coverage of the documents through the SpatialCIM Methodology. |
title_fullStr |
Discovering the spatial coverage of the documents through the SpatialCIM Methodology. |
title_full_unstemmed |
Discovering the spatial coverage of the documents through the SpatialCIM Methodology. |
title_sort |
discovering the spatial coverage of the documents through the spatialcim methodology. |
publishDate |
2013-02-06 |
url |
http://www.alice.cnptia.embrapa.br/alice/handle/doc/948445 |
work_keys_str_mv |
AT vargasrnp discoveringthespatialcoverageofthedocumentsthroughthespatialcimmethodology AT rezendesdeo discoveringthespatialcoverageofthedocumentsthroughthespatialcimmethodology AT mouramf discoveringthespatialcoverageofthedocumentsthroughthespatialcimmethodology AT speranzaea discoveringthespatialcoverageofthedocumentsthroughthespatialcimmethodology AT rodrigueze discoveringthespatialcoverageofthedocumentsthroughthespatialcimmethodology |
_version_ |
1756018040459231232 |