Named entity recognition aims at locating elements in a given text and classifying them according to pre-defined categories, such as the names of persons, organisations, locations, quantities, etc. This paper proposes an approach to recognise the location names by extracting them from unstructured Italian language texts. We put forward the use of the framework MapReduce for this task, since it is more robust than a classical analysis when data are unknown and assists at parallelising processing, which is essential for a large amount of data.

Extracting Location Names from Unstructured Italian Texts Using Grammar Rules and MapReduce

NAPOLI, CHRISTIAN;TRAMONTANA, EMILIANO ALESSIO;
2016-01-01

Abstract

Named entity recognition aims at locating elements in a given text and classifying them according to pre-defined categories, such as the names of persons, organisations, locations, quantities, etc. This paper proposes an approach to recognise the location names by extracting them from unstructured Italian language texts. We put forward the use of the framework MapReduce for this task, since it is more robust than a classical analysis when data are unknown and assists at parallelising processing, which is essential for a large amount of data.
2016
978-331946253-0
Mapreduce; Named entity recognition; Hadoop
File in questo prodotto:
File Dimensione Formato  
2016icistLocationNames.pdf

solo gestori archivio

Licenza: Non specificato
Dimensione 201.59 kB
Formato Adobe PDF
201.59 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/98769
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 1
social impact