Extracting Location Names from Unstructured Italian Texts Using Grammar Rules and MapReduce
Napoli, Christian; Tramontana, Emiliano Alessio
2016-01-01
Abstract
Named entity recognition aims to locate elements in a given text and classify them according to predefined categories, such as the names of persons, organisations, locations, and quantities. This paper proposes an approach for recognising location names by extracting them from unstructured Italian-language texts. We put forward the MapReduce framework for this task, since it is more robust than a classical analysis when the data are unknown and helps parallelise processing, which is essential for large amounts of data.
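The abstract pairs grammar rules with MapReduce but does not spell out either, so the following is only a minimal sketch of how such a combination might look, not the authors' method. The single rule used here (a lowercase Italian preposition followed by one or more capitalised words) and the names `RULE`, `map_chunk`, `reduce_counts`, and `extract_locations` are all assumptions made for illustration. The map step runs independently on each text chunk, which is what would allow the work to be distributed; the reduce step merges the partial counts.

```python
import re
from collections import Counter
from functools import reduce

# Hypothetical grammar rule (an assumption, not taken from the paper):
# a lowercase preposition followed by one or more capitalised words.
PREPOSITIONS = r"(?:a|ad|in|da|di|per|verso)"
CAPITALISED = r"[A-Z][a-zàèéìòù]+"
RULE = re.compile(rf"\b{PREPOSITIONS}\s+({CAPITALISED}(?:\s+{CAPITALISED})*)")


def map_chunk(text):
    """Map step: count candidate location names found in one chunk of text."""
    return Counter(match.group(1) for match in RULE.finditer(text))


def reduce_counts(left, right):
    """Reduce step: merge two partial counts into one."""
    return left + right


def extract_locations(chunks):
    """Apply the map step to every chunk, then fold the partial results."""
    partials = map(map_chunk, chunks)
    return reduce(reduce_counts, partials, Counter())


if __name__ == "__main__":
    chunks = [
        "Vivo a Roma.",
        "Il treno da Milano Centrale arriva a Roma.",
        "Domani parto per Catania.",
    ]
    for name, count in extract_locations(chunks).most_common():
        print(name, count)
```

Because each call to `map_chunk` depends only on its own chunk, the built-in `map` could be swapped for a distributed equivalent without changing the reduce step. A rule this simple also matches non-locations (for example a person's name after "di"), so a realistic rule set would need to be considerably richer.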