Due to the huge availability of documents in digital form, and the deception possibility raise bound to the essence of digital documents and the way they are spread, the authorship attribution problem has constantly increased its relevance. Nowa- days, authorship attribution, for both information retrieval and analysis, has gained great importance in the context of security, trust and copyright preservation.This work proposes an innovative multi-agent driven machine learning technique that has been developed for authorship attri- bution. By means of a preprocessing for word-grouping and time- period related analysis of the common lexicon, we determine a bias reference level for the recurrence frequency of the words within analysed texts, and then train a Radial Basis Neural Networks (RBPNN)-based classifier to identify the correct author.The main advantage of the proposed approach lies in the gen- erality of the semantic analysis, which can be applied to different contexts and lexical domains, without requiring any modification. Moreover, the proposed system is able to incorporate an external input, meant to tune the classifier, and then self-adjust by means of continuous learning reinforcement.

An agent-driven semantical identifier using radial basis neural networks and reinforcement learning

NAPOLI, CHRISTIAN;PAPPALARDO, Giuseppe;TRAMONTANA, EMILIANO ALESSIO
2014-01-01

Abstract

Due to the huge availability of documents in digital form, and the deception possibility raise bound to the essence of digital documents and the way they are spread, the authorship attribution problem has constantly increased its relevance. Nowa- days, authorship attribution, for both information retrieval and analysis, has gained great importance in the context of security, trust and copyright preservation.This work proposes an innovative multi-agent driven machine learning technique that has been developed for authorship attri- bution. By means of a preprocessing for word-grouping and time- period related analysis of the common lexicon, we determine a bias reference level for the recurrence frequency of the words within analysed texts, and then train a Radial Basis Neural Networks (RBPNN)-based classifier to identify the correct author.The main advantage of the proposed approach lies in the gen- erality of the semantic analysis, which can be applied to different contexts and lexical domains, without requiring any modification. Moreover, the proposed system is able to incorporate an external input, meant to tune the classifier, and then self-adjust by means of continuous learning reinforcement.
2014
Neural Netwoks; Text Recognition; Natural Languages
File in questo prodotto:
File Dimensione Formato  
paper14-2.pdf

solo gestori archivio

Dimensione 1.88 MB
Formato Adobe PDF
1.88 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/72472
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 31
  • ???jsp.display-item.citation.isi??? ND
social impact