Due to the huge availability of documents in digital form, and the deception possibility raise bound to the essence of digital documents and the way they are spread, the authorship attribution problem has constantly increased its relevance. Nowa- days, authorship attribution, for both information retrieval and analysis, has gained great importance in the context of security, trust and copyright preservation.This work proposes an innovative multi-agent driven machine learning technique that has been developed for authorship attri- bution. By means of a preprocessing for word-grouping and time- period related analysis of the common lexicon, we determine a bias reference level for the recurrence frequency of the words within analysed texts, and then train a Radial Basis Neural Networks (RBPNN)-based classifier to identify the correct author.The main advantage of the proposed approach lies in the gen- erality of the semantic analysis, which can be applied to different contexts and lexical domains, without requiring any modification. Moreover, the proposed system is able to incorporate an external input, meant to tune the classifier, and then self-adjust by means of continuous learning reinforcement.
An agent-driven semantical identifier using radial basis neural networks and reinforcement learning
NAPOLI, CHRISTIAN;PAPPALARDO, Giuseppe;TRAMONTANA, EMILIANO ALESSIO
2014-01-01
Abstract
Due to the huge availability of documents in digital form, and the deception possibility raise bound to the essence of digital documents and the way they are spread, the authorship attribution problem has constantly increased its relevance. Nowa- days, authorship attribution, for both information retrieval and analysis, has gained great importance in the context of security, trust and copyright preservation.This work proposes an innovative multi-agent driven machine learning technique that has been developed for authorship attri- bution. By means of a preprocessing for word-grouping and time- period related analysis of the common lexicon, we determine a bias reference level for the recurrence frequency of the words within analysed texts, and then train a Radial Basis Neural Networks (RBPNN)-based classifier to identify the correct author.The main advantage of the proposed approach lies in the gen- erality of the semantic analysis, which can be applied to different contexts and lexical domains, without requiring any modification. Moreover, the proposed system is able to incorporate an external input, meant to tune the classifier, and then self-adjust by means of continuous learning reinforcement.File | Dimensione | Formato | |
---|---|---|---|
paper14-2.pdf
solo gestori archivio
Dimensione
1.88 MB
Formato
Adobe PDF
|
1.88 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.