Split-word Architecture in Recurrent Neural Networks POS-Tagging

IRIS

We analyze Recurrent Neural Network (RNN) architectures to handle the problem of Part-of-Speech (POS) Tagging. When linguistic rules are inserted ad-hoc into the decision algorithm, there is a difficulty in understanding the role of prior information and learning. The real potential of recurrent networks is demonstrated in this paper on the Italian language in a purely data-driven approach, where we can reach the state-of-the-art on the UD Italian-ISTD (Italian Stanford Dependency Treebank) dataset in comparison to TINT. We propose a methodology for splitting words that are mapped to embedding spaces and fed to forward-backward networks.

Split-word Architecture in Recurrent Neural Networks POS-Tagging

Di Gennaro, G;Ospedale, A;Di Girolamo, A;Buonanno, A;Palmieri, F.;Fedele, G

2022

Abstract

We analyze Recurrent Neural Network (RNN) architectures to handle the problem of Part-of-Speech (POS) Tagging. When linguistic rules are inserted ad-hoc into the decision algorithm, there is a difficulty in understanding the role of prior information and learning. The real potential of recurrent networks is demonstrated in this paper on the Italian language in a purely data-driven approach, where we can reach the state-of-the-art on the UD Italian-ISTD (Italian Stanford Dependency Treebank) dataset in comparison to TINT. We propose a methodology for splitting words that are mapped to embedding spaces and fed to forward-backward networks.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Codice ISBN
	
				978-1-7281-8671-9
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11591/496669

Citazioni

ND

ND

1

social impact