A cloud-based approach for analyzing viral propagation of linguistic deviations by social networking: Current challenges and pitfalls for text analysis tools


Marulli F. (Methodology); Nardaggio A. (Software); Racioppi A. (Validation); Vallifuoco L. (Software)

2017

Abstract

Social network activity offers a non-trivial testbed for linguistic analysis and has significantly changed how textual content is created. Linguistic solecisms, blunders, and, more generally, deviations from standard linguistic norms are becoming the rule rather than the exception. Social networks instantly and virally propagate these deviations among users, who are increasingly moving away from standard language usage. Performing text analysis on documents that contain such deviations is a challenging task. In this work, we propose an approach that supports text analysis tasks on a set of deviated textual documents. It exploits a “linguistic blundersonomy”, a taxonomy of linguistic deviations, built progressively by processing textual Big Data provided by social networks in a cloud-based environment (SAP-HANA). A preliminary case study for the Italian language is presented, showing how exploiting a linguistic blundersonomy could improve the precision of a sentiment and opinion mining process and, more generally, of a text analysis process.
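As a rough illustration of the idea summarized in the abstract, the sketch below shows how a taxonomy of deviations might be used to normalize a post before a lexicon-based polarity score is computed. It is not the authors' implementation and does not use SAP-HANA: the `BLUNDERSONOMY` entries, the `LEXICON`, the category labels, and all function names are hypothetical, chosen only to make the example self-contained.

```python
import re

# Hypothetical toy blundersonomy (an assumption, not from the paper):
# deviated form -> (standard form, deviation category).
BLUNDERSONOMY = {
    "xke": ("perché", "abbreviation"),
    "nn": ("non", "abbreviation"),
    "cmq": ("comunque", "abbreviation"),
    "bellisimo": ("bellissimo", "misspelling"),
}

# Hypothetical toy polarity lexicon keyed on standard Italian forms.
LEXICON = {"bellissimo": 1, "brutto": -1}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def normalize(tokens):
    """Replace each known deviation with its standard form; keep other tokens."""
    return [BLUNDERSONOMY.get(t, (t, None))[0] for t in tokens]


def polarity(tokens):
    """Sum the lexicon polarity of every token (unknown tokens count as 0)."""
    return sum(LEXICON.get(t, 0) for t in tokens)


post = "cmq il film era bellisimo"
raw_tokens = tokenize(post)
print("without normalization:", polarity(raw_tokens))
print("with normalization:   ", polarity(normalize(raw_tokens)))
```

In this toy case the misspelled adjective is missed by the lexicon until the deviation is mapped back to its standard form, which is the kind of effect the abstract attributes to exploiting a blundersonomy.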


Year: 2017

Series title: Lecture Notes on Data Engineering and Communications Technologies

All authors: Marulli, F.; Nardaggio, A.; Racioppi, A.; Vallifuoco, L.

Appears in types: 2.1 Contribution in volume (Chapter or Essay)

Files in this product:

There are no files associated with this product.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11591/442615
