Mostrar el registro sencillo del ítem
dc.rights.license | https://creativecommons.org/licenses/by-nc-nd/2.5/ar/ | es_ES |
dc.creator | Tessore, Juan Pablo | es_ES |
dc.creator | Esnaola, Leonardo Martín | es_ES |
dc.creator | Russo, Claudia Cecilia | es_ES |
dc.creator | Baldassarri, Sandra | es_ES |
dc.date.accessioned | 2021-07-26T15:24:35Z | |
dc.date.available | 2021-07-26T15:24:35Z | |
dc.date.issued | 2019-06-25 | |
dc.identifier.citation | Juan Pablo Tessore, Leonardo Martín Esnaola, Claudia Cecilia Russo, and Sandra Baldassarri. 2019. Comparative analysis of preprocessing tasks over social media texts in Spanish. In Proceedings of the XX International Conference on Human Computer Interaction (Interacción '19). Association for Computing Machinery, New York, NY, USA, Article 27, 1–8. DOI:https://doi.org/10.1145/3335595.3335632 | es_ES |
dc.identifier.isbn | 978-1-4503-7176-6/19/06 | es_ES |
dc.identifier.uri | https://repositorio.unnoba.edu.ar/xmlui/handle/23601/143 | |
dc.description.abstract | One of the key aspects of the texts coming from social media is that they tend to be very noisy. This is mainly because of the usage of informal language and none standard grammatical structures. So in order to use these contents as input for a text analysis process, it is highly recommended to previously clean and reduce the noise of the data. This work focuses on measuring the effectiveness that diverse cleaning and repairing tasks have on the data. The results obtained, indicate that the tasks of tokens with no letters removal, and stressed words correction are the most effective. In addition, some tasks like hashtags or usernames processing, which behave very well in other datasets, are not that relevant in this one. This research is part of a more general one that pursues to build an automatic emotion classifier that makes use of the preprocessed comments as input. | es_ES |
dc.description.sponsorship | Fil: Tessore, Juan Pablo. Universidad Nacional del Noroeste de la Provincia de Buenos Aires. Escuela de Tecnología. Instituto de Investigación y Transferencia en Tecnología, Centro Asociado CIC; Argentina | es_ES |
dc.description.sponsorship | Fil: Tessore, Juan Pablo. Comisión de Investigaciones Científicas de la Provincia de Buenos Aires. | es_ES |
dc.description.sponsorship | Fil: Esnaola, Leonardo Martín. Universidad Nacional del Noroeste de la Provincia de Buenos Aires. Escuela de Tecnología. Instituto de Investigación y Transferencia en Tecnología, Centro Asociado CIC; Argentina | es_ES |
dc.description.sponsorship | Fil: Russo, Claudia Cecilia. Universidad Nacional del Noroeste de la Provincia de Buenos Aires. Escuela de Tecnología. Instituto de Investigación y Transferencia en Tecnología, Centro Asociado CIC; Argentina | es_ES |
dc.description.sponsorship | Fil: Baldassarri, Sandra. Departamento de Informática e Ingeniería de Sistemas, Universidad de Zaragoza, Aragon, Zaragoza, España | es_ES |
dc.description.sponsorship | Fil: Baldassarri, Sandra. Instituto de Investigación en Ingeniería (I3A), Universidad de Zaragoza, Zaragoza, Aragon, España | es_ES |
dc.format | application/pdf | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Association for Computing Machinery (ACM) | es_ES |
dc.relation | info:eu-repo/grantAgreement/UNNOBA/SIB2019/EXP 536/2019/AR. Buenos Aires/Tecnología y Aplicaciones de Sistemas de Software: Calidad e Innovación en procesos, productos y servicios | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.source | Interacción 2019: XX International Conference on Human Computer Interaction | es_ES |
dc.subject | Text mining | es_ES |
dc.subject | Text preprocessing | es_ES |
dc.subject | Text classification, | es_ES |
dc.subject | Sentiment Analysis | es_ES |
dc.title | Comparative analysis of preprocessing tasks over social media texts in Spanish | es_ES |
dc.type | info:eu-repo/semantics/conferenceObject | es_ES |
dc.type | info:ar-repo/semantics/documento de conferencia | es_ES |
dc.type | info:eu-repo/semantics/acceptedVersion | es_ES |
dc.type | info:eu-repo/semantics/conferenceObject | es_ES |
dc.type | info:ar-repo/semantics/documento de conferencia | es_ES |
dc.type | info:eu-repo/semantics/acceptedVersion | es_ES |
dc.type | info:eu-repo/semantics/conferenceObject | es_ES |
dc.type | info:ar-repo/semantics/documento de conferencia | es_ES |
dc.type | info:eu-repo/semantics/acceptedVersion | es_ES |
dc.description.version | Con referato | es_ES |
dc.relation.publisherversion | https://doi.org/10.1145/3335595.3335632 | es_ES |
dc.contributor.orcid | 0000-0002-2111-0976 | es_ES |
dc.contributor.orcid | 0000-0001-6298-9019 | es_ES |
dc.contributor.orcid | 0000-0002-9315-6391 | es_ES |