Abstract
This research analyzes the impact of the filtering of word n-grams, using their grammatical category, on the identification of sentiment based on text comments from social networks in Spanish. The impact of filtering n-grams containing adjectives, adverbs and interjections was investigated. It was determined that it is possible to reduce the volume of processed data and at the same time achieve an improvement of up to 30 % in the accuracy when classifying an annotated corpus of test comments separating those that contain sentiment from those that do not.
References
Casasola, E. (2015). [casasola] (2015, Noviembre 26) Tecnologías del lenguaje en Costa Rica [Archivo de video]. Recuperado de https://www.youtube.com/watch?v=y66DlvEVomE&feature=youtu.be&t=34m13s
Casasola, E. y Leoni de León, A. (2017). Transferencia de la polaridad semántica de frases idiomáticas a comentarios de opinión. Revista Káñina, 40(3), 65-76.
Hatzivassiloglou, V. y Wiebe, J. M. (2000, Julio). Effects of adjective orientation and gradability on sentence subjectivity. En Proceedings of the 18th conference on Computational linguistics. New Buswick, NJ. Association for Computational Linguistics. I, 299-305. Recuperado de http://www.aclweb.org/anthology/C00-1044
Liu, B. (2012). Sentiment Analysis and Opinion Mining. San Rafael, California: Morgan & Claypool Publishers.
Martínez-Cámara, E., García-Cumbreras, M., Villena-Román, J. y García-Morera, J. (2016). TASS 2015 - The Evolution of the Spanish Opinion Mining Systems. Procesamiento del Lenguaje Natural, LVI, 33-40. Recuperado de http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/issue/view/218
Padró, L. y Stanilovsky, E. (2011). Analizadores Multilingües. FreeLing. Linguamática, III(2), 13-20.
Phan, Dang-Hung y Cao, Tuan-Dung. (2014). Applying skip-gram word estimation and SVM-based classification for opinion mining Vietnamese food places text reviews. En Proceedings of the Fifth Symposium on Information and Communication Technology (pp. 232-239). Recuperado de http://dl.acm.org/citation.cfm?doid=2676585.2676606
Rish, I. (2001). An empirical study of the naive Bayes classifier. Recuperado de http://www.research.ibm.com/people/r/rish/papers/RC22230.pdf
Wiebe, J., Wilson, T., Bruce, R., Bell, M. y Martin, M. (2006). Learning subjective language. Learning, 30(3), 277-308. Recuperado de http://www.mitpressjournals.org/doi/pdf/10.1162/0891201041850885