Explorando la opinión de los usuarios de Twitter: análisis de sentimientos de marca mediante Deep Learning

Soto Sarria, Diego Fernando; Muñoz Bacca, Julian

Files

T03093.pdf (3.06 MB)

Date

2023-07-09

Authors

Soto Sarria, Diego Fernando

Muñoz Bacca, Julian

Thesis Director / Advisor

Diaz Cely, Javier Gustavo

Publisher

Universidad Icesi

Documentos PDF

Resumen

Identificar efectivamente las oportunidades de mejora es fundamental para toda organización; es por esto, que es de gran interés para las mismas tener conocimiento de la percepción de su marca en redes sociales como Twitter, donde sus clientes pueden expresarse pública y libremente. Este estudio propone una solución teórico-práctica aplicando técnicas de minería de texto y Deep Learning sobre los tweets recopilados de los usuarios en 3 de las principales empresas prestadoras de servicios de telecomunicaciones como son Movistar, Claro y Tigo. Comparando las métricas de evaluación, en dos de las redes neuronales recurrentes mayormente usadas en el análisis de sentimiento de texto, como son LSTM (Long Short-Term Memory) y GRU (Gated Recurrent Units). Una vez realizada esta comparación, tanto GRU como LSTM obtuvieron muy buenos resultados en la métrica de evaluación y con poco sobreajuste. Las pruebas ejecutadas con los modelos seleccionados mostraron una alta precisión en la clasificación de Tweets con sentimiento Negativo, con un porcentaje de Sensibilidad (Recall) en los datos de validación superiores al 94%. Sin embargo, en los Tweets con sentimientos No Negativos, la precisión fue más baja, con una Especificidad (Specificity) del 68%, 82.4% y 42.4% para Movistar, Claro y Tigo respectivamente, siendo significativamente bajo para este último. La baja precisión para clasificar los Tweets No Negativos se atribuye a la gran variedad de temas para esta categoría, además de la baja cantidad de datos en comparación con los Negativos. Por lo tanto, para futuros estudios se recomienda el uso de un set de datos (Tweets) mucho más grande para mejorar la precisión en la clasificación de ambos grupos.
Gracias a esta clasificación y la identificación de aspectos negativos detectados en los diferentes comentarios en Twitter, la solución propuesta permite gestionar de manera efectiva la experiencia de usuario mediante un tablero de control desarrollado en Power BI; este facilitará la supervisión de su presencia en esta red social, generando información clave que permita a la organización desarrollar estrategias de negocio basadas en datos que busquen abordar los problemas de manera efectiva y mejorar la calidad del servicio para satisfacer las necesidades del mercado.

Abstract

Identifying opportunities for improvement effectively is fundamental for every organization, so organizations have a strong interest in knowing how their brand is perceived on social networks such as Twitter, where customers can express themselves publicly and freely. This study proposes a theoretical-practical solution that applies text mining and Deep Learning techniques to tweets collected from users of three of the main telecommunications service providers: Movistar, Claro, and Tigo. It compares the evaluation metrics of two of the recurrent neural networks most commonly used in text sentiment analysis, LSTM (Long Short-Term Memory) and GRU (Gated Recurrent Units). In this comparison, both GRU and LSTM obtained very good results on the evaluation metric, with little overfitting. Tests run with the selected models showed strong classification performance on tweets with Negative sentiment, with Sensitivity (Recall) on the validation data above 94%. For tweets with Non-Negative sentiment, however, performance was lower, with a Specificity of 68%, 82.4%, and 42.4% for Movistar, Claro, and Tigo respectively, the last of these being markedly low. The weaker performance on Non-Negative tweets is attributed to the wide variety of topics in this category and to the small amount of data compared with Negative tweets. For future studies, a much larger dataset of tweets is therefore recommended to improve the classification of both groups.
Thanks to this classification and the identification of negative aspects in Twitter comments, the proposed solution supports effective management of the user experience through a dashboard developed in Power BI. The dashboard will facilitate monitoring of the organization's presence on this social network and generate key information that lets the organization develop data-driven business strategies to address problems effectively and improve service quality to meet market needs.
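
The Sensitivity and Specificity figures quoted in the abstract can be made concrete with a small sketch. The Python below is illustrative only and is not taken from the thesis: it assumes the Negative class is treated as the positive class (consistent with Sensitivity being reported for Negative tweets and Specificity for Non-Negative ones), and the function name `sensitivity_specificity` and the sample labels are hypothetical.

```python
from typing import Sequence, Tuple

# Assumption: Negative sentiment is encoded as the positive class (1),
# Non-Negative as the negative class (0).
NEGATIVE = 1
NON_NEGATIVE = 0


def sensitivity_specificity(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == NEGATIVE and p == NEGATIVE)
    fn = sum(1 for t, p in pairs if t == NEGATIVE and p == NON_NEGATIVE)
    tn = sum(1 for t, p in pairs if t == NON_NEGATIVE and p == NON_NEGATIVE)
    fp = sum(1 for t, p in pairs if t == NON_NEGATIVE and p == NEGATIVE)

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must appear in y_true")

    sensitivity = tp / (tp + fn)  # share of Negative tweets correctly found
    specificity = tn / (tn + fp)  # share of Non-Negative tweets correctly found
    return sensitivity, specificity


# Hypothetical labels for illustration; these are not data from the thesis.
y_true = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]
y_pred = [1, 1, 1, 1, 1, 1, 1, 0, 1, 0]

sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"Sensitivity: {sens:.3f}, Specificity: {spec:.3f}")
```

In this toy sample only two of the ten tweets are Non-Negative, so a single misclassified Non-Negative tweet halves the Specificity. This is the kind of effect the abstract points to when it links the low Specificity to the small amount of Non-Negative data.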

Palabras clave

Análisis de sentimientos

Deep Learning

Twitter

Marca

Tesis de Maestría en Ciencia de Datos

Keywords

Sentiment analysis

Deep Learning

Twitter

Brand

OLIB

https://biblioteca2.icesi.edu.co/cgi-olib/?oid=366460

URI

https://hdl.handle.net/10906/130586

Collections

Maestría en Ciencia de Datos

Creative Commons license

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International
