CONTENT ANALYSIS OF SOCIAL MEDIA USING MACHINE LEARNING

Main Article Content

Renat AKHMEDOV
Vitalii BEZKOROVAINYI
Vasyl DERBENTSEV

Abstract

Introduction. During the last two decades, the rapid development of social media has caused a revolution in means of communication in modern society. Therefore, a significant number of the world's leading companies began to rebuild their business models using the capabilities of modern means of communication through social networks and other platforms, for which content analysis technologies are successfully used.


Purpose. The purpose of the article is to develop methodological principles for conducting content analysis of electronic resources (social media) based on using Artificial Intelligence technologies, in particular, Machine and Deep Learning.


Results. The paper analyzes the phenomenon of social media and identifies the key factors that determine the effectiveness of their use for both business and consumers. Based on this, the paper explores the features of the content analysis of social media, which take into account their mass character, as well as the presence of large arrays of unstructured information. The large amount of information on various electronic platforms requires adequate means for their monitoring and processing, analysis of content as well. To solve these problems, the paper substantiates the use of modern Natural Language Processing technologies based on Machine and Deep Learning approaches. An alternative to the existing services for content analysis is developing systems based on such tight forwarding motion models, like BERT (provided by Google), or GPT-3 (provided by Open AI), which was implemented on the Transformer Deep Neural Networks architecture. The article also proposes the use of Transfer Learning technology to transfer knowledge from pre-learned language models to another domain or another language, in particular, Ukrainian from other Slavic languages.


Originality. The main findings of this paper are the follows: (i) the main advantages of using social media for businesses and consumers are substantiated; (ii) the characteristic features of conducting content analysis in social media are determined; (iii) the advantages and disadvantages of using Natural Language Processing methods for solving problems of content analysis in social media are shown; (iv) Transfer Learning approach to transfer knowledge from pre-learned language models to another domain or another language, in particular, Ukrainian from other Slavic languages has been proposed for solving content analysis tasks.


Conclusion. The accumulation of a sufficient amount of training data, the development of multi-core CPU and graphics processors, as well as the formation of powerful pre-trained language models and the development of effective algorithms for processing extremely large amounts of information are factors that determine the efficiency of the use of Machine and Deep Learning technology for content analysis tasks in recent years. Therefore, the development of computer systems for content analysis of social media, in particular, using modern technologies of Artificial Intelligence (Machine and Deep Learning), does not lose its relevance and requires further research.

Article Details

Section
Статті

References

Yahoo Finance. URL: https://finance.yahoo.com (дата звернення 15.09.2022).

Krippendorff K. Content Analysis: An introduction to its methodology. London: Sage, 1980.

Костенко Н., Іванов В. Досвід контент-аналізу: Моделі та практики. Київ: Центр вільної преси, 2003. 200 с.

Таршис А. Е. Контент-анализ: Принципы методологии. (Построение теоретической базы. Онтология, аналитика и феноменология текста. Программа исследования). м.: Книжный дом ЛИБРОКОМ, 2014. 182 с.

Іванов В.Ф., Костенко Н.В. Контент-аналіз. Велика українська енциклопедія. URL: https://vue.gov.ua/Контент-аналіз (дата звернення: 15.09.2022).

Берко А. Ю. Системи електронної контент-комерції: монографія / А. Ю. Берко, В. А. Висоцька, В. В. Пасічник. Львів: Вид-во Нац. ун-ту “Львівська політехніка”, 2009. 612 с.

Войтович О. П., Буда А. Г., Головенько В. О. Дослідження методів аналізу соціальних мереж як середовища інформаційних війн. URL: https://epsi.vntu.edu.ua/uploads/2017/76-86ycc0hnc6o8o3xgkr97hrynqd5m0obr.pdf (дата звернення 15.09.2022).

Кісь Я. П., Висоцька В. А., Чирун Л. Б. Застосування контент-аналізу для опрацювання текстових масивів даних. Вісник Національного університету «Львівська політехніка». Інформаційні системи та мережі. 2015. Вип. 814. С. 282-292.

Фольтович В., Коробчинський М., Чирун Л., Висоцька В. Метод контент-аналізу текстової інформації Інтернет-газети. Вісник Національного університету «Львівська політехніка». Комп’ютерні науки та інформаційні технології. 2017. Вип. 864. С. 7-19.

Ахмедов Р.Р., Безкоровайний В.С., Данильченко Т.В. Методологія аналізу контенту електронних засобів масової інформації. Економічний простір. 2021. Вип.176. С. 141-145. DOI: https://doi.org/10.32782/2224-6282/176-25

Digital 2022 Global Overview Report. URL: https://datareportal.com/reports/digital-2022-global-overview-report (дата звернення 15.09.2022).

Statista. URL: https://www.statista.com/statistics/272014/global-social-networks-ranked-by-number-of-users/ (дата звернення 15.09.2022).

Hobson, L., Cole, H., Hannes, H. Natural Language Processing in Action Understanding, analyzing, and generating text with Python. Manning Publications (P), 2019.

Kamath, U., Liu, J.,·Whitaker, J. Deep Learning for NLP and Speech Recognition. Springer Nature Switzerland AG, 2019. DOI: https://doi.org/10.1007/978-3-030-14596-5.

Predictive Analytics. Today. URL: https://www.predictiveanalyticstoday.com/top-qualitative-data-analysis-software/ (дата звернення 15.11.2022).

Intellspot. URL: https://www.intellspot.com/content-analysis-software/ (дата звернення 15.11.2022).

Bizzzdev. URL: https://bizzzdev.com/top-10-content-analysis-tools-in-2022/ (дата звернення 15.11.2022).

Devlin, J., Chang, M., Lee, K., and Toutanova, K. BERT: pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT, P. 4171-4186, 2019.

Brown T. et al. Language models are few-shot learners. arXiv:2005.14165. 2020.

Vaswani, A., Shazeer, N. et al. Attention is all you need. In Proc. of the 1st Conference on Neural Information Processing Systems (NIPS 2017), 2017, pp. 6000-6010.