Loading…
Friday April 10, 2026 3:00pm - 5:00pm GMT+07

Authors - Ying Tang, Chuanchen BI
Abstract - This article presents a comprehensive analysis of methods and recent research in the sentiment analysis of Uzbek-language social media posts. A balanced corpus of 100,000 posts from Telegram, Instagram, Twitter, and Facebook was constructed as the object of study, in which positive, neutral, and negative classes are equally represented. The data were subjected to thorough preprocessing steps including cleaning, normalization, tokenization, removal of stop words, stemming, and lemmatization. The evaluated models include Naive Bayes, Support Vector Machines (SVM), Conditional Random Fields (CRF), Long Short-Term Memory networks (LSTM), and transformer-based architectures such as BERT and RoBERTa. The accuracy, F1-score, and runtime performance of each model were compared. Experimental results indicate that transformer-based models achieved the highest accuracy (~92%), followed by LSTM (~90%) and SVM (~88%). Despite being a simple method, Naive Bayes served as a baseline (~78% accuracy). The literature review highlights prior research conducted in Uzbek sentiment analysis, emphasizing the importance of corpus creation and accounting for language-specific features. The results indicate that transformer models provide the highest accuracy, whereas classical methods remain competitive even in low-resource settings. The article concludes with a discussion of promising directions and potential practical applications in the field of Uzbek-language sentiment analysis.
Paper Presenter
avatar for Ying Tang

Ying Tang

Thailand

Friday April 10, 2026 3:00pm - 5:00pm GMT+07
Virtual Room E Bangkok, Thailand

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link