Sentiment analysis using unsupervised learning for local government elections in South Africa

dc.contributor.advisorMarivate, Vukosi
dc.contributor.coadvisorOlaleye, Kayode
dc.contributor.emailu22826476@tuks.co.zaen_US
dc.contributor.postgraduateMatloga, Mokgadi Penelope
dc.date.accessioned2024-09-13T11:57:27Z
dc.date.available2024-09-13T11:57:27Z
dc.date.created2024-04
dc.date.issued2023-11
dc.descriptionMini Dissertation (MIT (Big Data Science))--University of Pretoria, 2023.en_US
dc.description.abstractUnderstanding public sentiment is vital for political parties in order for them to be able to structure their election campaigns around voter expectations. The study focuses on unsupervised learning to assess the variation of polarity sentiment in tweets during the 2021 South African local government election campaign. The study uses a pre-trained twitter-roberta-base-sentiment-latest model from Hugging Face and unsupervised lexicon based pre-trained approaches, namely: VADER and TextBlob to determine the polarity sentiment in order to gain insight that could be applied towards informing political campaigns and to see if there are any distinct sentiment patterns or shifts during different phases of the 2021 local government elections campaigns. Furthermore, the study applies the use of suspicious patterns and K-Means methods to classify the users as either bots and human using to be able to identify the user behind the keyboard. The study also make use of OpenAI GPT model to label the dataset for fine-tuning and addresses the issue of class imbalance. VADER and TextBlob results show a significant difference from that of the twitter-roberta-base-sentiment-latest models when comparing the statistical distribution based on the sentiment results and the user classification results. Based on the results, there is a significant variation across all sentiment classes and they vary over time. Furthermore, the results revealed TRBSL and TRBSL** outperforms VADER and TextBlob based on the scores for weighted accuracy and F1-scores. It was discovered that most of the tweets were generated by humans, with only few being identified as bot-generated and having a negative sentiments.en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreeMIT (Big Data Science)en_US
dc.description.departmentComputer Scienceen_US
dc.description.facultyFaculty of Engineering, Built Environment and Information Technologyen_US
dc.identifier.citation*en_US
dc.identifier.otherA2024en_US
dc.identifier.urihttp://hdl.handle.net/2263/98196
dc.language.isoenen_US
dc.publisherUniversity of Pretoria
dc.rights© 2021 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectSentiment analysisen_US
dc.subjectOpenAIen_US
dc.subjectFine-tuningen_US
dc.subjectSuspicious patternsen_US
dc.subjectUser classificationen_US
dc.subjectLocal government electionen_US
dc.titleSentiment analysis using unsupervised learning for local government elections in South Africaen_US
dc.typeMini Dissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Matloga_Sentiment_2023.pdf
Size:
7.74 MB
Format:
Adobe Portable Document Format
Description:
Mini Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: