Exploring COVID-19 public perceptions in South Africa through sentiment analysis and topic modelling of Twitter posts

dc.contributor.authorKekere, Temitope
dc.contributor.authorMarivate, Vukosi
dc.contributor.authorHattingh, Maria J. (Marie)
dc.date.accessioned2024-07-23T04:56:14Z
dc.date.available2024-07-23T04:56:14Z
dc.date.issued2023
dc.description.abstractThe narratives shared on social media during a health crisis such as COVID-19 reflect public perceptions of the crisis. This article provides findings from a study of the perceptions of South African citizens regarding the government’s response to the COVID-19 pandemic from March to May 2020. The study analysed Twitter data from posts by government officials and the public in South Africa to measure the public’s confidence in how the government was handling the pandemic. A third of the tweets dataset was labelled using valence aware dictionary and sentiment reasoner (VADER) lexicons, forming the training set for four classical machinelearning algorithms—logistic regression (LR), support vector machines (SVM), random forest (RF), and extreme gradient boosting (XGBoost)—that were employed for sentiment analysis. The effectiveness of these classifiers varied, with error rates of 17% for XGBoost, 14% for RF, and 7% for both SVM and LR. The best-performing algorithm (SVM) was subsequently used to label the remaining two-thirds of the tweet dataset. In addition, the study used, and evaluated the effectiveness of, two topic-modelling algorithms—latent dirichlet allocation (LDA) and non-negative matrix factorisation (NMF)—for classification of the most frequently occurring narratives in the Twitter data. The better-performing of these two algorithms, NMF, identified a prevalence of positive narratives in South African public sentiment towards the government’s response to COVID-19.en_US
dc.description.departmentInformaticsen_US
dc.description.librarianam2024en_US
dc.description.sdgSDG-09: Industry, innovation and infrastructureen_US
dc.description.sponsorshipThe University of Pretoria, Canada’s International Development Research Centre (IDRC) and ABSA.en_US
dc.description.urihttp://link.wits.ac.zajournal/journal.htmlen_US
dc.identifier.citationKekere, T., Marivate, V., & Hattingh, M. (2023). Exploring COVID-19 public perceptions in South Africa through sentiment analysis and topic modelling of Twitter posts. The African Journal of Information and Communication (AJIC), 31, 1-27. https://DOI.org/10.23962/ajic.i31.14834.en_US
dc.identifier.issn1449-2679
dc.identifier.other10.23962/ajic.i31.14834
dc.identifier.urihttp://hdl.handle.net/2263/97160
dc.language.isoenen_US
dc.publisherLearning Information Networking and Knowledge (LINK) Centre, Graduate School of Public and Developmenten_US
dc.rights© 2023 Learning Information Networking and Knowledge (LINK) Centre, Graduate School of Public and Development. This work is distributed under the Creative Commons Attribution-NonCommercial licence.en_US
dc.subjectSentiment analysisen_US
dc.subjectSentiment classificationen_US
dc.subjectTopic modellingen_US
dc.subjectSocial mediaen_US
dc.subjectTwitteren_US
dc.subjectNatural language processing (NLP)en_US
dc.subjectGovernment responseen_US
dc.subjectPublic perceptionsen_US
dc.subjectCOVID-19 pandemicen_US
dc.subjectCoronavirus disease 2019 (COVID-19)en_US
dc.subjectSouth Africa (SA)en_US
dc.subjectSDG-09: Industry, innovation and infrastructureen_US
dc.subjectValence aware dictionary and sentiment reasoner (VADER)en_US
dc.subjectLogistic regression (LR)en_US
dc.subjectExtreme gradient boosting (XGBoost)en_US
dc.subjectSupport vector machines (SVM)en_US
dc.subjectRandom forest (RF)en_US
dc.subjectNon-negative matrix factorisation (NMF)en_US
dc.subjectLatent dirichlet allocation (LDA)en_US
dc.titleExploring COVID-19 public perceptions in South Africa through sentiment analysis and topic modelling of Twitter postsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kekere_Exploring_2023.pdf
Size:
2.03 MB
Format:
Adobe Portable Document Format
Description:
Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: