Spatial-temporal topic modelling of COVID-19 tweets in South Africa

dc.contributor.advisorMazarura, Jocelyn
dc.contributor.coadvisorFabris-Rotelli, Inger Nicolette
dc.contributor.emailu18073159@tuks.co.zaen_US
dc.contributor.postgraduateJafta, Papama Hlumela Gandhi
dc.date.accessioned2024-02-13T09:41:21Z
dc.date.available2024-02-13T09:41:21Z
dc.date.created2024-04
dc.date.issued2023-12-07
dc.descriptionMini Dissertation (MSc (Advanced Data Analytics))--University of Pretoria, 2023.en_US
dc.description.abstractIn the era of social media, the analysis of Twitter data has become increasingly important for understanding the dynamics of online discourse. This research introduces a novel approach for tracking the spatial and temporal evolution of topics in Twitter data. Leveraging the spatial and temporal labels provided by Twitter for tweets, we propose the Clustered Biterm Topic Model. This model combines the Biterm Topic Model with K-medoid clustering to uncover the intricate topic development patterns over space and time. To enhance the accuracy and applicability of our model, we introduce an innovative element: a covariate-dependent matrix. This matrix incorporates essential covariate information and geographic proximity into the dissimilarity matrix used by K-Medoids clustering. By considering the inherent semantic relationships between topics and the contextual information provided by covariates and geographic proximity, our model captures the complex interplay of topics as they emerge and evolve across different regions and timeframes on Twitter. The proposed Clustered Biterm Topic Model offers a robust and versatile tool for researchers, policymakers, and businesses to gain deeper insights into the dynamic landscape of online conversations, which are inherently shaped by space and time.en_US
dc.description.availabilityRestricteden_US
dc.description.degreeMSc (Advanced Data Analytics)en_US
dc.description.departmentStatisticsen_US
dc.description.facultyFaculty of Natural and Agricultural Sciencesen_US
dc.description.sponsorshipSTATOMET TUKS Cricketen_US
dc.identifier.citation*en_US
dc.identifier.doi10.25403/UPresearchdata.25208939en_US
dc.identifier.otherA2024en_US
dc.identifier.urihttp://hdl.handle.net/2263/94530
dc.language.isoenen_US
dc.publisherUniversity of Pretoria
dc.rights© 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectShort-text topic modelling
dc.subjectCOVID-19
dc.subjectCovariate-dependent weighting matrix
dc.subjectSpatial-temporal
dc.subjectK-medoids
dc.subject.otherSustainable Development Goals (SDGs)
dc.subject.otherSDG-09: Industry, innovation and infrastructure
dc.subject.otherNatural and agricultural sciences theses SDG-09
dc.subject.otherSDG-16: Peace, justice and strong institutions
dc.subject.otherNatural and agricultural sciences theses SDG-16
dc.titleSpatial-temporal topic modelling of COVID-19 tweets in South Africaen_US
dc.typeMini Dissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Jafta_Spatial_2023.pdf
Size:
7.73 MB
Format:
Adobe Portable Document Format
Description:
Mini Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: