Misinformation detection : a review for high and low resource languages

dc.contributor.authorRananga, Seani
dc.contributor.authorModupe, Abiodun
dc.contributor.authorIsong, Abiodun
dc.contributor.authorMarivate, Vukosi
dc.contributor.emailvukosi.marivate@up.ac.zaen_US
dc.date.accessioned2025-04-22T12:45:23Z
dc.date.available2025-04-22T12:45:23Z
dc.date.issued2024-12
dc.description.abstractThe rapid spread of misinformation on platforms like Twitter, and Facebook, and in news headlines highlights the urgent need for effective ways to detect it. Currently, researchers are increasingly using machine learning (ML) and deep learning (DL) techniques to tackle misinformation detection (MID) because of their proven success. However, this task is still challenging due to the complexity of deceptive language, digital editing tools, and the lack of reliable linguistic resources for non-English languages. This paper provides a comprehensive analysis of relevant research, providing insights into advanced techniques for MID. It covers dataset assessments, the importance of using multiple forms of data (multimodality), and different language representations. By applying the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) methodology, the study identified and analyzed literature from 2019 to 2024 across five databases: Google Scholar, Springer, Elsevier, ACM, and IEEE Xplore. The study selected thirty-one papers and examined the effectiveness of various ML and DL approaches with a focal point on performance metrics, datasets, and false or misleading information detection challenges. The findings indicate that most current MID models are heavily dependent on DL techniques, with approximately 81% of studies preferring these over traditional ML methods. In addition, most studies are text-based, with much less attention given to audio, speech, images, and videos. The most effective models are mainly designed for highresource languages, with English datasets being the most used (67%), followed by Arabic (14%), Chinese (11%), and others. Less than 10% of the studies focus on low-resource languages (LRLs). Therefore, the study highlighted the need for robust datasets and interpretable, scalable MID models for LRLs. It emphasizes the critical need to prioritize and advance MID research for LRLs across all data types, including text, audio, speech, images, videos, and multimodal approaches. This study aims to support ongoing efforts to combat misinformation and promote a more informed understanding of underresourced African languages.en_US
dc.description.departmentComputer Scienceen_US
dc.description.librarianam2025en_US
dc.description.sdgSDG-09: Industry, innovation and infrastructureen_US
dc.description.urihttps://journal-isi.org/index.php/isien_US
dc.identifier.citationRananga, S., Isong, B., Modupe, A. et al. 2024, 'Misinformation detection : a review for high and low resource languages', Journal of Information Systems and Informatics, vol. 6, no. 4, pp. 2892-2922. DOI: 10.51519/journalisi.v6i4.931.en_US
dc.identifier.issn2656-5935 (print)
dc.identifier.issn2656-4882 (online)
dc.identifier.other10.51519/journalisi.v6i4.931
dc.identifier.urihttp://hdl.handle.net/2263/102180
dc.language.isoenen_US
dc.publisherUniversitas Bina Darmaen_US
dc.rightsThis work is licensed under a Creative Commons Attribution 4.0 International License.en_US
dc.subjectMisinformation detectionen_US
dc.subjectLow-resource languagesen_US
dc.subjectHigh-resource languagesen_US
dc.subjectAfrican languagesen_US
dc.subjectSDG-09: Industry, innovation and infrastructureen_US
dc.titleMisinformation detection : a review for high and low resource languagesen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Rananga_Misinformation_2024.pdf
Size:
1.02 MB
Format:
Adobe Portable Document Format
Description:
Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: