Optimizing translation for low-resource languages : efficient fine-tuning with custom prompt engineering in large language models

dc.contributor.author Khoboko, Pitso Walter
dc.contributor.author Marivate, Vukosi
dc.contributor.author Sefara, Joseph
dc.contributor.email u21824772@tuks.co.za
dc.date.accessioned 2025-09-05T05:53:00Z
dc.date.available 2025-09-05T05:53:00Z
dc.date.issued 2025-06
dc.description DATA AVAILABILITY: Data will be made available on request.
dc.description.abstract Training large language models (LLMs) can be prohibitively expensive. However, the emergence of new Parameter-Efficient Fine-Tuning (PEFT) strategies provides a cost-effective approach to unlocking the potential of LLMs across a variety of natural language processing (NLP) tasks. In this study, we selected the Mistral 7B language model as our primary LLM due to its superior performance, which surpasses that of Llama 2 13B across multiple benchmarks. By leveraging PEFT methods, we aimed to significantly reduce the cost of fine-tuning while maintaining high levels of performance. Despite their advancements, LLMs often struggle with translation tasks for low-resource languages, particularly morphologically rich African languages. To address this, we employed customized prompt engineering techniques to enhance LLM translation capabilities for these languages. Our experimentation focused on fine-tuning the Mistral 7B model to identify the best-performing ensemble using a custom prompt strategy. The results obtained from the fine-tuned Mistral 7B model were compared against several models: Serengeti, Gemma, Google Translate, and No Language Left Behind (NLLB). Specifically, Serengeti and Gemma were fine-tuned using the same custom prompt strategy as the Mistral model, while Google Translate and NLLB, which are pre-trained to handle English-to-Zulu and English-to-Xhosa translations, were evaluated directly on the test dataset. This comparative analysis allowed us to assess the efficacy of the fine-tuned Mistral 7B model against both custom-tuned and pre-trained translation models. LLMs have traditionally struggled to produce high-quality translations, especially for low-resource languages. Our experiments revealed that the key to improving translation performance lies in using the correct prompt during fine-tuning. We used the Mistral 7B model to develop a custom prompt that significantly enhanced translation quality for English-to-Zulu and English-to-Xhosa language pairs. After fine-tuning the Mistral 7B model for 30 GPU days, we compared its performance to the NLLB model and the Google Translate API on the same test dataset. While NLLB achieved the highest scores across BLEU, G-Eval (cosine similarity), and chrF++ (F1-score), our results demonstrated that Mistral 7B, with the custom prompt, still performed competitively. Additionally, we showed that our prompt template can improve the translation accuracy of other models, such as Gemma and Serengeti, when applied to high-quality bilingual datasets. This demonstrates that our custom prompt strategy is adaptable across different model architectures and bilingual settings, and is highly effective in accelerating learning for low-resource language translation.
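As an illustration of the PEFT approach described in the abstract, the sketch below shows LoRA-style fine-tuning of Mistral 7B on an instruction-style English-to-Zulu translation prompt. It is a minimal sketch under stated assumptions: the prompt template, hyperparameters, and toy data are illustrative, not the authors' exact configuration, and the paper's custom prompt wording is not reproduced in this record.

# Sketch: LoRA (PEFT) fine-tuning of Mistral 7B for English-to-Zulu translation
# with an instruction-style prompt. Assumes the Hugging Face transformers, peft
# and datasets libraries; prompt template and hyperparameters are illustrative.
import torch
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling,
                          Trainer, TrainingArguments)
from peft import LoraConfig, get_peft_model

BASE_MODEL = "mistralai/Mistral-7B-v0.1"

# Hypothetical custom prompt template for the En-Zul direction.
PROMPT = (
    "### Task: Translate the following English sentence into isiZulu.\n"
    "### English: {src}\n"
    "### isiZulu: {tgt}"
)

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(BASE_MODEL, torch_dtype=torch.bfloat16)

# LoRA adapters on the attention projections keep the number of trainable
# parameters small compared with full fine-tuning.
lora_cfg = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)

# Toy parallel pair standing in for the bilingual training corpus.
pairs = [{"src": "Good morning", "tgt": "Sawubona ekuseni"}]

def to_features(example):
    # Fill the prompt template and tokenize; labels are created by the collator.
    text = PROMPT.format(**example) + tokenizer.eos_token
    return tokenizer(text, truncation=True, max_length=512)

train_ds = Dataset.from_list(pairs).map(to_features, remove_columns=["src", "tgt"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="mistral7b-en-zul-lora",
                           per_device_train_batch_size=1,
                           num_train_epochs=1, logging_steps=1),
    train_dataset=train_ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()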
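The evaluation metrics named above (BLEU, chrF++, and a G-Eval-style mean cosine similarity) could be computed roughly as follows. The use of sacrebleu and the LaBSE embedding model is an assumption for illustration, not necessarily the authors' evaluation pipeline.

# Sketch: scoring translations with BLEU, chrF++ and a mean cosine-similarity
# score in the spirit of the G-Eval metric mentioned in the abstract.
# Assumes sacrebleu and sentence-transformers; LaBSE is an assumed embedder.
import sacrebleu
from sentence_transformers import SentenceTransformer, util

hypotheses = ["Sawubona ekuseni"]         # model outputs
references = ["Sawubona ekuseni enhle"]   # gold isiZulu translations

bleu = sacrebleu.corpus_bleu(hypotheses, [references])
# word_order=2 turns chrF into chrF++ (character plus word n-grams).
chrfpp = sacrebleu.corpus_chrf(hypotheses, [references], word_order=2)

embedder = SentenceTransformer("sentence-transformers/LaBSE")
hyp_emb = embedder.encode(hypotheses, convert_to_tensor=True)
ref_emb = embedder.encode(references, convert_to_tensor=True)
mean_cosine = util.cos_sim(hyp_emb, ref_emb).diagonal().mean().item()

print(f"BLEU = {bleu.score:.2f}, chrF++ = {chrfpp.score:.2f}, "
      f"mean cosine similarity = {mean_cosine:.3f}")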
dc.description.department Computer Science
dc.description.librarian hj2025
dc.description.sdg SDG-09: Industry, innovation and infrastructure
dc.description.uri https://www.elsevier.com/locate/mlwa
dc.identifier.citation Khoboko, P.W., Marivate, V. & Sefara, J. 2025, 'Optimizing translation for low-resource languages: efficient fine-tuning with custom prompt engineering in large language models', Machine Learning with Applications, vol. 20, art. 100649, pp. 1-18, doi: 10.1016/j.mlwa.2025.100649.
dc.identifier.issn 2666-8270 (online)
dc.identifier.other 10.1016/j.mlwa.2025.100649
dc.identifier.uri http://hdl.handle.net/2263/104222
dc.language.iso en
dc.publisher Elsevier
dc.rights © 2025 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
dc.subject Large language model (LLM)
dc.subject Parameter-efficient fine-tuning (PEFT)
dc.subject Natural language processing (NLP)
dc.subject Mistral 7B
dc.subject Prompt engineering
dc.subject In-context learning (ICL)
dc.subject English-to-Zulu (En-Zul)
dc.subject English-to-Xhosa (En-Xh)
dc.subject BLEU score
dc.subject F1-score
dc.subject G-Eval (mean cosine-similarity score)
dc.title Optimizing translation for low-resource languages : efficient fine-tuning with custom prompt engineering in large language models
dc.type Article
