The automatic determination of translation equivalents in lexicography : what works and what doesn't?

Denisova, Michaela; De Schryver, Gilles-Maurice; Rychly, Pavel

The automatic determination of translation equivalents in lexicography : what works and what doesn't?

Files

Denisova_Automatic_2024.pdf (358.36 KB)

Date

2024-12

Authors

Denisova, Michaela

De Schryver, Gilles-Maurice

Rychly, Pavel

Publisher

European Association for Lexicography

Abstract

Cross-lingual embedding models act as facilitator of lexical knowledge transfer and offer many advantages, notably their applicability to low-resource and non-standard language pairs, making them a valuable tool for retrieving translation equivalents in lexicography. Despite their potential, these models have primarily been developed with a focus on Natural Language Processing (NLP), leading to significant issues, including flawed training and evaluation data, as well as inadequate evaluation metrics and procedures. In this paper, we introduce cross-lingual embedding models for lexicography, addressing the challenges and limitations inherent in the current NLP-focused research. We demonstrate the problematic aspects across three baseline cross-lingual embedding models and three language pairs and outline possible solutions. We show the importance of high-quality data, advocating that its role is vital compared to algorithmic optimisation in enhancing the effectiveness of these models.

Description

This paper is part of the publication: Despot, K. Š., Ostroški Anić, A., & Brač, I. (Eds.). (2024). Lexicography and Semantics. Proceedings of the XXI EURALEX International Congress. Institute for the Croatian Language.

Keywords

Translation equivalent determination, Cross-lingual embedding models, Evaluation

Sustainable Development Goals

SDG-04: Quality Education

Citation

Denisova, M., De Schryver, G.-M., Rychly, P. 2024, 'The automatic determination of translation equivalents in lexicography : what works and what doesn't?', EURALEX Proceedings, pp. 305-316.

URI

http://hdl.handle.net/2263/104347

Collections

Research Articles (African Languages)
Research Articles (University of Pretoria)

Full item page

The automatic determination of translation equivalents in lexicography : what works and what doesn't?

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Sustainable Development Goals

Citation

URI

Collections