The automatic determination of translation equivalents in lexicography : what works and what doesn't?
dc.contributor.author | Denisova, Michaela | |
dc.contributor.author | De Schryver, Gilles-Maurice | |
dc.contributor.author | Rychly, Pavel | |
dc.date.accessioned | 2025-09-16T13:09:38Z | |
dc.date.available | 2025-09-16T13:09:38Z | |
dc.date.issued | 2024-12 | |
dc.description | This paper is part of the publication: Despot, K. Š., Ostroški Anić, A., & Brač, I. (Eds.). (2024). Lexicography and Semantics. Proceedings of the XXI EURALEX International Congress. Institute for the Croatian Language. | |
dc.description.abstract | Cross-lingual embedding models act as facilitators of lexical knowledge transfer and offer many advantages, notably their applicability to low-resource and non-standard language pairs, making them a valuable tool for retrieving translation equivalents in lexicography. Despite their potential, these models have primarily been developed with a focus on Natural Language Processing (NLP), leading to significant issues, including flawed training and evaluation data, as well as inadequate evaluation metrics and procedures. In this paper, we introduce cross-lingual embedding models for lexicography, addressing the challenges and limitations inherent in the current NLP-focused research. We demonstrate the problematic aspects across three baseline cross-lingual embedding models and three language pairs and outline possible solutions. We show the importance of high-quality data, advocating that its role is vital compared to algorithmic optimisation in enhancing the effectiveness of these models. | |
dc.description.department | African Languages | |
dc.description.librarian | am2025 | |
dc.description.sdg | SDG-04: Quality Education | |
dc.description.uri | https://euralex.org/publications/ | |
dc.identifier.citation | Denisova, M., De Schryver, G.-M., Rychly, P. 2024, 'The automatic determination of translation equivalents in lexicography : what works and what doesn't?', EURALEX Proceedings, pp. 305-316. | |
dc.identifier.issn | 2521-7100 | |
dc.identifier.uri | http://hdl.handle.net/2263/104347 | |
dc.language.iso | en | |
dc.publisher | European Association for Lexicography | |
dc.rights | © European Association for Lexicography. All materials here are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. | |
dc.subject | Translation equivalent determination | |
dc.subject | Cross-lingual embedding models | |
dc.subject | Evaluation | |
dc.title | The automatic determination of translation equivalents in lexicography : what works and what doesn't? | |
dc.type | Article |