| | SLO | ENG | Cookies and privacy

Bigger font | Smaller font

Show document Help

Title:Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation
Authors:ID Dugonik, Jani (Author)
ID Sepesy Maučec, Mirjam (Author)
ID Verber, Domen (Author)
ID Brest, Janez (Author)
Files:.pdf Dugonik-2023-Reduction_of_Neural_Machine_Trans.pdf (400,40 KB)
MD5: 78CC15F779EB59334CBE411B83149E2D
 
URL https://www.mdpi.com/2227-7390/11/11/2484
 
Language:English
Work type:Article
Typology:1.01 - Original Scientific Article
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:This paper proposes a hybrid machine translation (HMT) system that improves the quality of neural machine translation (NMT) by incorporating statistical machine translation (SMT). Therefore, two NMT systems and two SMT systems were built for the Slovenian-English language pair, each for translation in one direction. We used a multilingual language model to embed the source sentence and translations into the same vector space. From each vector, we extracted features based on the distances and similarities calculated between the source sentence and the NMT translation, and between the source sentence and the SMT translation. To select the best possible translation, we used several well-known classifiers to predict which translation system generated a better translation of the source sentence. The proposed method of combining SMT and NMT in the hybrid system is novel. Our framework is language-independent and can be applied to other languages supported by the multilingual language model. Our experiment involved empirical applications. We compared the performance of the classifiers, and the results demonstrate that our proposed HMT system achieved notable improvements in the BLEU score, with an increase of 1.5 points and 10.9 points for both translation directions, respectively.
Keywords:neural machine translation, statistical machine translation, sentence embedding, similarity, classification, hybrid machine translation
Publication status:Published
Publication version:Version of Record
Submitted for review:21.04.2023
Article acceptance date:25.05.2023
Publication date:28.05.2023
Publisher:MDPI
Year of publishing:2023
Number of pages:Str. 1-22
Numbering:Letn. 11, Št. 11, št. članka 2484
PID:20.500.12556/DKUM-87120 New window
UDC:004.5
ISSN on article:2227-7390
COBISS.SI-ID:154543107 New window
DOI:10.3390/math11112484 New window
Publication date in DKUM:20.02.2024
Views:322
Downloads:32
Metadata:XML DC-XML DC-RDF
Categories:Misc.
:
DUGONIK, Jani, SEPESY MAUČEC, Mirjam, VERBER, Domen and BREST, Janez, 2023, Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation. Mathematics [online]. 2023. Vol. 11, no. Št. 11,  članka 2484, p. 1–22. [Accessed 1 April 2025]. DOI 10.3390/math11112484. Retrieved from: https://dk.um.si/IzpisGradiva.php?lang=eng&id=87120
Copy citation
  
Average score:
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
(0 votes)
Your score:Voting is allowed only for logged in users.
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:Mathematics
Shortened title:Mathematics
Publisher:MDPI AG
ISSN:2227-7390
COBISS.SI-ID:523267865 New window

Document is financed by a project

Funder:ARRS - Slovenian Research Agency
Project number:P2-0069
Name:Napredne metode interakcij v telekomunikacijah

Funder:ARRS - Slovenian Research Agency
Project number:P2-0041
Name:Računalniški sistemi, metodologije in inteligentne storitve

Funder:ARRS - Slovenian Research Agency
Project number:P2-0057
Name:Informacijski sistemi

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Licensing start date:28.05.2023

Secondary language

Language:Slovenian
Keywords:nevronsko strojno prevajanje, statistično strojno prevajanje, podobnost, klasifikacija, hibridno strojno prevajanje


Comments

Leave comment

You must log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica