Source-Word decomposition for neural machine translation. Nguyen, T., Le, H., & Pham, V. H. Mathematical Problems in Engineering, 2020. doi abstract bibtex End-to-end neural machine translation does not require us to have specialized knowledge of investigated language pairs in building an effective system. On the other hand, feature engineering proves to be vital in other artificial intelligence fields, such as speech recognition and computer vision. Inspired by works in those fields, in this paper, we propose a novel feature-based translation model by modifying the state-of-the-art transformer model. Specifically, the encoder of the modified transformer model takes input combinations of linguistic features comprising of lemma, dependency label, part-of-speech tag, and morphological label instead of source words. The experiment results for the Russian-Vietnamese language pair show that the proposed feature-based transformer model improves over the strongest baseline transformer translation model by impressive 4.83 BLEU. In addition, experiment analysis reveals that human judgment on the translation results strongly confirms machine judgment. Our model could be useful in building translation systems translating from a highly inflectional language into a noninflectional language.
@article{Pham2020,
title = {Source-{Word} decomposition for neural machine translation},
volume = {2020},
issn = {15635147},
doi = {10.1155/2020/4795187},
abstract = {End-to-end neural machine translation does not require us to have specialized knowledge of investigated language pairs in building an effective system. On the other hand, feature engineering proves to be vital in other artificial intelligence fields, such as speech recognition and computer vision. Inspired by works in those fields, in this paper, we propose a novel feature-based translation model by modifying the state-of-the-art transformer model. Specifically, the encoder of the modified transformer model takes input combinations of linguistic features comprising of lemma, dependency label, part-of-speech tag, and morphological label instead of source words. The experiment results for the Russian-Vietnamese language pair show that the proposed feature-based transformer model improves over the strongest baseline transformer translation model by impressive 4.83 BLEU. In addition, experiment analysis reveals that human judgment on the translation results strongly confirms machine judgment. Our model could be useful in building translation systems translating from a highly inflectional language into a noninflectional language.},
journal = {Mathematical Problems in Engineering},
author = {Nguyen, Thien and Le, Hoai and Pham, Van Huy},
year = {2020},
}
Downloads: 0
{"_id":"AmXJxaw3pKFgEbHkh","bibbaseid":"nguyen-le-pham-sourceworddecompositionforneuralmachinetranslation-2020","author_short":["Nguyen, T.","Le, H.","Pham, V. H."],"bibdata":{"bibtype":"article","type":"article","title":"Source-Word decomposition for neural machine translation","volume":"2020","issn":"15635147","doi":"10.1155/2020/4795187","abstract":"End-to-end neural machine translation does not require us to have specialized knowledge of investigated language pairs in building an effective system. On the other hand, feature engineering proves to be vital in other artificial intelligence fields, such as speech recognition and computer vision. Inspired by works in those fields, in this paper, we propose a novel feature-based translation model by modifying the state-of-the-art transformer model. Specifically, the encoder of the modified transformer model takes input combinations of linguistic features comprising of lemma, dependency label, part-of-speech tag, and morphological label instead of source words. The experiment results for the Russian-Vietnamese language pair show that the proposed feature-based transformer model improves over the strongest baseline transformer translation model by impressive 4.83 BLEU. In addition, experiment analysis reveals that human judgment on the translation results strongly confirms machine judgment. Our model could be useful in building translation systems translating from a highly inflectional language into a noninflectional language.","journal":"Mathematical Problems in Engineering","author":[{"propositions":[],"lastnames":["Nguyen"],"firstnames":["Thien"],"suffixes":[]},{"propositions":[],"lastnames":["Le"],"firstnames":["Hoai"],"suffixes":[]},{"propositions":[],"lastnames":["Pham"],"firstnames":["Van","Huy"],"suffixes":[]}],"year":"2020","bibtex":"@article{Pham2020,\n\ttitle = {Source-{Word} decomposition for neural machine translation},\n\tvolume = {2020},\n\tissn = {15635147},\n\tdoi = {10.1155/2020/4795187},\n\tabstract = {End-to-end neural machine translation does not require us to have specialized knowledge of investigated language pairs in building an effective system. On the other hand, feature engineering proves to be vital in other artificial intelligence fields, such as speech recognition and computer vision. Inspired by works in those fields, in this paper, we propose a novel feature-based translation model by modifying the state-of-the-art transformer model. Specifically, the encoder of the modified transformer model takes input combinations of linguistic features comprising of lemma, dependency label, part-of-speech tag, and morphological label instead of source words. The experiment results for the Russian-Vietnamese language pair show that the proposed feature-based transformer model improves over the strongest baseline transformer translation model by impressive 4.83 BLEU. In addition, experiment analysis reveals that human judgment on the translation results strongly confirms machine judgment. Our model could be useful in building translation systems translating from a highly inflectional language into a noninflectional language.},\n\tjournal = {Mathematical Problems in Engineering},\n\tauthor = {Nguyen, Thien and Le, Hoai and Pham, Van Huy},\n\tyear = {2020},\n}\n","author_short":["Nguyen, T.","Le, H.","Pham, V. H."],"key":"Pham2020-1-1-1-1-1-1-1-1-1-1-1-1","id":"Pham2020-1-1-1-1-1-1-1-1-1-1-1-1","bibbaseid":"nguyen-le-pham-sourceworddecompositionforneuralmachinetranslation-2020","role":"author","urls":{},"metadata":{"authorlinks":{}}},"bibtype":"article","biburl":"https://api.zotero.org/groups/2168152/items?key=VCdsaROd5deDY3prqqG8kI0c&format=bibtex&limit=100","dataSources":["syJjwTDDM32TsM2iF","QwrFbRJvXF69SEShv","HbngRCZLbLed2q9QT","LtEFvT85hYpNg4Esp","iHfnnAr7wKJJxkNMt","PrvBTxn4Zgeep29e5","78Yd9ZHcx783Wkffe","SKRhTA7ok4L4waPkZ","GfrMfnKTkYdcYTRsy","RqqCdXGEyWH4dZ76k","cbiwaQPQJSZeJDDY9","2Jak7xK39ytqcgqQ4","CDfDBPD6CDScj6Ty4","WgiCycoQjRx6KArBy","KBdipwowTNXWiKqYd","yjd6eECyb3TYZpZ3R","D9jmZ7aoHfJnYQ4ES","R8dLFAvyQ2oFRijDJ","dc6SzEK4S9LfC3XpA","kGWABmrDfhF29uibh","YE9GesxGLCsBc3vvC","v3qfuosZ66nvD85FK","BSxBG5ms26R2teZn9"],"keywords":[],"search_terms":["source","word","decomposition","neural","machine","translation","nguyen","le","pham"],"title":"Source-Word decomposition for neural machine translation","year":2020}