TLSPG: Transfer learning-based semi-supervised pseudo-corpus generation approach for zero-shot translation

Show simple item record

dc.contributor.author Kumar, Amit
dc.contributor.author Mundotiya, Rajesh Kumar
dc.contributor.author Pratap, Ajay
dc.contributor.author Singh, Anil Kumar
dc.date.accessioned 2023-04-19T05:02:04Z
dc.date.available 2023-04-19T05:02:04Z
dc.date.issued 2022-10
dc.identifier.issn 13191578
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/2101
dc.description This paper is submitted by the author of IIT (BHU), Varanasi, India en_US
dc.description.abstract Machine Translation (MT) has come a long way in recent years, but it still suffers from data scarcity issue due to lack of parallel corpora for low (or sometimes zero) resource languages. However, Transfer Learning (TL) is one of the directions widely used for low-resource machine translation systems to overcome this issue. Creating parallel corpus for such languages is another way of dealing with data scarcity, yet costly, time-consuming and laborious task. In order to avoid the above listed limitations of parallel corpus formation, we present a TL-based Semi-supervised Pseudo-corpus Generation (TLSPG) approach for zero-shot MT systems. It generates the pseudo corpus by exploiting the relatedness between low resource language pairs and zero-resource language pairs via TL approach. It is further empirically ascertained in our experiments that such relatedness helps improve the performance of zero-shot MT systems. Experiments on zero-resource language pairs show that our approach effectively outperforms the existing state-of-the-art models, yielding improvement of +15.56,+8.13,+3.98 and +2 BLEU points for Bhojpuri→Hindi, Magahi→Hindi, Hindi→Bhojpuri and Hindi→Magahi, respectively. en_US
dc.description.sponsorship Science and Engineering Research Board , IIT (BHU), Varanasi, India en_US
dc.language.iso en_US en_US
dc.publisher King Saud bin Abdulaziz University en_US
dc.relation.ispartofseries Journal of King Saud University - Computer and Information Sciences;Volume 34, Issue 9, Pages 6552 - 6563
dc.subject Transfer Learning en_US
dc.subject Zero-shot Translation en_US
dc.subject Semi-supervised en_US
dc.subject Machine Translation en_US
dc.title TLSPG: Transfer learning-based semi-supervised pseudo-corpus generation approach for zero-shot translation en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search in IDR


Advanced Search

Browse

My Account