dc.contributor.author |
Kumar, Amit |
|
dc.contributor.author |
Mundotiya, Rajesh Kumar |
|
dc.contributor.author |
Pratap, Ajay |
|
dc.contributor.author |
Singh, Anil Kumar |
|
dc.date.accessioned |
2023-04-19T05:02:04Z |
|
dc.date.available |
2023-04-19T05:02:04Z |
|
dc.date.issued |
2022-10 |
|
dc.identifier.issn |
13191578 |
|
dc.identifier.uri |
http://localhost:8080/xmlui/handle/123456789/2101 |
|
dc.description |
This paper is submitted by the author of IIT (BHU), Varanasi, India |
en_US |
dc.description.abstract |
Machine Translation (MT) has come a long way in recent years, but it still suffers from data scarcity issue due to lack of parallel corpora for low (or sometimes zero) resource languages. However, Transfer Learning (TL) is one of the directions widely used for low-resource machine translation systems to overcome this issue. Creating parallel corpus for such languages is another way of dealing with data scarcity, yet costly, time-consuming and laborious task. In order to avoid the above listed limitations of parallel corpus formation, we present a TL-based Semi-supervised Pseudo-corpus Generation (TLSPG) approach for zero-shot MT systems. It generates the pseudo corpus by exploiting the relatedness between low resource language pairs and zero-resource language pairs via TL approach. It is further empirically ascertained in our experiments that such relatedness helps improve the performance of zero-shot MT systems. Experiments on zero-resource language pairs show that our approach effectively outperforms the existing state-of-the-art models, yielding improvement of +15.56,+8.13,+3.98 and +2 BLEU points for Bhojpuri→Hindi, Magahi→Hindi, Hindi→Bhojpuri and Hindi→Magahi, respectively. |
en_US |
dc.description.sponsorship |
Science and Engineering Research Board , IIT (BHU), Varanasi, India |
en_US |
dc.language.iso |
en_US |
en_US |
dc.publisher |
King Saud bin Abdulaziz University |
en_US |
dc.relation.ispartofseries |
Journal of King Saud University - Computer and Information Sciences;Volume 34, Issue 9, Pages 6552 - 6563 |
|
dc.subject |
Transfer Learning |
en_US |
dc.subject |
Zero-shot Translation |
en_US |
dc.subject |
Semi-supervised |
en_US |
dc.subject |
Machine Translation |
en_US |
dc.title |
TLSPG: Transfer learning-based semi-supervised pseudo-corpus generation approach for zero-shot translation |
en_US |
dc.type |
Article |
en_US |