Machine Translation for Indian Languages

MTIL (2023)

Dataset

Training Data:
The primary source of parallel language pairs is Bharat Parallel Corpus Collection (BPCC), released by AI4Bharat.

Participants can access the dataset through the following link: https://ai4bharat.iitm.ac.in/bpcc
Participants are encouraged to add datasets of their choice, including parallel corpora and monolingual datasets, to train their models.