AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations
-
Updated
Mar 29, 2023
AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations
A corpus of transcripts of Christmas messages and New Year's speeches from Denmark, France, Italy, Norway, Spain and the United Kingdom, with a comparative analysis.
Aligned multilingual corpus of Lancelot en prose, processed with Aquilign.
Scripts that were used to creative an interactive website displaying the stats for the Indic multilingual train corpus - Boli, developed by us
Created a multilingual training corpus across 15 Indian languages (including English) by compiling different sources
Add a description, image, and links to the multilingual-corpus topic page so that developers can more easily learn about it.
To associate your repository with the multilingual-corpus topic, visit your repo's landing page and select "manage topics."