Research & Publications
Our work focuses on advancing the state of AI for the languages, cultures, and knowledge systems of Northeast India.
KhasiBERT: A Foundational Transformer Language Model for the Khasi Language
Architecture: RoBERTa-base
Parameters: ~110M
Corpus Size: 3.6M sentences (63M tokens)
Kren v1.0: A Publicly Documented Encoder-to-Decoder Generative Language Model
An encoder-to-decoder conversion of KhasiBERT that produces a generative language model for Khasi, an Indian language.
Other Works & Technical Notes
Join Us in Building Inclusive AI
MWirelabs invites researchers, educators, and developers to collaborate in shaping technology that reflects the world’s diversity.


