Research & Publications

Our work focuses on advancing the state of AI for the languages, cultures, and knowledge systems of Northeast India.

KhasiBERT: A Foundational Transformer Language Model for the Khasi Language

Architecture: RoBERTa-base Parameters: ~110M Corpus Size: 3.6M sentences (63M tokens)

Kren v1.0: A Publicly Documented Encoder-to-Decoder Generative Language Model

Encoder-to-decoder conversion producing a generative language model for an Indian language (Khasi), derived from KhasiBERT

Join Us in Building Inclusive AI

MWirelabs invites researchers, educators, and developers to collaborate in shaping technology that reflects the world’s diversity.