Northeast India · AI Deployment Lab · Shillong, Meghalaya

AI that works
where it matters.

We build and deploy language AI for Northeast India’s 220+ indigenous languages, from government services to offline community tools.

Language pairs
0 +
Open models
0 +
Target Langauges
0 +
Open datasets
0 +
Production-ready · Offline-capable · CERT-In aligned · DPDP Act compliant · GIGW 3.0 ready

Who we work with

01

Government & Public Sector

Multilingual citizen services, document processing, and offline-capable public tools — compliant with Indian data governance standards.

Explore Government Solutions →

02

Developers & Startups

REST APIs, model weights, and integration support. Build NE-language products on our stack. Commercial licensing via KREN APIs.

Start Building →

03

Researchers & Academics

Open datasets, pretrained models, and co-authorship opportunities at ACL, EACL, EMNLP. All models released under CC-BY-4.0.

View publications & datasets →

The NE-Stack

NE-ASR

8 languages · Whisper-medium fine-tuned

NE-OCR

9 scripts · 95.51% accuracy · ViTSTR-Base unified model

Kren – M

Bilingual (Khasi–English), Generative Language Model. ~2.6B params

NE-BERT

149M params · 9 NE languages · ModernBERT

Also open-source: NE-Trans · NE-CLIP · NE-LID · NE-MultiSpeech · NE-SpeechEmbed

Built for the region

220+

Real deployment.
Not just research output.

Every model we build is designed for Northeast India’s real conditions — low connectivity, regional scripts, and multilingual contexts that global AI ignores.

  • Offline-capable models for areas with limited internet access.
  • Government document processing in Khasi, Garo, Mizo, and Meitei.
  • Voice interfaces that work in local languages, not transliterated English.
  • Data collected from communities with consent and shared benefit.

Built in the open

Open source. For everyone.

All our foundational models are freely available on HuggingFace and listed on AIKosh, India’s National AI Repository. We believe language AI should serve the communities it’s built from.

  • Free to download and use: Research, education, and product development; no restrictions.
  • Peer-reviewed research: Every model backed by a published or submitted paper at ACL, EACL, EMNLP, or similar.
  • Community datasets: Collected with consent, annotated locally, and shared openly on Huggingface and AIKosh.
  • Commercial licensing available: Building a product? We offer commercial licensing and integration support via APIs and on prem deployments.

Ready to deploy AI
in your language?

Government departments, NGOs, developers, researchers; we work with all of them. Let’s talk about what you’re building.