Open-Source Datasets

High-quality, open-source data is the foundation of inclusive and responsible AI. Explore our curated datasets for Northeast India.

Northeast India: Districts and Villages

A comprehensive dataset of all districts and villages across the eight states of Northeast India, perfect for geographical and demographic analysis.

49,000+ villages

Northeast India: Tribes and Subtribes

A curated list of over 200 tribes and subtribes from the eight states of Northeast India, essential for cultural and anthropological research.

200+ tribes & Subtribes

Have a Dataset to Share?

The future of regional AI depends on community collaboration. If you have a dataset for a Northeast Indian language or culture that you’d like to share, we would love to hear from you.