Open-Source Datasets
High-quality, open-source data is the foundation of inclusive and responsible AI. Explore our curated datasets for Northeast India.
Northeast India: Districts and Villages
A comprehensive dataset of all districts and villages across the eight states of Northeast India, suited to geographical and demographic analysis.
49,000+ villages
Northeast India: Tribes and Subtribes
A curated list of over 200 tribes and subtribes from the eight states of Northeast India, essential for cultural and anthropological research.
200+ tribes & subtribes
Other Works & Technical Notes
Have a Dataset to Share?
The future of regional AI depends on community collaboration. If you have a dataset on a Northeast Indian language or culture that you’d like to share, we would love to hear from you.


