Back to Home

Open Source Data

Grounded datasets for the global community.

Our Datasets

We provide open-source, socio-culturally grounded datasets to help researchers build more authentic and less biased AI models.

Topo-Text 1.0

A dataset linking regional dialects with topographical telemetry.

OralTrad 500

High-fidelity audio recordings of oral-first communities with semantic metadata.