Open research infrastructure

Reproducible stimulus and data preparation.

A modular toolkit for linguistic enrichment, sampling, and audit-ready export. Start with wordlists in Persian, English, and Japanese.

Enrich

Grapheme-to-phoneme (G2P) transcriptions, syllables, part-of-speech (POS) tags, and length features, appended to your wordlist.
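
As a rough illustration of what appending features to a wordlist can look like, here is a minimal sketch. The function name `enrich` and the column name `n_chars` are hypothetical and not taken from the toolkit; only a simple length feature is shown, since G2P, syllable, and POS features would need language-specific resources.

```python
def enrich(words, features):
    """Return one row per word, with each feature appended as a new column.

    `features` maps a column name to a function of the word.
    Both names are illustrative, not the toolkit's actual API.
    """
    rows = []
    for word in words:
        row = {"word": word}
        for name, fn in features.items():
            row[name] = fn(word)
        rows.append(row)
    return rows


# Length in Unicode code points; this works across scripts, but it is
# not the same as counting phonemes, morae, or syllables.
rows = enrich(["cat", "ねこ"], {"n_chars": len})
```

G2P or POS taggers could be plugged in the same way, as additional entries in `features`.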

Sample

Stratified selection and synchronized shuffle for balanced sets.
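
The two sampling ideas can be sketched as follows. This is an illustration under assumptions, not the toolkit's implementation: the function names are hypothetical, and "synchronized shuffle" is read here as applying one shared permutation to several parallel lists so that their rows stay aligned.

```python
import random
from collections import defaultdict


def stratified_sample(rows, key, per_stratum, seed):
    """Draw up to `per_stratum` rows from each group defined by `key`."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for row in rows:
        strata[row[key]].append(row)
    selected = []
    # Visit strata in sorted order so the result depends only on the seed.
    for label in sorted(strata):
        group = strata[label]
        selected.extend(rng.sample(group, min(per_stratum, len(group))))
    return selected


def synchronized_shuffle(seed, *columns):
    """Shuffle parallel lists with a single shared permutation."""
    if not columns:
        return []
    if len({len(col) for col in columns}) != 1:
        raise ValueError("all columns must have the same length")
    order = list(range(len(columns[0])))
    random.Random(seed).shuffle(order)
    return [[col[i] for i in order] for col in columns]
```

Passing the same seed reproduces the same selection and the same ordering, which is what makes the resulting sets auditable.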

Reproduce

Every run captures seed, parameters, and version in a portable manifest.
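
A portable manifest of this kind could be as simple as a JSON document. The sketch below assumes a flat structure with the three fields named above; the function names and the exact field names are hypothetical, not the toolkit's actual format.

```python
import json


def build_manifest(seed, parameters, version):
    """Collect what is needed to repeat a run (field names are assumed)."""
    return {"seed": seed, "parameters": dict(parameters), "version": version}


def dump_manifest(manifest):
    """Serialize with sorted keys so identical runs give identical text."""
    return json.dumps(manifest, sort_keys=True, indent=2, ensure_ascii=False)


def load_manifest(text):
    """Read a manifest back from its JSON text."""
    return json.loads(text)
```

Sorting keys keeps the file stable under diffing and hashing, which helps when a manifest is archived alongside the exported stimuli.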