Each collection may have corpora from different languages, and each corpus may also have transcripts in different languages (majority of the cases have a mixture of monolingual and bilingual transcripts). I'm thinking at a general data-querying level (shiny apps + API) we may have to allow for querying for a particular language in a collection or corpus. Collections are not completely irrelevant though because the Clinical collection contains children who speak English but with language disorders.
Each collection may have corpora from different languages, and each corpus may also have transcripts in different languages (majority of the cases have a mixture of monolingual and bilingual transcripts). I'm thinking at a general data-querying level (shiny apps + API) we may have to allow for querying for a particular language in a collection or corpus. Collections are not completely irrelevant though because the Clinical collection contains children who speak English but with language disorders.
I was planning to put all the collections on childes-db: http://childes.talkbank.org/access/