langcog / childes-db

A SQL interface for the CHILDES child language corpora
13 stars 5 forks source link

collection != language #26

Closed amsan7 closed 6 years ago

amsan7 commented 7 years ago

Each collection may have corpora from different languages, and each corpus may also have transcripts in different languages (majority of the cases have a mixture of monolingual and bilingual transcripts). I'm thinking at a general data-querying level (shiny apps + API) we may have to allow for querying for a particular language in a collection or corpus. Collections are not completely irrelevant though because the Clinical collection contains children who speak English but with language disorders.

I was planning to put all the collections on childes-db: http://childes.talkbank.org/access/

amsan7 commented 6 years ago

new childesr update allows for querying using language param. language column also available in transcript, participant, token, utterance tables