langcog / wordbank

open repository of children's vocabulary data
http://wordbank.stanford.edu
GNU General Public License v2.0
64 stars 10 forks source link

Finnish data #252

Closed alvinwmtan closed 9 months ago

alvinwmtan commented 2 years ago

From #232 – data to look at in the future:

Finnish.Long.form.versions.of.the.CDI.WG..at.12.and.at.15.months.of.age._.Expressive.words.csv Finnish.Short.form.versions.of.the.CDI..Toddler.version..at.18.and.at.24.months._.Expressive.words.csv Finnnish.Short.form.versions.of.the.CDI..Infant.version..at.9.12.15.18.months._.Expressive.words.csv

WG: only contains production values (no comprehension) Short forms: to return to in the future

alvinwmtan commented 2 years ago

Note: WS Short Form resolved in #262; pending WG data updates

alvinwmtan commented 1 year ago

WG Production: [FinnishWGProd].csv [FinnishWGProdShort].csv FinnishWGProd_Stolt_data.csv FinnishWGProd_Stolt_fields.csv FinnishWGProd_Stolt_values.csv FinnishWGProdShort_Stolt_data.csv FinnishWGProdShort_Stolt_fields.csv FinnishWGProdShort_Stolt_values.csv

Note that these forms are production-only.

HenryMehta commented 12 months ago

@alvinwmtan should there be WG and WGShort or should they also include Prod?

alvinwmtan commented 12 months ago

@HenryMehta they include prod because these are prod-only forms (whereas normal WGs have both prod and comp); we decided to make new forms so they wouldn't be confused with the existing forms

HenryMehta commented 12 months ago

@alvinwmtan Sorry, I'm also getting invalid characters in the file and I need item columns - I can copy definition to item if that fits

alvinwmtan commented 11 months ago

@HenryMehta Fixed:

[FinnishWGProd].csv [FinnishWGProdShort].csv

HenryMehta commented 11 months ago

@alvinwmtan I still have an issue with FinnishWGProd_Stolt_fields.csv

alvinwmtan commented 11 months ago

@HenryMehta Try this: FinnishWGProd_Stolt_fields.csv

HenryMehta commented 11 months ago

@alvinwmtan deploying to dev now

alvinwmtan commented 10 months ago

Citation: Stolt, S. & Vehkavuori, S-M. 2018. Sanaseula. Finnish short form versions of the MacArthur Communicative Development Inventories. Jyväskylä: Niilo Mäki Instituutti.

Contributor: Suvi Stolt, University of Helsinki.