langcog / wordbank

open repository of children's vocabulary data
http://wordbank.stanford.edu
GNU General Public License v2.0

Japanese CAT #296

Closed mcfrank closed 5 months ago

mcfrank commented 1 year ago

Sho Tsuji is pulling together the datasets.

alvinwmtan commented 7 months ago

The data are complete; I will upload the formatted files soon.

alvinwmtan commented 6 months ago

@HenryMehta files ready:

[Japanese_WG].csv
[Japanese_WS].csv
JapaneseWG_Tsuji_data.csv
JapaneseWG_Tsuji_fields.csv
JapaneseWG_Tsuji_values.csv
JapaneseWS_Hagihara_data.csv
JapaneseWS_Hagihara_fields.csv
JapaneseWS_Hagihara_values.csv
JapaneseWS_Minagawa_data.csv
JapaneseWS_Minagawa_fields.csv
JapaneseWS_Minagawa_values.csv
JapaneseWS_Tsuji_data.csv
JapaneseWS_Tsuji_fields.csv
JapaneseWS_Tsuji_values.csv

HenryMehta commented 6 months ago

@alvinwmtan Lines 778-780 in [Japanese_WS].csv and the final 3 lines in [Japanese_WG].csv are not valid. I'm not sure whether you just want them deleted. Please advise.

alvinwmtan commented 6 months ago

@HenryMehta Yes please, thanks
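
For reference, the cleanup agreed on here (dropping lines 778-780 from [Japanese_WS].csv and the final 3 lines from [Japanese_WG].csv) could be sketched roughly as below. This is only an illustration, not the script that was actually run: the helper names `drop_lines`, `drop_trailing`, and `clean_file` are hypothetical, and it assumes the line numbers are 1-indexed physical lines of the file (header included) and that no quoted field spans multiple lines.

```python
def drop_lines(lines, first, last):
    """Return a copy of `lines` without 1-indexed lines first..last (inclusive)."""
    if first < 1 or last < first:
        raise ValueError("expected 1 <= first <= last")
    return lines[:first - 1] + lines[last:]


def drop_trailing(lines, n):
    """Return a copy of `lines` without its final n lines."""
    if n < 0:
        raise ValueError("n must be non-negative")
    return lines[:max(len(lines) - n, 0)]


def clean_file(path, line_range=None, trailing=0):
    """Rewrite the file at `path` in place, minus the requested lines.

    `line_range` is an optional (first, last) pair applied before
    `trailing`, so both refer to the original line numbering only when
    the range does not overlap the trailing lines.
    """
    # newline="" keeps the file's original line endings untouched.
    with open(path, encoding="utf-8", newline="") as f:
        lines = f.read().splitlines(keepends=True)
    if line_range is not None:
        lines = drop_lines(lines, *line_range)
    lines = drop_trailing(lines, trailing)
    with open(path, "w", encoding="utf-8", newline="") as f:
        f.write("".join(lines))


# Hypothetical usage, assuming the files sit in the working directory:
# clean_file("[Japanese_WS].csv", line_range=(778, 780))
# clean_file("[Japanese_WG].csv", trailing=3)
```

Working on physical lines rather than parsed CSV rows keeps the remaining lines byte-for-byte unchanged, at the cost of the multi-line-field assumption noted above.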

HenryMehta commented 6 months ago

@alvinwmtan For Japanese WG, I needed to add the categories sounds2, conversations, and others to the list of categories in Wordbank. I gave each of these a lexical category and class of "other". If you want something different, let me know.

HenryMehta commented 6 months ago

All deployed to dev

alvinwmtan commented 6 months ago

@HenryMehta Great, looks good in dev!

alvinwmtan commented 6 months ago

@HenryMehta citation here:

Hiromichi Hagihara, Monica Barbir, Mikako Ishibashi, Yasuhiro Kanakogi, Masaharu Kato, Irena Lovcevic, Youtao Lu, Yasuyo Minagawa, Yusuke Moriguchi, Masa-aki Sakagami, Yuta Shinya, Hiroki Yamamoto, & Sho Tsuji (2023). A sharable merged dataset on Japanese children’s vocabulary measures using the Japanese MacArthur–Bates Communicative Development Inventory. https://doi.org/10.17605/osf.io/s5ydw

HenryMehta commented 5 months ago

@alvinwmtan Is this the citation for all 4 datasets? And is Sho the contributor for all 4?

alvinwmtan commented 5 months ago

@HenryMehta Yes, this is the citation for all 4 datasets, and I think we can put Sho as the contributor for all 4.