k2-fsa / libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
Apache License 2.0
183 stars 11 forks source link

Is there any speaker information in Libriheavy? #3

Open hjzzju opened 1 year ago

hjzzju commented 1 year ago

I want to know if a record id belongs to a single speaker

hjzzju commented 1 year ago

"recording": { "id": "small/100/sea_fairies_0812_librivox_64kb_mp3/01_baum_sea_fairies_64kb", Is all files under "small/100/sea_fairies_0812_librivox_64kb_mp3/01_baum_sea_fairies_64kb" belong to one speaker ?

pkufool commented 1 year ago

I think so, 100 is the speaker id, each cut has a speaker id.