speech-corpora Search Results

humlab-swedeb/swedeb-api #20

Download larger datasets

It should be possible to download larger datasets. To decide: when to do download and when to direct to github repo of data or other pre-compiled datasets? Should there be an upper limit to how much d…

rebeckahw updated 3 weeks ago

PhonologicalCorpusTools/CorpusTools #310

Speech corpora: Allow editing of Speakers

mmcauliffe updated 9 years ago

unitedstates/unitedstates.github.io #17

Feature Suggestion: Congressional Speech Corpora Builder

I've been working on a project at [this repo](https://github.com/Plaba/US-Congress-Corpora-Builder). This downloads the congressional transcripts from congress.gov and converts them to text. Since…

Plaba updated 4 years ago

lwang114/UnsupTTS #3

Speech_Audio Alignment

Hi. I am trying to understand you approach and I still don't quite see how alignments are done for unrelated text and speech corporas. Could you please explain that and point out the files in the code…

Curiosci updated 4 months ago

dracor-org/georgdracor #1

xml:ids SHOULD be latin characters

In the current test file there are already attributes `@xml:id`s for characters `` in the ``. They are in Georgina script, which seems not be be a problem for the wellformedness of the XML though. I…

ingoboerner updated 3 days ago

OlegBaskov/language-learning #35

Implement corpora cleanup for baseline tests

Corpora word space cleanup for larger corpora (Child Directed Speech, Gutenberg Children Books). Clean Gutenberg Children corpus to ~ 12,000 words to get PA/PQ in reasonable time.

OlegBaskov updated 5 years ago

CentreForDigitalHumanities/I-analyzer #726

Include citation info with documents

Question from Jo Guldi: > What about including a recommended citation format (or series of formats) for each speech? I like this a lot! Including citation information is probably relevant for no…

lukavdplas updated 8 months ago

PhonologicalCorpusTools/CorpusTools #377

Allow for exporting of discourses

Allow discourses and spontaneous speech corpora in general to be exported from PCT

mmcauliffe updated 7 years ago

clarin-eric/VLO #374

Make "include only collection resources" more prominent

Both Henk and Darja indicated that the results in the VLO are often overshadowed by records located in the lower sections of the CMDI hierarchy (e.g. "sessions" in speech corpora). There is the _Only …

dietervu updated 3 months ago

UAlbertaALTLab/korp-config #5

korp to speechdb

Access speech db exact entries from searches in korp.

fbanados updated 1 month ago

594 results for speech-corpora

594 results
for speech-corpora