Question about Recommended Method for Using Alignments

lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.

Apache License 2.0

936 stars 214 forks source link

I have a long recording split into shorter supervision segments, and I have obtained alignments.

When attaching alignments to supervision segments as AlignmentItem, is it recommended to use start time with respect to the start of the supervision segment, or the start of the entire recording?

Also, I have been studying on TemporalArray, but since TemporalArray is per-frame and my alignments are per-word, I am not sure how to use TemporalArray for alignments.

If there is a recipe that uses the Lhotse recommended way for alignments, from data preparation to dataset objects, please let me know too and I will start from there.

lhotse-speech / lhotse

Question about Recommended Method for Using Alignments #1230