I am beginning to realize that as more transcript variants become available, there will probably need to be a header section in transcripts that describes which variant a transcript is.
"content_structure": "timestamped-sentences"
OR
"content_structure": "timestamped-words"
OR
"content_structure": "raw"
OR
"content_structure": "timestamped-annotated-sentences"
This will be a long opened issue as it dives pretty deep into metadata management practices and I could probably pull from literature some best practices.
I am beginning to realize that as more transcript variants become available, there will probably need to be a header section in transcripts that describes which variant a transcript is.
"content_structure": "timestamped-sentences" OR "content_structure": "timestamped-words" OR "content_structure": "raw" OR "content_structure": "timestamped-annotated-sentences"
This will be a long opened issue as it dives pretty deep into metadata management practices and I could probably pull from literature some best practices.