-
We have previously imported this data set:
https://docs.google.com/spreadsheets/d/1-Zz1TCm1bVygngmWeXuskzlfQHN3ER6HERX0DC-VRI0/edit#gid=283848305
We should make it available in parallel to the n…
-
/Users/joro/Documents/Phd/UPF/voxforge/myScripts/AlignmentStep/doForceAligment.sh: line 113: 98769 Abort trap $HTK_34_PATH/HVite -l "'*'" -o SW -A -D -T 1 -b sil -C $PARENT_OF_INTERIM_AND…
-
I want to align my audio recordings with their corresponding transcripts. There are a lot of pauses and silences in my recordings. I want **multi-level alignment** (mainly word-level and segment/paragra…
-
It would be very helpful to have finer control over the way that SILNLP produces translations. It would be ideal to be able to specify what should happen with the data in each marker or group of marke…
-
## Story Explanation
### User Story
As an aligner, I want to see partial alignment progress so that I can tell how much of my effort has been completed.
As an aligner, I want to be able to get full credit for…
-
# Crosslingual phonological feature discrimination
Similar to existing SSLR classification probes ([Cormac English et al. 2022](https://aclanthology.org/2022.sigmorphon-1.9.pdf)), we evaluate whethe…
-
Certain lines represent properties of an utterance (e.g. `\txn`, `\sp`), while other lines are properties of words (e.g. `\w`, `\wlt`) or morphemes (e.g. `\m`, `\gl`). Clarify which lines are of which…
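
To make the distinction concrete, here is a minimal sketch of how the six markers named above could be grouped by level. The table `MARKER_LEVEL`, the level names, and the helpers `level_of` and `group_by_level` are hypothetical names introduced for illustration only; they are not part of any existing tool, and markers not listed here are simply reported as `unknown`.

```python
# Hypothetical mapping from backslash markers to the level they describe.
# Only the six markers mentioned in the text are covered; the level names
# ("utterance", "word", "morpheme") are illustrative assumptions.
MARKER_LEVEL = {
    r"\txn": "utterance",
    r"\sp": "utterance",
    r"\w": "word",
    r"\wlt": "word",
    r"\m": "morpheme",
    r"\gl": "morpheme",
}


def level_of(line: str) -> str:
    """Return the level of a marker line, or 'unknown' for unlisted markers."""
    parts = line.split(maxsplit=1)
    marker = parts[0] if parts else ""
    return MARKER_LEVEL.get(marker, "unknown")


def group_by_level(lines):
    """Group non-empty marker lines by level, preserving their input order."""
    grouped = {}
    for line in lines:
        if not line.strip():
            continue  # skip blank separator lines
        grouped.setdefault(level_of(line), []).append(line)
    return grouped
```

Calling `group_by_level` on the lines of one record would return a dictionary keyed by level, which makes any marker that still lacks a documented level easy to spot under `unknown`.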
-
Hi,
I am using this library for speaker diarization. I have tested it on a few of my audio files, and the transcriptions from OpenAI's Whisper look much better than faster-whisper's.
1) Is there a way to replac…
-
I tried the demo for bullet points and numbering: https://github.com/dolanmiu/docx/blob/master/demo/3-numbering-and-bullet-points.ts. The bullet points seem to work just fine, but while the text fo…
-
In the OCR-D workflow, there are several steps that likely require input or output to be able to represent __word segmentation ambiguity__ and confidence values of word boundaries (whitespace characte…