Closed nagyrajmund closed 3 years ago
Another thing we have to consider here is the possibility of silence between words. The current implementation reuses the last word until a new one starts, but instead we should use the "silence" word embeddings between two words.
Let me investigate and fix that today.
As for the silence, we already have "silence" word embeddigns for them in the code just above:
curr_file_X_data = np.zeros((total_number_of_frames, 7, 769))
Hi, I think that the words get misaligned in the code snippet below:
https://github.com/Svito-zar/probabilistic-gesticulator/blob/1a5ac949720b42e9e8efd50b4df1b91110decfdf/my_code/data_processing/annotations/create_dataset.py#L274-L283
bisect(word_starts, time_st)
returns the index wheretime_st
should be inserted intoword_starts
so that it remains sorted. For example, ifword_starts = [0, 1, 2]
andtime_st = 0.2
, thenbisect(...)
will return the index 1.Therefore
curr_word_id
is always the index of the first word that starts aftertime_st
.