Add speech-to-text alignment, QC, word-to-TR alignment scripts + environment set up files

This PR adds the scripts and Python package versions to extract the language annotations for the CneuroMod naturalistic stimuli sets.

Codes were developed by @MarieStLaurent as part of the Algonouts project for the Friends dataset and refactored and adapted to be used for the rest of the @courtois-neuromod naturalistic datasets.

_speech-to-textAA.py: Extracts the transcripts from .mkv files of the movies, aligns them with the audio and extracts time stamps of each word using AssemblyAI
_caption_timingQC.ipynb: Takes word time stamps extracted by _speech-to-textAA.py and creates karaoke-style captions that play simultaneously over audio to check the quality of the alignment**
_text_alignAA.py : Finds the words feel into TR (fMRI repetition time) windows, and creates word chunks per each volume.
_requirement_caption_timingQC.txt: Includes pip freeze of the venv used for the QC virtual environment.
_requirement_speech_to_text_TRaligment.txt: Includes pip freeze of the venv used for the QC virtual environment.

courtois-neuromod / ds_prep

Add speech-to-text alignment, QC, word-to-TR alignment scripts + environment set up files #17