This plugin is still a work in progress. It currently supports automatic speaker diarization and phone transcription through the CMU Linguistic Annotation Backend server APIs. Users can also upload data to fine-tune the pre-trained phone recognizer (Allosaurus).
Note: The plugin requires Python 3.
Download the latest version of ELAN from here and install it:
wget https://www.mpi.nl/tools/elan/ELAN-XX_linux.tar.gz
tar xzf ELAN-XX_linux.tar.gz
Download a copy of this repo and unzip it. Copy the cmulab_elan_extension-main/ folder into ELAN's extensions dir (ELAN-XX/lib/app/extensions/).
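The copy step above can also be scripted. The following is only a minimal sketch, assuming the repo has already been unzipped and that Python 3.8 or newer is available. The function name install_extension and the example paths are hypothetical and not part of the plugin.

```python
import shutil
from pathlib import Path


def install_extension(unzipped_folder, extensions_dir):
    """Copy an unzipped extension folder into ELAN's extensions dir.

    Hypothetical helper: both arguments are paths chosen by the user.
    Returns the path of the installed folder.
    """
    src = Path(unzipped_folder)
    ext_dir = Path(extensions_dir)
    if not src.is_dir():
        raise FileNotFoundError(f"extension folder not found: {src}")
    if not ext_dir.is_dir():
        raise FileNotFoundError(f"extensions dir not found: {ext_dir}")
    dest = ext_dir / src.name
    # dirs_exist_ok (Python 3.8+) lets a re-run overwrite an older copy.
    shutil.copytree(src, dest, dirs_exist_ok=True)
    return dest


# Example call (paths are placeholders, adjust XX to your ELAN version):
# install_extension("cmulab_elan_extension-main",
#                   "ELAN-XX/lib/app/extensions")
```

Copying the folder itself, rather than its contents, keeps the folder name that ELAN's extensions dir is expected to contain.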
On macOS, ELAN should be installed in the /Applications/ELAN_XX directory, where XX is the name of the version.
After downloading and unzipping this repo, you should have a folder named cmulab_elan_extension-main containing the contents of this repo.
Right-click ELAN_XX and click "Show Package Contents", then copy your cmulab_elan_extension-main folder into ELAN_XX.app/Contents/app/extensions.
Note: The built-in Tk GUI library in the Apple-supplied Python 3 on some macOS versions (such as macOS 12 Monterey) has bugs that might prevent dialog boxes from displaying properly. In that case, please install the latest Python from python.org.
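To see which Python and Tk versions are actually in use before troubleshooting dialog problems, a small check like the one below may help. It is only a sketch: the function name describe_gui_environment is hypothetical, and this README does not state which Tk versions are affected, so the script only reports what it finds.

```python
import sys


def describe_gui_environment():
    """Report the running Python version and the Tk version, if any."""
    info = {
        "python": ".".join(str(part) for part in sys.version_info[:3]),
        "is_python3": sys.version_info[0] >= 3,
        "tk_version": None,
    }
    try:
        import tkinter  # standard-library Tk bindings; may be absent
        info["tk_version"] = tkinter.TkVersion
    except ImportError:
        # No Tk bindings available in this Python installation.
        pass
    return info


if __name__ == "__main__":
    for key, value in describe_gui_environment().items():
        print(f"{key}: {value}")
```

Run it with the same python3 interpreter that the plugin is launched with; a different interpreter on the same machine may report a different Tk version.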
Download a copy of this repo and unzip it. Copy the cmulab_elan_extension-main/ folder into ELAN's extensions dir (ELAN-XX/app/extensions/).
If the extension is not visible in the "Recognizers" tab or doesn't run after following the above instructions, please make sure that the contents of the zip file are extracted to:
ELAN-6-2/app/extensions/cmulab_elan_extension-main/
rather than:
ELAN-6-2/app/extensions/cmulab_elan_extension-main/cmulab_elan_extension-main/
Also, the downloaded cmulab_elan_extension-main.zip should be removed from the extensions directory ELAN-6-2/app/extensions/; otherwise ELAN tries to load the zip file rather than the extracted cmulab_elan_extension-main/ folder.
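The two problems described above (a doubly nested folder and a leftover zip file) can be checked with a short script. This is a minimal sketch, not part of the plugin: the function name check_extension_layout is hypothetical, and the extensions dir path must be adjusted to your ELAN install.

```python
from pathlib import Path

EXTENSION_NAME = "cmulab_elan_extension-main"


def check_extension_layout(extensions_dir, name=EXTENSION_NAME):
    """Return a list of problems found in ELAN's extensions dir.

    An empty list means none of the checked problems was found.
    """
    ext_dir = Path(extensions_dir)
    folder = ext_dir / name
    problems = []
    if not folder.is_dir():
        problems.append(f"missing folder: {folder}")
    elif (folder / name).is_dir():
        # The zip was extracted one level too deep.
        problems.append(f"nested folder: {folder / name}")
    zip_path = ext_dir / f"{name}.zip"
    if zip_path.exists():
        # ELAN may try to load the zip instead of the extracted folder.
        problems.append(f"leftover zip: {zip_path}")
    return problems


# Example call (path is a placeholder):
# print(check_extension_layout("ELAN-6-2/app/extensions"))
```

If the script reports a nested folder, move the contents of the inner folder up one level; if it reports a leftover zip, delete the zip file from the extensions directory.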
Start ELAN with the provided test audio file:
ELAN_6-3/bin/ELAN Downloads/test_file.wav &
Switch to the "Recognizers" tab, select "CMU Linguistic Annotation Backend" from the Recognizer dropdown list at the top, and click the "Start" button. If this is your first time using this plugin, you will be prompted to log in to the CMULAB backend server and get an access token (you can create an account or simply log in with an existing Google account).
More detailed instructions for each specific service (phone transcription, diarization etc.) can be found here
https://user-images.githubusercontent.com/2358298/165535030-1548fd36-53d2-4ac6-b815-ae6c742ab88b.mp4
Note: If the "Parameters" section in the "Recognizers" tab is too small, you can pop it out as a separate window by clicking the button below it, or resize it by dragging the section border. See the highlighted areas in the screenshot below: