HumanSignal / label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format
https://labelstud.io
Apache License 2.0
19.47k stars 2.42k forks source link

Waveform and audio not synchronised in label studio 1.7.2 #3924

Open CrisRodriguez opened 1 year ago

CrisRodriguez commented 1 year ago

Describe the bug In a speech to text annotation task, when listening to the audio it is unsync with the waveform. It happens with :

Additional info:

Screenshots

First image in 1.4.1 shows the sync between box and waveform. Second image in 1.7.2 shows the unsync between box and waveform, when listening, the audio is synced with the box, therefore it is the waveform that is unsynced. image

Environment (please complete the following information):

Thanks,

bmartel commented 1 year ago

Hi @CrisRodriguez 👋. A fix has already been made for this and will be coming in LabelStudio 1.7.3. For the time being, if you are able, you can workaround this and use the latest source version of LabelStudio. The 1.7.3 official release should be out within 1-2 weeks time.

bmartel commented 1 year ago

I will keep this ticket open for visibility, and close upon the official release of 1.7.3.

CrisRodriguez commented 1 year ago

Hi @bmartel,

Thanks for your message.

I got it. Meanwhile I am using the 1.4.x version that works pretty well for my less than 20-minute audios.

Thanks,

Cristian

Cristian

CrisRodriguez commented 1 year ago

Hello @makseq @bmartel,

Can u please confirm that this issue has been solved in 1.7.3, and if yes, close the issue ?

Thanks,

Cristian

bmartel commented 1 year ago

Hey @CrisRodriguez , I apologize for the late reply. So this is fixed in the 1.7.3 release, I just tested the build right now. However, I noticed something incorrect with the feature flag states for this build.

Currently during local testing of 1.7.3 release build, it appears to default to the old audio player (which would also work in your case, just with less performance and improvements). If you want the current (and fixed) audio player in the 1.7.3 release, you will have to set the following flag as an environment variable when running your LabelStudio instance.

ff_front_dev_2715_audio_3_280722_short=1

As well, the config would have to specify the following:

<Audio name="audio" value="$audio" decoder="webaudio" />

As it seems this builds feature flag state has decoder="ffmpeg" as the default option (the option which is producing the issue for you here).

Reference documentation: https://labelstud.io/tags/audio.html#Parameters

Again I apologize for this inconvenience, if this works for you I will close this issue.

mogwai commented 6 months ago

I'm also noticing this on browser and edge on version 1.12:

{
  "release": "1.12.0",
  "label-studio-os-package": {
    "version": "1.12.0",
    "short_version": "1.12",
    "latest_version_from_pypi": "1.12.0.post0",
    "latest_version_upload_time": "2024-05-06T19:31:45",
    "current_version_is_outdated": true
  },

  "label-studio-os-backend": {
    "message": "fix: LEAP-947: Check task when we get a response from ML (#5742)",
    "commit": "b2fb5819f214d06f9300bd6c3a7853c78bdd176d",
    "date": "2024/04/17 15:26:37",
    "branch": "",
    "version": "1.12.0+0.gb2fb581"
  },

  "label-studio-converter": {
    "version": "0.0.58"
  }
}

I've marked here the delay of when I hear the sond and where it is marked as starting. This is using the default html5 audio player and I've tried both decoding techniques.

The delay seems to get worse as the audio is played.

image