Open ceewanna opened 1 year ago
I am no expert on this but FFmpeg simply copies the audio from the stream so the issue is likely your camera.
What you can try is to set audio_codec
under recorder
to something like aac
which will make FFmpeg re-encode the audio and maybe fix the pts issue
Every camera working now is aac audio and I have tried your suggestion and it made no difference.
I just found an issue where it discussed audio and video pts not synchronized. I am no expert on any subject here but what I can imagine is the synchronization if any will be at the time it created or it played. The other source of non-synchronization could be at the time the system received audio and video. However, when I put the rtsp on real-time stream I don't see any issue. Still this is a puzzle to me.
https://github.com/mpv-player/mpv/issues/9217
From this I tried playing some vdo files with vlc. It didn't seem to encounter such issue.
Both use_wallclock_as_timestamps
and avoid_negative_ts
are used in the default FFmpeg command when saving segments to disk.
When a recording is finished, Viseron concatenates these segments into a single mp4
.
Maybe the problem occurs during the concatenation.
What you can do is to play around with the concatenation command by setting output_args
under recorder
.
I have been observing the playback of vdo files for quite a while. There seems to be quite a number of these invalid audio pts. Initially I thought it was caused by wifi problem or gpu problem. However, after pinging and adding gpu card, the issue remains. These vdo files will often get hiccups or stumbled from time to time. I took a sample from mpv playback of a file in this case. There seems to be quite a number received packets dropped while recording the video file.
Playing: rec/ch30_barn/2023-08-06/174024.mp4 (+) Video --vid=1 () (hevc 2560x1440 19.569fps) (+) Audio --aid=1 () (aac 1ch 16000Hz) AO: [pulse] 16000Hz mono 1ch float VO: [gpu] 2560x1440 yuv420p AV: 00:00:00 / 00:00:46 (1%) A-V: 0.000 Invalid audio PTS: 0.391937 -> 0.589500 AV: 00:00:00 / 00:00:46 (2%) A-V: -0.118 Dropped: 1 Invalid audio PTS: 1.190500 -> 1.832250 AV: 00:00:02 / 00:00:46 (6%) A-V: -0.107 ct: -0.075 Dropped: 3 Invalid audio PTS: 2.984688 -> 3.218313 AV: 00:00:03 / 00:00:46 (7%) A-V: -0.113 ct: -0.068 Dropped: 3 Invalid audio PTS: 3.466625 -> 3.574625 AV: 00:00:03 / 00:00:46 (8%) A-V: -0.093 ct: -0.084 Dropped: 3 Invalid audio PTS: 3.790812 -> 3.893375 AV: 00:00:05 / 00:00:46 (11%) A-V: -0.081 ct: -0.130 Dropped: 3 Invalid audio PTS: 5.357875 -> 5.549500 AV: 00:00:06 / 00:00:46 (13%) A-V: -0.014 ct: -0.147 Dropped: 3 Invalid audio PTS: 6.430438 -> 7.059000 AV: 00:00:06 / 00:00:46 (13%) A-V: -0.014 ct: -0.149 Dropped: 3