Working with Longer duration video

Yes, it's not optimal, see https://github.com/Telecommunication-Telemedia-Assessment/bitstream_mode3_p1204_3/issues/25 and https://github.com/Telecommunication-Telemedia-Assessment/bitstream_mode3_p1204_3/issues/16#issue-722268021 — a good optimization would be to stream the feature parsing output to line-delimited JSON files instead of one big array and then parse that step by step. Right now there are no resources to rework this part though.

Note that the use case of this model is for really short videos of 4–10 seconds length. Anything longer you should definitely split up. I think a poor man's solution would be to do the splitting manually via ffmpeg beforehand and then calling the tool on each file individually.

ffmpeg -i "$input_video" -c copy -f segment -segment_time 10 -reset_timestamps 1 "$output_directory/segment_%03d.mkv"

for segment_file in "$output_directory"/*.mkv; do
  # call P.1204.3 bitstream model and store in JSON
  # ... parse the JSON individually
done

Telecommunication-Telemedia-Assessment / bitstream_mode3_p1204_3

Working with Longer duration video #31