hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All
https://hpcaitech.github.io/Open-Sora/
Apache License 2.0
21.48k stars 2.06k forks source link

About data processing pipeline #386

Closed rongpan123 closed 3 months ago

rongpan123 commented 3 months ago

Will you disclose the thresholds for data filtering in data processing later(ocr,match score)

zhengzangw commented 3 months ago

Not for now since we use different thresholds for different datasets. We are still trying to work out which threshold is better.