ubicomplab / rPPG-Toolbox

rPPG-Toolbox: Deep Remote PPG Toolbox (NeurIPS 2023)
https://arxiv.org/abs/2210.00716

Regarding the issue of missed frames #265

Closed 408550969 closed 2 months ago

408550969 commented 3 months ago

Hi, I have a video recorded at 30 frames per second, but it suffers from frame dropping. Assuming it drops one frame out of every 30, should I set FS to 29 or to 30 in the training config?

yahskapar commented 3 months ago

You should be fine with setting FS to 30. 29 versus 30 is a fairly minor discrepancy and shouldn't affect your results significantly. If you want, you can look into video interpolation methods to correct the frame dropping, maybe something like this using ffmpeg. Again, I should stress that for the task of rPPG you really shouldn't worry about a discrepancy this small.
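To put a rough number on that discrepancy: if one frame in every 30 is dropped, the effective rate is 29 fps, and a frequency-based heart-rate estimate that assumes 30 fps is scaled by 30/29. The snippet below is only a sketch of that arithmetic. The function names are hypothetical and not part of the toolbox, and it says nothing about label alignment during training.

```python
def effective_fs(nominal_fs: float, dropped_per_second: float) -> float:
    """Frames actually delivered per second of wall-clock time."""
    return nominal_fs - dropped_per_second


def hr_with_assumed_fs(true_hr_bpm: float, true_fs: float, assumed_fs: float) -> float:
    """Heart rate a spectral estimator would report if frames captured at
    true_fs are interpreted as if they were sampled at assumed_fs."""
    return true_hr_bpm * assumed_fs / true_fs


fs_eff = effective_fs(30.0, 1.0)  # one dropped frame per second
hr = hr_with_assumed_fs(72.0, fs_eff, 30.0)
print(f"effective FS: {fs_eff:.1f} fps")
print(f"72 bpm read at FS=30: {hr:.2f} bpm ({100 * (hr / 72.0 - 1):.1f}% high)")
```

Under these assumptions the bias is a few percent of the true heart rate.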

408550969 commented 3 months ago

Thanks, I have another question. Can a delay between the images and the labels cause the model to fail to converge? I collected some videos and found that, when using EfficientPhys, the model cannot converge once the offset between the images (with camera delay already compensated) and the labels exceeds ±133 ms. Is rPPG very sensitive to latency?

yahskapar commented 3 months ago

Hi @408550969,

There's definitely a possibility that some combinations of models and datasets are more sensitive to synchronization error, which in turn could lead to a failure to converge. Can you share more details (e.g., how you identified the model being unable to converge, such as plots of the training and loss curves which can be produced by this toolbox)?

Usually this sensitivity has more to do with the loss function than a specific model from what I understand. If correcting the synchronization error is challenging, you could try using loss functions such as the Maximum Cross-Correlation (MCC) as suggested by this paper.
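As a rough illustration of why a cross-correlation-based loss tolerates a fixed delay, here is a simplified time-domain sketch. It is not the formulation from the referenced paper and not code from this toolbox. The function names, the `max_lag` parameter, and the synthetic 72 bpm signal are all assumptions made for the example.

```python
import math


def zero_mean_unit_norm(x):
    """Subtract the mean and scale to unit Euclidean norm."""
    mean = sum(x) / len(x)
    centred = [v - mean for v in x]
    norm = math.sqrt(sum(v * v for v in centred)) or 1.0
    return [v / norm for v in centred]


def neg_max_cross_corr(pred, label, max_lag):
    """Return (loss, best_lag): loss is minus the largest normalised
    cross-correlation found over integer lags in [-max_lag, max_lag]."""
    p, y = zero_mean_unit_norm(pred), zero_mean_unit_norm(label)
    n = len(p)
    best_corr, best_lag = -float("inf"), 0
    for lag in range(-max_lag, max_lag + 1):
        # Correlate p[t] with y[t + lag] over the overlapping samples only.
        corr = sum(p[t] * y[t + lag] for t in range(max(0, -lag), min(n, n - lag)))
        if corr > best_corr:
            best_corr, best_lag = corr, lag
    return -best_corr, best_lag


fs = 30                                   # frames per second
t = [i / fs for i in range(300)]          # 10 s of samples
pred = [math.sin(2 * math.pi * 1.2 * ti) for ti in t]  # synthetic 72 bpm pulse
delay = 4                                 # 4 frames, about 133 ms at 30 fps
label = [math.sin(2 * math.pi * 1.2 * (ti - delay / fs)) for ti in t]

loss, lag = neg_max_cross_corr(pred, label, max_lag=10)
print(f"loss={loss:.3f}, best lag={lag} frames")
```

Because the loss takes the best match over a window of lags, a constant offset inside that window barely changes its value, whereas a sample-wise loss would penalise the same offset heavily. The lag that maximises the correlation can also serve as an estimate of the synchronization error.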

408550969 commented 2 months ago

I identified it by checking the MAE on both the training set and the test set: when the delay is large, the MAE reaches the tens not only on the test set but also on the training set. Thanks, I will consider using MCC as the loss.

408550969 commented 2 months ago

I have another question: does video encoding have a significant impact on accuracy? For example, if I use lossy compression, how much does the MAE differ from what I would get with lossless images?

yahskapar commented 2 months ago

Hi @408550969,

I recommend reading section 10.5 (titled 'Video Compression') of this excellent review article, as well as any of the works cited in that section that sound interesting to you and relate to the effects of video compression on the task of rPPG. To put it briefly, and based on my understanding: yes, there is a difference, and I'd expect compression with stronger temporal effects to do more to worsen your SNR and possibly your MAE.

408550969 commented 2 months ago

Thanks!