zhaoyue-zephyrus / TeSTra

Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
Apache License 2.0
99 stars 6 forks source link

RGB-Only Baseline for Temporal Action Detection #4

Open AphroDatalyst opened 1 year ago

AphroDatalyst commented 1 year ago

I'd like to explore the feasibility of using the repository using only RGB images, without the dependency on flow kinetics data. I'm open to hearing any feedback or suggestions you may have.

Anirudh257 commented 1 year ago

You can modify the dataloaders and comment the motion/flow part and retrain the model again.

zhaoyue-zephyrus commented 1 year ago

Hi @ZohrehBayramalizadeh

Changing MODALITY: 'twostream' to MODALITY: 'visual' in the config will hopefully satisfy your request. Please see https://github.com/zhaoyue-zephyrus/TeSTra/blob/main/src/rekognition_online_action_detection/models/feature_head.py#L33 for all possible values. Feel free to re-open the comment if you still have any issue.

Anirudh257 commented 1 year ago

Hi @zhaoyue-zephyrus there are some more changes needed in addition to the one suggested by you in the config and in the dataloaders part.

Chenhongchang commented 8 months ago

Hi @zhaoyue-zephyrus there are some more changes needed in addition to the one suggested by you in the config and in the dataloaders part.

HI! WHAT THE MORE CHANGES ARE? COULD YOU SHARE? @Anirudh257

Anirudh257 commented 8 months ago

Hi @Chenhongchang change the dimensions.