doc-doc / CoVGT

Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
Apache License 2.0
17 stars 1 forks source link

Experiment on STAR dataset #5

Open kimia-cvengineer opened 11 months ago

kimia-cvengineer commented 11 months ago

Hello,

I want to conduct some experiments on STAR dataset and noticed that there are some parts in the code that you tried to load its data. I was wondering if I could have access to the files needed to load STAR dataset and extract its features, e.g.: if self.dset == 'star': self.vid_clips = load_file(osp.dirname(csv_path)+f'/clips_{self.mode}.json')

It would be great if you could also share how you sampled data into some clips based on the id (referring to the json file). Did you follow the same clip-wise sampling exists in the preprocess_features for every qid? ` def get_video_feat_star(self, video_name, qid, width=320, height=240):

    clips = self.vid_clips[qid]
    video_feature_path = f'/raid/jbxiao/data/star/'
    app_feats = []
    roi_feats, roi_bboxs = [], []
    for cid, clip in enumerate(clips):
        clip_feat, clip_rfeat, clip_rbbox = [], [], []
        for fid in clip:
            frame_feat_file = osp.join(video_feature_path, f'frame_feat/{video_name}/{fid:06d}.npy')
            frame_feat = np.load(frame_feat_file)
            clip_feat.append(frame_feat)

...

`

I sincerely appreciate your help.

doc-doc commented 11 months ago

Hi, please find the feature for STAR here.

kimia-cvengineer commented 11 months ago

Thank you so much @doc-doc. That's way much better and helpful.

Regrading the data, did you use the [Raw Videos from Charades(scaled to 480p) mp4] with the AG dump tool to extract the frames or just directly extracted features on [RGB frames at 24fps (76 GB)]? I am asking this question in a sense that, they have different fps (AG dump tools extract some samples given the annotation file) based on each videos original fps and I may not capture the frames you extracted the features out of them.

I appreaciate your assistance.

doc-doc commented 11 months ago

Hi, we use ffmpeg and decode each video ( or QA related segment for STAR) at 3pfs.

kimia-cvengineer commented 11 months ago

@doc-doc Thank you so much for your explanation. Did you use the original scale or the downscaled one (480p)?

doc-doc commented 11 months ago

It shoud be the original scale.

Baiiiiiiiiii commented 9 months ago

Hello, I am also interested in doing some experiments in STAR dataset. Could you share the feature for STAR again? The link above just expired. Thank you!

doc-doc commented 9 months ago

hi, please find the updated link here.

Baiiiiiiiiii commented 9 months ago

Thank you so much :)