-
Hello,
Thank you for your amazing work! I have some doubts when I am trying to train my own colmap dataset by SCGS.
Here's the thing:I want to model the whole scene(both dynamic and static) but not…
-
Hello! I notice in your code that the model's input remains consistent during training and inference, i.e., paired images `imgs`, paired labels `tgts`, and mask `bool_masked_pos`. During `forward()`, …
-
AssertionError: Could not infer task type from {'_name': 'av_hubert_pretraining', 'is_s2s': True, 'data': '/checkpoint/bshi/data/lrs3//exp/ls-hubert/tune-modality/all_tsv/', 'label_dir': '/checkpoint/…
-
# Describe the feature
**Motivation**
There is no implementaiton of SwinV2 for semantic segmentation
**Related resources**
The original implementation is only for Image Classification.
**Addi…
rznas updated
2 years ago
-
Currently I'm trying to adapt the [tutorial code](https://github.com/NielsRogge/Transformers-Tutorials/blob/master/LayoutLMv3/Fine_tune_LayoutLMv3_on_FUNSD_(HuggingFace_Trainer).ipynb) for LayoutLMv3 …
-
hello,
I was trying to retrain the conST model without using the trained weights conST_151673.pth, but i I encountered difficulties while performing the following step . So can you share the code ab…
-
Hello, I have read your paper 《Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation 》and find it interesting. I noticed that you said in the abstract
> The code w…
-
Sik-Ho Tang. [Review — BEiT: BERT Pre-Training of Image Transformers](https://sh-tsang.medium.com/review-beit-bert-pre-training-of-image-transformers-c14a7ef7e295).
-
### Model description
**The corresponding paper has been accepted by International Journal of Computer Vision (IJCV).**
We present a novel masked image modeling (MIM) approach, context autoencoder…
-
hello, for CLIP knowledge distilation paper, i.e.,A Unified View of Masked Image Modeling:
when the teacher is CLIP vit-large/14 for 196's input resolution, and the student is vit-base/16 for 224's i…