-
File "/export/scratch/ra63nev/lab/discretediffusion/OmniTokenizer/omnitokenizer.py", line 108, in __init__
spatial_depth=args.spatial_depth, temporal_depth=args.temporal_depth, causal_in_temporal…
-
1. (HMMR) Learning 3D Human Dynamics from Video (2019)
temporal encoder: **1D temporal** convolutional layers; the image features are **precomputed** for each frame, and the model predicts the current frame as well as the ±∆t frames (see the sketch after this item).
c…
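A minimal sketch of this precompute-then-convolve design, assuming PyTorch; the layer widths, depth, output dimension, and the class name `TemporalEncoder1D` are illustrative assumptions, not the paper's exact configuration:
```python
import torch
import torch.nn as nn

class TemporalEncoder1D(nn.Module):
    """Hypothetical sketch: 1D temporal convolutions over precomputed per-frame
    image features, with separate heads for the current and the ±∆t frames."""
    def __init__(self, feat_dim=2048, hidden=1024, num_layers=3, out_dim=85):
        super().__init__()
        layers, in_ch = [], feat_dim
        for _ in range(num_layers):
            layers += [nn.Conv1d(in_ch, hidden, kernel_size=3, padding=1),
                       nn.GroupNorm(32, hidden),
                       nn.ReLU(inplace=True)]
            in_ch = hidden
        self.temporal = nn.Sequential(*layers)
        self.head_curr = nn.Linear(hidden, out_dim)    # prediction for frame t
        self.head_past = nn.Linear(hidden, out_dim)    # prediction for frame t - ∆t
        self.head_future = nn.Linear(hidden, out_dim)  # prediction for frame t + ∆t

    def forward(self, feats):                     # feats: (B, T, feat_dim), precomputed
        x = feats.transpose(1, 2)                 # (B, feat_dim, T) for Conv1d
        x = self.temporal(x).transpose(1, 2)      # (B, T, hidden)
        return self.head_curr(x), self.head_past(x), self.head_future(x)

# usage: features are extracted once per frame by a frozen image encoder,
# so only the lightweight temporal model runs during training
feats = torch.randn(2, 20, 2048)
curr, past, future = TemporalEncoder1D()(feats)
```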
-
I think this is a new issue introduced by some recent update, as it was running fine before.
By the way, there is no problem with the CogVideo Sampler.
Init Pyramid Attention Broadcast. steps: 50. spatial broadcast: True…
-
1. PARE: Part Attention Regressor for 3D Human Body Estimation (2021)
image --> volumetric features (taken before the global average pooling) --> part branch: estimates attention weights + feature branch: performs S…
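A minimal sketch of this two-branch attention pooling, assuming PyTorch; the channel counts, the number of parts, and the class name `PartAttentionPooling` are illustrative assumptions rather than the paper's implementation:
```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PartAttentionPooling(nn.Module):
    """Hypothetical sketch: a part branch predicts one spatial attention map per
    body part, a feature branch predicts a feature volume, and per-part features
    are obtained by attention-weighted spatial pooling."""
    def __init__(self, in_ch=2048, feat_ch=256, num_parts=24):
        super().__init__()
        self.part_branch = nn.Conv2d(in_ch, num_parts, kernel_size=1)  # attention logits
        self.feat_branch = nn.Conv2d(in_ch, feat_ch, kernel_size=1)    # per-pixel features

    def forward(self, volumetric_feats):            # (B, in_ch, H, W), pre-GAP backbone output
        B, _, H, W = volumetric_feats.shape
        attn = self.part_branch(volumetric_feats)   # (B, P, H, W)
        attn = F.softmax(attn.flatten(2), dim=-1)   # normalize over spatial locations
        feats = self.feat_branch(volumetric_feats).flatten(2)    # (B, C, H*W)
        part_feats = torch.einsum('bph,bch->bpc', attn, feats)   # (B, P, C)
        return part_feats, attn.view(B, -1, H, W)

# usage: per-part features would then be regressed to per-part pose parameters
part_feats, attn_maps = PartAttentionPooling()(torch.randn(2, 2048, 8, 8))
```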
-
I came across Example 1: Bike Flow Prediction (Zero-shot scenario) in your paper, and I have some concerns regarding the classification of this task as “zero-shot.”
As I understand it, a zero-s…
-
1. TexturePose: Supervising Human Mesh Estimation with Texture Consistency (2019)
Texture map (texel): the corresponding UV map un-warps the template surface onto an image, A, which is the texture map (see the sketch after this item).
co…
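A minimal sketch of how texture maps from two frames could supervise each other, assuming PyTorch and that per-frame texture maps and per-texel visibility masks are already computed; the function `texture_consistency_loss` and its exact form are assumptions, not the paper's loss:
```python
import torch

def texture_consistency_loss(tex_a, tex_b, vis_a, vis_b, eps=1e-8):
    """Hypothetical sketch: L1 agreement between the texture maps of two frames,
    restricted to texels that are visible in both (all names are illustrative)."""
    both_visible = (vis_a * vis_b).unsqueeze(0)       # (1, Hu, Wv), broadcast over RGB
    diff = (tex_a - tex_b).abs() * both_visible
    return diff.sum() / (3.0 * both_visible.sum() + eps)

# usage with dummy data: two un-warped texture maps and their visibility masks
tex_a, tex_b = torch.rand(3, 128, 128), torch.rand(3, 128, 128)
vis_a = (torch.rand(128, 128) > 0.5).float()
vis_b = (torch.rand(128, 128) > 0.5).float()
loss = texture_consistency_loss(tex_a, tex_b, vis_a, vis_b)
```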
-
# Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
> Sora unveils the potential of scaling Diffusion Transformer (DiT) for gener…
-
Hello,
Thank you so much for your great work and codebase!
I would appreciate your clarifications on a few items.
1) From within ```TextToVideoSDPipelineCall.py```, at this [line](https://g…
-
Thank you for sharing the great work! I have a question regarding the design choice of the VQVAE spatial encoder. Currently, only the encoder includes the spatial transformer to encode the relati…
-
Thanks for the great work, and for releasing the training script `train_svd_lcm.py`.
I am trying to reproduce the results using the provided `train_svd_lcm.py`, but after half of the training (20,…