-
I use 1 node with 4*V100,got 700it/s,and 1 node with 4*P40,got 300it/s,
but when I use 2 nodes with 4*V100 and 4*P40 by deepspeed,got
**“4 pytorch allocator cache flushes since last step. this happ…
-
File "Moore-AnimateAnyone/src/models/mutual_self_attention.py", line 180, in hacked_basic_transformer_inner_forward
norm_hidden_states[_uc_mask],
IndexError: The shape of the mask [2] at index 0…
-
---
### Summary
This proposal suggests a unified approach to integrating MRF-like moderation capabilities [](https://docs-develop.pleroma.social/backend/configuration/mrf/) within the ActivityPo…
-
### 🚀 The feature, motivation and pitch
Certain models require execution methods other than `forward()`. A good example family of models are generation models, for example Huggingface's `transforme…
-
I am currently exploring recommender systems for a masters project at university. My dataset consists of tweets by users with a bunch of brief user metadata such as location and item metadata such as …
-
I'm using Sagemaker Studio to train a MQCNN model, under default layer settings model runs without any error using CPU instance. But once I switched to 'ml.p3.2xlarge' instance and change ctx from 'cp…
-
**Background and Context**$
Hi, it seems that all the models can generate false triplets by inversing the subjects and objects of existing ones. However, I try to generate embedding from a graph wh…
-
Hi, thank you for releasing this implementation! I have several questions regarding how this works relative to my understanding of the original paper:
1. Why is the action always the mode (i.e. arg…
-
NOTE: ISSUES ARE NOT FOR CODE HELP - Ask for Help at https://stackoverflow.com
Your issue may already be reported!
Also, please search on the [issue tracker](../) before creating one.
* **I'm s…
-
Hi,
Thank you for providing this useful dataset. I tried to run your model on another dataset but I met some difficulties. I also have some questions regarding the code. It would be great if you c…