-
Hello!
I actually have two models - CodeLLaMa-13b-Python and CodeLLaMa-13b, that need to be merged. The overall goal is to merge two models (one trained on Python and another trained on any other lan…
-
The current Jacobian helper function implementation in `gradient.py` seems like it might be slower than it needs to be. Given a vector output (size _N_) of a graph with a vector input (size _M_), it c…
-
What is the max token length that this model can support? Can it support more than 10k?
-
**What is your question?**
I have a dataset that includes a column feature of pre-trained embeddings. I couldn't find any documentations or examples on how this column should be passed to NVTabular. …
-
Hi Oscar,
First of all many thanks for your tutorials, they are incredibly useful to learn quantization and get hands-on experience on this!
I have the following situation and perhaps you could …
-
I want see the model's detail in the code,but i found the Pytorch Lightning in the pycharm can't debug, they just run,how can i see the training data flows in the model? And it will makes me understan…
-
Hello again, I got this training error when running "train.py", how can I solve this?
```
(hiface) G:\HiFaceGAN\Face-Renovation-master>python train.py
train.py
dataset [TrainDataset] of size 7 w…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.2.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
한줄 평 : 우리 모델, 빠름. 가벼움. 쓰셈
Transformer와 관련해서 다양한 모델들이 나왔습니다.
이들 중에서 장점만을 모아서, 가장 Efficiency가 좋은 모델을 만들었습니다.
Observation 1 :
Patch Embedding -> Convolution Stem
Larger Kernel과 stride를 사용하는 Pat…
-
### System Info
ubuntu 20.04
tensorrt 10.0.1
tensorrt-cu12 10.0.1
tensorrt-cu12-bindings 10.0.1
tensorrt-cu12-libs 10.0.1
tensorrt-llm …