-
-
## Keyword: efficient
### End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
- **Authors:** Javier Campos, Zhen Dong, Javier Duarte, Amir Gholami, Michael W. Mahoney,…
-
The paper shows the loss of training the codebook, however, the code shows that the residual vector quantization codebook is not learnable and there is also no commitment_weight or orthogonal_loss for…
-
Thanks for your work!
I tried to use the table dataset in ShapeNetV2 and test the code, the code works fine. And here is my issue:
During training phase, if I test the code using small dataset,…
-
MPJPE of RVQ on HumanML3D from [MoMask](https://yxmu.foo/pdfs/momask.pdf) is **29.5**,
but the statistic from your paper is **63.1**.
Maybe RVQ is not that bad? Can you provide more details ab…
-
As of ~~b2579~~ b2646, prompts (among other parameters) are internally stored as `std::string`s, which is basically glorified `std::vector` and do not care or handle character encodings. This will not…
-
**Is your feature request related to a problem? Please describe.**
ETL Indexing operation is preventing con-current searching.
In its most basic form, a single indexing operation involves deleti…
-
I'm working with a long context model (gradientai/Llama-3-8B-Instruct-262k) that exceeds the memory of a single A100 GPU. While the model weights are loaded, when I try to run inference, I get CUDA Ou…
-
**问题描述 / Problem Description**
我用ChatGLM-Efficient-Tuning进行lora微调后的模型,在本项目中启动报错
**复现问题的步骤 / Steps to Reproduce**
![image](https://github.com/chatchat-space/Langchain-Chatchat/assets/143932534/0f8…
-
So I was thinking about the following idea.
It is probably completely bogus, but I would definitely investigate it when and if I had the time to, so maybe someone else would be interested as well.
…