-
Transformer currently takes up a lot of CPU resources per request. We need to cache the tree generation process to reduce the CPU load.
Details listed here:
https://docs.google.com/spreadsheets/d/1q4…
-
I will continue the discussion here, until or if @keean can get Github to restore the thread. Luckily I still have a copy of the deleted #35 thread loaded, so I can copy and paste from it. If the orig…
-
## Keyword: differential privacy
### State-of-the-Art Approaches to Enhancing Privacy Preservation of Machine Learning Datasets: A Survey
- **Authors:** Chaoyu Zhang
- **Subjects:** Cryptography an…
-
### Preliminary Checks
- [X] This issue is not a duplicate. Before opening a new issue, please search existing issues: https://github.com/gatsbyjs/gatsby/issues
- [X] This issue is not a question, fe…
-
Hi,
Thanks for the great work! Can you please share the training scripts for CIFAR10/100?
I tried to train it with your code and the hyperparameters mentioned in the supplementary materials of t…
-
Is the 4090 fully supported in SD?
I am getting the same performance with the 4090 that my 3070 was getting.
-
I need your device name
-
### System Info
Hello, I've been working with dhokas who finetuned Mistral's official instruct model. I have been trying to finetune mistral with several datasets over dozens of ablations. There is v…
-
### System Info
transformers 4.33 (unreleased).
```
model_id = "TheBloke/Yarn-Llama-2-7B-128K-GPTQ"
model = AutoModelForCausalLM.from_pretrained(
model_id,
torch_dtype=torch.float1…
-
**Issue type**
- [ ] Bug Report
- [ ] Feature Request
- [x] Help wanted
- [ ] Other
**SpikingJelly version**
`0.0.0.0.14`
**Description**
你好:
在目标检测任务中,大多数模型都可以分为主干网络+后处理解码的形式,显然转换只能…