-
I encountered an issue while trying to reproduce the results by loading the gpt2_bc_webshop_history.pt model and running the run.py script. The training was initiated with the following parameters:
…
-
1. `test_dataloader_list = [copy.deepcopy(1) for _ in range(args.num_clients)]` initializes every entry to 1, but the code further down uses `append`, which causes an error.
2. For the IID dataset, the test set is not partitioned.
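The two points above can be illustrated with a minimal sketch. It is not the repository's code: `num_clients` stands in for `args.num_clients`, the string "loaders" are placeholders for real dataloaders, and `iid_partition` is a hypothetical helper showing one way to split a test set evenly across clients.

```python
import copy
import random

num_clients = 3  # hypothetical stand-in for args.num_clients

# Pattern from the report: the list is pre-filled with placeholder 1s ...
test_dataloader_list = [copy.deepcopy(1) for _ in range(num_clients)]
# ... and later extended with append, so real entries land after the placeholders.
for client_id in range(num_clients):
    test_dataloader_list.append(f"loader_{client_id}")
buggy_length = len(test_dataloader_list)  # twice the number of clients

# One possible fix: start from an empty list so append yields one entry per client.
fixed_list = []
for client_id in range(num_clients):
    fixed_list.append(f"loader_{client_id}")


def iid_partition(num_samples, num_clients, seed=0):
    """Shuffle sample indices and deal them round-robin to the clients."""
    indices = list(range(num_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::num_clients] for i in range(num_clients)]


# Hypothetical test set of 10 samples split across the clients.
test_partitions = iid_partition(10, num_clients)
```

With the buggy pattern, indexing `test_dataloader_list[0]` returns the placeholder `1` rather than a dataloader, which is the kind of failure the report describes.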
-
## Bug Description
Configuration: llm_examples_main branch, current torch version: 2.4, transformers==4.41.2
Error message:
```py
File "/home/dperi/Downloads/TensorRT/examples/dynamo/torch_e…
-
### Issue Type
Build/Install
### Modules Involved
MPC protocol
### Have you reproduced the bug with SPU HEAD?
Yes
### Have you searched existing issues?
Yes
### SPU Version
0.9.3.dev20241013
…
-
privacy_engine = PrivacyEngine(
File "/home/idris/.local/lib/python3.10/site-packages/private_transformers/privacy_engine.py", line 176, in __init__
raise ValueError(
ValueError: Model type …
-
When testing the MPS (Metal Performance Shaders) implementation using examples/gpt2,
tensor creation allocates a lot of backend and compute structs in a short time.
This type of task triggers `Context leak detected, msgtra…
-
Exception raised after adding 4th layer:
```
java.nio.BufferOverflowException
at java.base/java.nio.Buffer.nextPutIndex(Buffer.java:744)
at java.base/java.nio.HeapByteBuffer.putInt…
-
**The bug**
When using `models.LlamaCpp`, the selected tokenizer is always gpt2 (this can be seen in the output when the `verbose=True` argument is set). I have pasted the dumped KV metadata keys:
```
llama_mod…
-
Is GPT2 supported, for example [GPT2-Chinese](https://github.com/Morizeyao/GPT2-Chinese)? Could you provide a corresponding example? Thanks.
-
Python is doing a lot of heavy lifting and hiding a lot of the complexity. It'd be interesting to compare the Python GPT code with Karpathy's pure C/CUDA implementation of the same:
https://github.c…