Open nasjlwk opened 3 months ago
accelerate 0.28.0
aiofiles 23.2.1
altair 5.2.0
annotated-types 0.6.0
anyio 4.3.0
attrs 23.2.0
certifi 2024.2.2
charset-normalizer 3.3.2
click 8.1.7
colorama 0.4.6
contourpy 1.2.0
cpm-kernels 1.0.11
cycler 0.12.1
exceptiongroup 1.2.0
fastapi 0.110.0
ffmpy 0.3.2
filelock 3.13.3
fonttools 4.50.0
fsspec 2024.3.1
gradio 4.23.0
gradio_client 0.14.0
h11 0.14.0
httpcore 1.0.4
httpx 0.27.0
huggingface-hub 0.22.1
idna 3.6
importlib_resources 6.4.0
Jinja2 3.1.3
jsonschema 4.21.1
jsonschema-specifications 2023.12.1
kiwisolver 1.4.5
latex2mathml 3.77.0
Markdown 3.6
markdown-it-py 3.0.0
MarkupSafe 2.1.5
matplotlib 3.8.3
mdtex2html 1.3.0
mdurl 0.1.2
mpmath 1.3.0
networkx 3.2.1
numpy 1.26.4
orjson 3.9.15
packaging 24.0
pandas 2.2.1
pillow 10.2.0
pip 23.3.1
protobuf 5.26.0
psutil 5.9.8
pydantic 2.6.4
pydantic_core 2.16.3
pydub 0.25.1
Pygments 2.17.2
pyparsing 3.1.2
python-dateutil 2.9.0.post0
python-multipart 0.0.9
pytz 2024.1
PyYAML 6.0.1
referencing 0.34.0
regex 2023.12.25
requests 2.31.0
rich 13.7.1
rpds-py 0.18.0
ruff 0.3.4
safetensors 0.4.2
semantic-version 2.10.0
sentencepiece 0.2.0
setuptools 68.2.2
shellingham 1.5.4
six 1.16.0
sniffio 1.3.1
starlette 0.36.3
sympy 1.12
tokenizers 0.13.3
tomlkit 0.12.0
toolz 0.12.1
torch 1.13.0+cu116
torchaudio 0.13.0+cu116
torchvision 0.14.0+cu116
tqdm 4.66.2
transformers 4.27.1
typer 0.11.0
typing_extensions 4.10.0
tzdata 2024.1
urllib3 2.2.1
uvicorn 0.29.0
websockets 11.0.3
wheel 0.41.2
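For anyone trying to compare their setup against the package list above, a shorter report covering only the packages the script is likely to touch may be easier to read. The sketch below is one way to produce it with the standard library; the helper name `report_versions` and the choice of packages are assumptions made for illustration, not something the issue specifies.

```python
from importlib.metadata import PackageNotFoundError, version


def report_versions(packages):
    """Return a {package: version} dict; missing packages map to 'not installed'."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The package is not present in the active environment.
            found[name] = "not installed"
    return found


if __name__ == "__main__":
    # Hypothetical selection: packages from the list above that the snippet depends on.
    relevant = ["torch", "transformers", "tokenizers", "sentencepiece", "cpm-kernels", "accelerate"]
    for name, ver in report_versions(relevant).items():
        print(f"{name}=={ver}")
```

Running it inside the same environment as the failing script keeps the report tied to the interpreter that actually loads the model.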
Is there an existing issue for this?
Current Behavior
Expected Behavior
No response
Steps To Reproduce
from transformers import AutoTokenizer, AutoModel

# Raw strings keep the backslashes in the Windows path literal.
tokenizer = AutoTokenizer.from_pretrained(r"G:\model\chatglm_6B", trust_remote_code=True)
model = AutoModel.from_pretrained(r"G:\model\chatglm_6B", trust_remote_code=True).quantize(4).half().cuda()
model = model.eval()

response, history = model.chat(tokenizer, "你好", history=[])  # "Hello"
print(response)
response, history = model.chat(tokenizer, "晚上睡不着应该怎么办", history=history)  # "What should I do if I can't sleep at night?"
print(response)

Run this code directly.
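The reproduction script passes a Windows path containing backslashes to `from_pretrained`. Whether that is related to the reported problem is not stated in the issue, but it is a common source of surprises, so here is a minimal sketch of how Python treats backslashes in string literals. The `G:\temp` folder is a hypothetical name chosen only to show the effect.

```python
from pathlib import PureWindowsPath

# Raw string: backslashes are kept literally.
raw_path = r"G:\model\chatglm_6B"

# Equivalent spellings that avoid escape-sequence surprises.
escaped_path = "G:\\model\\chatglm_6B"
forward_path = "G:/model/chatglm_6B"

# Hypothetical folder name starting with "t": in a normal string, "\t" is a TAB.
risky = "G:\temp"   # G, :, TAB, e, m, p
safe = r"G:\temp"   # G, :, \, t, e, m, p

print(raw_path == escaped_path)
print(PureWindowsPath(raw_path) == PureWindowsPath(forward_path))
print(repr(risky), repr(safe))
```

A raw string or forward slashes gives the same path regardless of which letters follow each separator, which is why either form is a safer default for local model directories.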
Environment
Anything else?
No response