-
I compared the inference speed of the original ChatGLM2-6B model, the chatglm2-6b-int4.flm model deployed on a single GPU, and the chatglm2-6b-int4.flm model deployed across multiple GPUs:
The original ChatGLM2-6B model runs at around `100 token/s`
The chatglm2-6b-int4.flm model deployed on a single GPU runs at around `220 token/s`
The chatglm2-6b-int4.flm model deployed across multiple GPUs…
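For reference, a minimal sketch of how a multi-GPU flm deployment can be set up with fastllm's Python bindings. The `set_device_map` call and the `llm.model` loader follow the fastllm README; the split ratios and file path are illustrative assumptions, not values from this comparison:

```python
from fastllm_pytools import llm

# Assumed from the fastllm README: split layers across two GPUs
# in a 10:10 ratio before loading the quantized model.
llm.set_device_map({"cuda:0": 10, "cuda:1": 10})

model = llm.model("chatglm2-6b-int4.flm")  # load the converted flm file
print(model.response("你好"))
```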
-
```
./benchmark -p /opt/Convert/flm/qwen-14b-chart-int4.flm -f ../example/benchmark/prompts/beijing.txt -b 1
Load (323 / 323)
Warmup...
finish.
AVX: ON
AVX2: ON
AARCH64: OFF
Neon FP16: OFF
Neon D…
```
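Reading the invocation above, `-p` appears to be the path to the converted flm model, `-f` a file of benchmark prompts, and `-b` the batch size; these readings are inferred from the command itself, not from the tool's documentation.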
-
**According to the README, there are two ways to use chatglm2-6b with fastllm:**
**Method 1:**
```
# This is the original program, which creates the model through the huggingface interface
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("THUDM/chatgl…
```
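The excerpt cuts off before the remainder of Method 1 and all of Method 2. For orientation, a hedged sketch of the two usage paths the fastllm README describes, with `llm.from_hf` (in-memory conversion) and `llm.model` (loading a pre-exported flm file) as the assumed APIs; the int4 dtype is an illustrative choice:

```python
from transformers import AutoTokenizer, AutoModel
from fastllm_pytools import llm

# Path 1 (sketch): create the model through the huggingface interface,
# then convert the in-memory model into a fastllm model.
tokenizer = AutoTokenizer.from_pretrained("THUDM/chatglm2-6b", trust_remote_code=True)
model = AutoModel.from_pretrained("THUDM/chatglm2-6b", trust_remote_code=True)
model = llm.from_hf(model, tokenizer, dtype="int4")  # dtype choice is illustrative

# Path 2 (sketch): load a model that was already exported to flm format.
# model = llm.model("chatglm2-6b-int4.flm")

print(model.response("你好"))
```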
-
When converting the SUS-Chat-34B model (which is fully compatible with the llama architecture) to flm format, I got this error:
```python
root@5ce5bafeea81:/app# python glm_trans_flm.py
Loading checkpoint shards: 100%|██████████████████████████████████████████████████| 7/7 [0…
```
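For context, a conversion script along these lines is typical; this is a sketch assuming fastllm's `torch2flm.tofile` exporter and an illustrative checkpoint path, not the actual `glm_trans_flm.py` from the report:

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
from fastllm_pytools import torch2flm

# Illustrative path; SUS-Chat-34B is llama-architecture, so the generic
# causal-LM loader should apply.
path = "SUSTech/SUS-Chat-34B"
tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(path, torch_dtype=torch.float16,
                                             trust_remote_code=True)

# Export to flm with int4 quantization (dtype choice is illustrative).
torch2flm.tofile("sus-chat-34b-int4.flm", model, tokenizer, dtype="int4")
```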
-
First generate fp16.flm and int4.flm from chatglm2-6b, then measure the speed of each:
CUDA: 11.6
GPU: matrox g200eh3
batch = 1
Speed-test code:
```python
import time

start = time.time()
text = "中国法定货币是什么?"  # "What is the legal currency of China?"
outs = ""
for i in range(10):
    out = model.response(text)
…
```
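The excerpt ends before the timing is computed. A hedged sketch of how token/s is typically derived from such a loop; the `tokenizer` used to count output tokens and the `outs` accumulator are assumptions carried over from the surrounding context:

```python
# Sketch only: divide the number of generated tokens by wall-clock time.
elapsed = time.time() - start
total_tokens = len(tokenizer.encode(outs))
print(f"{total_tokens / elapsed:.1f} token/s over {elapsed:.1f} s")
```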
-
The pre-tagging tool is looking for FLM_ZT_output. It also cannot find memory\FLM_PT_Extract_***.
![image](https://user-images.githubusercontent.com/502763/144925649-ba73274f-56e0-48d6-8b22-88791a6a9b6b.…
-
FLM:1
FLT:2
BLM:3
BLT:4
BRM:5
BRT:6
FRM:7
FRT:8
Intake motors: 10-19
Shooter motors: 20-29
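Expressed as a lookup table (a sketch only; the variable names and the grouping into ranges are illustrative, and the meaning of the abbreviations is not stated in the source):

```python
# Hypothetical table of the device IDs listed above.
MOTOR_IDS = {
    "FLM": 1, "FLT": 2,
    "BLM": 3, "BLT": 4,
    "BRM": 5, "BRT": 6,
    "FRM": 7, "FRT": 8,
}
INTAKE_MOTOR_IDS = range(10, 20)   # "Intake motors: 10-19"
SHOOTER_MOTOR_IDS = range(20, 30)  # "Shooter motors: 20-29"
```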
-
Model: baichuan2-13B-chat
Issue 1:
Reproduction code block:
```python
In [4]: import pyfastllm
In [5]: model = pyfastllm.create_model("baichuan2-int8.flm")
In [6]: prompt = model.make_input("", 0, "你好")
In [7]: prompt
Out[7]: '…
```
-
[Symptom]
The qwen1.5-14B-Chat model raises UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 1: unexpected end of data during decoding.
[Description]
The model input is: 假设f(x)=x,那么f(x)1到2的积分是多少 ("Suppose f(x)=x; what is the integral of f(x) from 1 to 2?"). The token IDs in the model output include 11995 and 18137; these two token IDs will…
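This error pattern usually means a multi-byte UTF-8 character is split across two tokens, so decoding each token's bytes on its own fails mid-sequence. A hedged sketch of the standard fix, buffering partial bytes with an incremental decoder; the byte chunks below are illustrative, not the actual bytes of token IDs 11995/18137:

```python
import codecs

# Hold back incomplete multi-byte sequences until the next chunk arrives.
decoder = codecs.getincrementaldecoder("utf-8")(errors="strict")

def decode_stream(byte_chunks):
    text = ""
    for chunk in byte_chunks:
        text += decoder.decode(chunk)        # returns only complete characters
    text += decoder.decode(b"", final=True)  # flush at end of stream
    return text

# '钱' (0xE9 0x92 0xB1) split across two chunks still decodes cleanly:
print(decode_stream([b"\xe9", b"\x92\xb1"]))
```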
-
ST updated the Keil.STM32L4xx_DFP pack to version 2.6.2 in June 2023.
They introduced a second flash algorithm for the STM32L4R5 family in the *Keil.STM32L4xx_DFP.pdsc* file:
```
…
```