-
Hello, I am trying to recreate this notebook https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/01_how_to_train.ipynb for Transformer-XL.
I made changes to the tokenizer a…
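For context, here is a minimal sketch of loading Transformer-XL's stock tokenizer and model (this is not code from the original notebook; the `transfo-xl-wt103` checkpoint is an assumption, and these classes require a `transformers` version that still ships Transformer-XL):

```python
from transformers import TransfoXLTokenizer, TransfoXLLMHeadModel

# Transformer-XL uses a word-level tokenizer (it may require the sacremoses
# package), unlike the byte-level BPE tokenizer trained from scratch in the
# notebook, so the tokenizer step needs to change in any case.
tokenizer = TransfoXLTokenizer.from_pretrained("transfo-xl-wt103")
model = TransfoXLLMHeadModel.from_pretrained("transfo-xl-wt103")

ids = tokenizer("hello world", return_tensors="pt")["input_ids"]
print(tokenizer.convert_ids_to_tokens(ids[0].tolist()))
```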
-
In the fine-tuning script **finetune/34B/finetune.sh**:
--learning_rate 9.65e-6 \
--lr_scheduler_type 'linear' \
**Question**: are these two parameter values empirically determined optima, or are they optimal specifically for the 34B model?
Many fine-tuning scripts (for example in the FastChat project) set lr_scheduler_type to cosine, and learni…
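For reference, a minimal sketch of how those two shell flags map onto `transformers` `TrainingArguments` (the values below just mirror the finetune.sh snippet quoted above; they are not a recommendation):

```python
from transformers import TrainingArguments

# The two shell flags correspond directly to these TrainingArguments fields.
args = TrainingArguments(
    output_dir="out",
    learning_rate=9.65e-6,
    lr_scheduler_type="linear",  # alternatives include "cosine", "constant", ...
)
print(args.learning_rate, args.lr_scheduler_type)
```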
-
## Bug Description
I am receiving this error when compiling a TorchScript module. The module is a light wrapper around a [SequenceGenerator](https://github.com/facebookresearch/fairseq/blob/main/fair…
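For illustration only, a minimal sketch of the wrapper-plus-`torch.jit.script` pattern described above; the inner module here is a placeholder stand-in, not fairseq's actual SequenceGenerator, so this sketch compiles even though the real module may not:

```python
import torch
from torch import nn

class GeneratorStandIn(nn.Module):
    # Placeholder for the real SequenceGenerator; only here to make the sketch runnable.
    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        return tokens

class LightWrapper(nn.Module):
    def __init__(self):
        super().__init__()
        self.generator = GeneratorStandIn()

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        return self.generator(tokens)

# torch.jit.script compiles the wrapper and its submodules to TorchScript.
scripted = torch.jit.script(LightWrapper())
print(scripted(torch.zeros(1, 4, dtype=torch.long)).shape)
```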
-
Hi,
It's going to be hard for me to pinpoint this broken inference because I am not getting any logs, except that I sometimes get CUDA errors, and those are intermittent. After the new version release I have …
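As a generic debugging step (not specific to this project), a minimal sketch of forcing synchronous CUDA launches so intermittent CUDA errors surface with a usable stack trace:

```python
import os

# Force synchronous kernel launches so a failing kernel raises at the call site
# instead of at a later, unrelated CUDA operation. This must be set before CUDA
# is initialized (i.e. before the model is moved to the GPU).
os.environ["CUDA_LAUNCH_BLOCKING"] = "1"

import torch

# Alternatively, add explicit synchronization points around suspect code;
# torch.cuda.synchronize() raises any pending asynchronous error.
if torch.cuda.is_available():
    torch.cuda.synchronize()
```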
-
### System Info
- `transformers` version: 4.31.0
- Platform: Linux-5.19.0-42-generic-x86_64-with-glibc2.35
- Python version: 3.9.16
- Huggingface_hub version: 0.14.1
- Safetensors version: 0.3.1
…
-
What could be the cause of this problem?
```
2023-02-09 08:23:10 | INFO | fairseq.tasks.translation | /data/home/likai/NMT-offline/knn-box/knnbox-scripts/vanilla-knn-mt/../../data-bin/zh2en-ziyan-03 train zh-en 721 exampl…
-
Please find the (unofficial) environment.yml below. I had some issues setting up the environment correctly myself and hope this will help someone else.
DISCLAIMER: I have not yet checked if the m…
-
My minimal example:
```python
import torch
from transformers import AutoTokenizer, StoppingCriteria, StoppingCriteriaList

device = "cuda" if torch.cuda.is_available() else "cpu"
repo = "meta-llam…
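# (The snippet above is truncated.) Below is a minimal sketch of how a custom
# StoppingCriteria is typically defined and passed to generate(); the stop
# token id used here is a hypothetical placeholder, not from the original report.
class StopOnToken(StoppingCriteria):
    def __init__(self, stop_token_id: int):
        self.stop_token_id = stop_token_id

    def __call__(self, input_ids, scores, **kwargs) -> bool:
        # Stop as soon as the most recently generated token matches the stop token.
        return input_ids[0, -1].item() == self.stop_token_id

# stopping_criteria = StoppingCriteriaList([StopOnToken(stop_token_id=2)])
# model.generate(..., stopping_criteria=stopping_criteria)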
-
Can the model be converted to an ONNX model?
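If the question is about a standard PyTorch model, here is a minimal sketch of exporting with `torch.onnx.export` (the model and input shapes below are hypothetical placeholders, not the model from this repository):

```python
import torch

# Hypothetical placeholder model; substitute the actual model in question.
model = torch.nn.Linear(16, 4).eval()
dummy_input = torch.randn(1, 16)

# torch.onnx.export traces the model with the dummy input and writes an ONNX graph.
torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    dynamic_axes={"input": {0: "batch"}, "output": {0: "batch"}},
    opset_version=13,
)
```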
-
I modified chat_completion in streamllama.py so that once the for loop finishes it returns the result directly instead of streaming. The first call works fine, but from the second call onward it times out.
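For illustration only, a hypothetical sketch of the two patterns being compared (streaming via `yield` versus running the loop to completion and returning once); it does not reproduce streamllama.py's actual internals:

```python
# Hypothetical sketch; the real chat_completion in streamllama.py is not shown here.
def chat_completion_stream(chunks):
    # Original streaming style: yield each chunk as it is produced.
    for chunk in chunks:
        yield chunk

def chat_completion_blocking(chunks):
    # Modified style: run the whole loop, then return the joined result once.
    pieces = []
    for chunk in chunks:
        pieces.append(chunk)
    return "".join(pieces)
```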