-
貼吧活動:(請查閱 [SARS-CoV-2 Timeline by 2020.02.21](https://github.com/agorahub/_meta/blob/agoran/theagora/sari/Memorandum_2020-02-21_SARS-CoV-2-Timeline_Nathan.pdf?raw=true), by Nathan :cloud: )
- Colla…
-
### System Info
Hello, I've been working with dhokas who finetuned Mistral's official instruct model. I have been trying to finetune mistral with several datasets over dozens of ablations. There is v…
-
Submitting Author: Drew Dimmery (@ddimmery)
Repository: https://github.com/ddimmery/tidyhte/
Submission type: Pre-submission
---
- Paste the full DESCRIPTION file inside a code block belo…
-
### System Info
```shell
optimum-habana 1.6.1
text-generation 0.6.0
text-generation-server 0.9.2
langchain …
-
Hello, thanks for this really cool repository. I'm recently learning about `pjit` and your repo is a valuable reference resource.
I was having some issues running the example code. In particular, i…
-
### System Info
INFO text_generation_launcher: Runtime environment:
Target: x86_64-unknown-linux-gnu
Cargo version: 1.70.0
Commit sha: 31b36cca21fcd0e6b7db477a7545063e1b860156
Docker label: sha…
-
Hi, I met following issue when I compared output results of ```bloomz-560m``` with OV, Pytorch and ORT on CPU.
It seems that Pytorch,ORT and OV without cache generate same outputs, while OV with cach…
-
Hi, so I have also [opened this issue](https://github.com/huggingface/accelerate/issues/752) with HuggingFace Accelerate, since I am using deepspeed zero stage 3 through their API, but feel this may b…
-
I am trying to save a simple LSTM model for text classification. The input of the model is padded vectorized sentences.
```
model = Sequential()
model.add(LSTM(40, input_shape=(16, 32)))
model.add…
-
### System Info
- `transformers` version: 4.31.0
- Platform: Linux-5.4.119-19.0009.28-x86_64-with-glibc2.35
- Python version: 3.10.6
- Huggingface_hub version: 0.15.1
- Safetensors version: 0.3.1…