-
Let's create a higher bandwidth compressed representation of our communication using a new language that we invent on the Fly that is emergent using emojis and text and mathematical symbols in a f…
-
Thanks for sharing this interesting work.
I was wondering how do you do the inference on text-only tasks such as MMLU? Do you just use Llama3?
If so, this work actually keeps two models, one is Llam…
viyjy updated
2 months ago
-
We need to convert keras.io examples to work with Keras 3.
This involves two stages:
## Stage 1: tf.keras backwards compatibility check
Keras 3 is intended as a drop-in replacement for tf.ker…
-
-
@ggerganov do you have any interest in producing more models in GGML format?
I'm now convinced your approach of zero dependency, no memory allocation cpu-first ideaology will make it accessible to…
-
# Vision Transformer Adapter for Dense Predictions
Info.
- ICLR 2023 spotlight
- https://github.com/czczup/ViT-Adapter
- https://arxiv.org/abs/2205.08534
### Summary
- plain ViT
- whi…
-
Post a link for a "possibility" reading of your own on the topic of Sound and Image Learning [for week 7], accompanied by a 300-400 word reflection that: 1) briefly summarizes the article (e.g., as we…
lkcao updated
2 years ago
-
### Feature Name
Llava-next -34B
### Feature Description
Research about Llava-next -34B
### Research Findings
### LLaVA-NeXT-34B
**LLaVA-NeXT-34B** is a model in the LLaVA-NeXT series, which e…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
### Describe the bug
1.session length长度不一致,…
-
See example output below. The example does not work - no "human input" is ever sought - and lacks any explanation of how the feature is supposed to be used, making it useless.
```
[DEBUG]: == Wor…