-
Issue #817 was closed with this response:
> We fell a little behind. The Python bindings are done in SWIG. I think that can be quickly repurposed into .Net bindings, once done. So it shouldn't be too…
-
According to your paper you use a large batch size of ~32k samples which means that the raw untrained network initially has a chance of ~1/32k of predicting the correct pair.
I am wondering, how t…
-
Hi, what a fantastic resource for developing Persona-based conversational agents!
I want to highlight a recent paper presented at ACL 2024 Findings: [TimeChara: Evaluating Point-in-Time Character H…
-
比如 [MiniCPM-V](https://github.com/OpenBMB/MiniCPM-V)
-
### Feature request
Paper: Mixture-of-Subspaces in Low-Rank Adaptation
Link: https://arxiv.org/abs/2406.11909
Reference Code: https://github.com/wutaiqiang/MoSLoRA/tree/main/visual_instruction_…
-
```text
2024-05-16 03:24:16.542 | INFO | hydit.inference:__init__:160 - Got text-to-image model root path: ckpts/t2i
2024-05-16 03:24:21.606 | INFO | hydit.inference:__init__:172 - Loading C…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Hello,
I am reflecting on how to tackle the problem of searching over a knowledge base …
-
### Misc discussion on performance
I was running `llava-hf/llava-1.5-7b-hf` vs. `meta-llama/Meta-Llama-3-8B-Instruct` on vLLM 0.5.2 and noticed that Llava 7B runs with a significantly smaller batch…
-
Hi, what a fantastic resource for developing PersonaLLM agents!
I want to highlight a recent paper presented at ACL 2024 Findings: [TimeChara: Evaluating Point-in-Time Character Hallucination of Ro…
-
Kohya has added preliminary support for Flux.1 LoRA to his SD3 branch. I have created a `sd3-flux.1` branch and updated to the latest sd-scripts sd3 branch code... No GUI integration yet... I will sta…