-
Hi,
I'm looking to run a local version of gddo to point at a private github organisation.
I have followed the dev env setup in the wikki however each time I run the command to launch `./godo-server…
-
### Steps to reproduce the behavior (Required)
1. Create a partitionned table with primary key
```
CREATE TABLE IF NOT EXISTS mydata (
id BIGINT,
created_on DATETIME
)
ENGINE=OLAP
PR…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
How can I use astream_chat method with chatEngine, How can I use it with API (fastapi) t…
-
## Goal
- Jan should be able to seamlessly move from Nitro to cortex.cpp
- What is the scope of change?
- Different inference extensions? (e.g. `nitro-extension`, and `cortex-extension`?)
-…
-
I am trying to accelerate the inference speed of LLama 3 8b on a 4090 using quantization. I noticed this https://github.com/huggingface/optimum-nvidia which should allow to use fp8 and have huge speed…
-
Graphhopper has a /navigate endpoint that is very useful for integration of Graphhopper into projects like MapLibre Navigation or Ferrostar, which both support OSRM type responses. But the documentati…
-
### What went wrong?
im getting OnCall was not able to load the current user. Try refreshing the page
my plugin is connecting to grafana but it is not loading getting OnCall was not able to load t…
-
Java 21 ships with virtual threads: https://openjdk.org/jeps/444
The JUnit Platform should provide an implementation of the `HierarchicalTestExecutorService` using virtual threads as an opt-in feat…
-
### Bug summary
This is just a restatement of: https://github.com/PrefectHQ/prefect/issues/13584 with more info and the motivation to follow it up.
If I can fix this, I'll be recommending my clien…
-
### How would you like to use vllm
I want to run inference of a [TheBloke/Llama-2-7B-Chat-GPTQ](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GPTQ). I don't know how to use it with vllm.
I try t…