-
The example should show tensor parallelism. I am not sure if Serve + vLLM + tensor parallelism works at the moment because the Serve deployment will request N GPUs, then each vLLM worker will request …
-
Problem: CatBoostError: /src/catboost/catboost/private/libs/algo/tensor_search_helpers.cpp:99: This should be unreachable
catboost version: 1.26.4
Operating System: Ubuntu Linux
CPU: Intel x86
GPU…
8W9aG updated
3 months ago
-
Thank you very much for your open-source project. But in this job, I found that the ComfyUI node you provided and the MOCHI inference result of genmoai are different. I don't know where my operation w…
-
Can I use scripts/demo.py to estimate exact depth by inputting my own images?
I mean if the min distance in an image is 1cm, the max distance in an image is 10m, will I get a same colorized image as …
-
Assumming gpt manager scheduled 1 context-phase sequence and 2 generation-phase sequences at a certain moment:
```
[ctx_a, gen_a, gen_b]
```
Then, my understanding about `gpt_attention` inputs:
…
-
I want to make the green's functions for 1D local velocity model with these parameters:
freqmin = 0.025
freqmax = 1.0
npts = int(2048)
dt = 0.02
The synthetics have been generated. But it is diff…
-
The app is crushing with no error when I apply this LORA:
https://civitai.com/models/251417
`sd -m "D:\Stable-diffusion\ComfyUI\models\checkpoints\SDXL\himerosxl_v206.safetensors" --lora-model-…
-
@MaheshRavishankar @bjacob and I had a discussion today about not having unset_encoding ops at Flow. This can make fusion logics simpler; it also make mmt4d fusion easier. The proposal is only setting…
-
```
import os
import torch
import numpy as np
import triton
import triton.language as tl
import triton
@triton.jit
def compute_min_distance_coord(input_ptr: tl.tensor,
…
-
Hello! Having studied the documentation provided, I still could not understand whether there is support for GGUF quantized models on AMD GPU. I would like to use the Q8 or even Q4 model based on Mistr…