phi-3-vision Search Results

468 results
for phi-3-vision

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

kymatio/kymatio #645

kymatio for complex input signals

Hi, Is Kymatio able to work with complex signals, given that the wavelets are already analytic wavelets. Essentially, the constant-Q fitlers banks need to cover the whole digital frequency to 1, in…

Buffy-nevc updated 3 years ago
8
mbzuai-oryx/VideoGPT-plus #24

Issue of garbled text

Thanks for your exciting work! I try to use `eval/vcgbench/inference/run_ddp_inference.sh` to reproduce the performance on VCGBench with 4*A100 GPUs, but the generated texts are garbled as follows:…

ShuyUSTC updated 1 month ago
2
microsoft/Phi-3CookBook #50

Fine tune vision model with multiple images

Hello, is it possible to fine tune the vision model with multiple images?

pbarker updated 2 months ago
4
2U1/Phi3-Vision-Finetune #7

Trouble of merging werights

Hi! Thank you for your great work. I have a problem when I want to do merging weights. 1. I have a fake small data to do lora fine tuning. Getting 6 files, **adapter_config.json adapter_model.safet…

WYY220062 updated 2 months ago
5
huggingface/trl #1802

Phi-3 SFT training and padding tokens

Hi, I'm trying to perform SFT training with Phi-3-vision, I followed the example with llava here https://github.com/huggingface/trl/blob/main/examples/scripts/vsft_llava.py. That however didn't work o…

DavidePaglieri updated 6 days ago
7
microsoft/onnxruntime-genai #565

Loading Image from bytes

Hey, I am trying to integrate your phi-3-vision script into LitServe. How can I use the already loaded image (bytes) and pass it to the processor? (I don't want to save it as local/tmp file) Tha…

msciancalepore98 updated 2 months ago
1
SlimeVRX/SlimeVRX #27

[paper] Transpose

Abstract Motion capture is facing some new possibilities brought by the inertial sensing technologies which do not suffer from occlusion or wide-range recordings as vision-based solutions do. Howev…

SlimeVRX updated 2 years ago
6
NVlabs/eg3d #18

FFHQ dataset camera pose labeling

"We use an off-the-shelf face detection and pose-extraction pipeline to both identify the face region and label the image with a pose" Would it be possible to add the code to generate camera pose l…

l4rz updated 1 year ago
26
pytorch/pytorch #104296

affine_grid and grid_sample operators merge/accelleration

### 🚀 The feature, motivation and pitch Hi, To warp some data according to a (batch) of affine transformations, two functions called sequentially need to be used: 1. [affine_grid](https://p…

g-moschetti updated 11 months ago
29
vllm-project/vllm #5883

[Misc]: Curious why this is happening: Running phi-3-vision …

### Anything you want to discuss about vllm. I was wondering why does this happen? I am new to this space and was playing around with different machines, models and frameworks. I am able to infere…

chandeldivyam updated 2 months ago
4

上一页 1...20 21 22 23 24 25 26...47 下一页

468 results for phi-3-vision

468 results
for phi-3-vision