-
Hi,
I am trying to fine-tune BLIP-2 on COCO. In the caption_dataset there is a sample['input_text']; however, the forward method of blip2_t5 expects sample['output_text']. The prompt is being attach…
-
Hi,
I was trying the "zero copy" method from the t5 notebook on a seq2seq transformer model. When I set `clone_tensor` to True, everything looks fine, just not as much speedup as I expected.
W…
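As general background for the zero-copy question above (a minimal NumPy sketch of the view-vs-clone distinction, not the t5 notebook's actual mechanism): a zero-copy view shares its buffer with the source tensor, so writes to the source are visible through the view, while a clone owns an independent buffer.

```python
import numpy as np

src = np.arange(6, dtype=np.float32)

view = src[:3]          # zero-copy: shares the underlying buffer with src
clone = src[:3].copy()  # independent buffer: a real copy

src[0] = 42.0
print(view[0])   # 42.0 -- the write to src shows through the view
print(clone[0])  # 0.0  -- the clone is unaffected
print(np.shares_memory(src, view), np.shares_memory(src, clone))  # True False
```

This is why setting a `clone_tensor`-style flag to True trades away some of the zero-copy speedup: each clone pays for an extra allocation and memcpy.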
-
As the title says; still following this and waiting for updates.
-
Hi, I've been trying to deploy an mtf model to the NVIDIA Triton Inference Server by converting the SavedModel (output of model.export()) to an onnx file with no luck. I've been receiving several erro…
-
Hi thanks for this great library! I'm trying to get BLIP-2 running on Replicate, using `blip2_t5` with `caption_coco_flant5xl`. I was able to get the original BLIP working fine, but when I try BLIP-2 …
-
Provide as much information as possible. At a minimum, this should include a description of your issue and steps to reproduce the problem. If possible, also provide a summary of what steps or workarounds y…
-
Thanks for sharing the repo; it is really helpful.
I'm exploring ways to run the optimization on the GPU. I know it's not presently supported. Could you share an approach or references for implementing th…
-
### System Info
x86-64
4 A10
0.9.0
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supported tas…
-
Hi Philipp,
thanks for your awesome blog on training Flan T5 XXL. I am playing around with it, doing just zero-shot inference with the ds_flan_t5_z3_config_bf16.json DeepSpeed config file. I believ…
-
### Feature request
Hi, `AutoTokenizer.train_new_from_iterator` has a hardcoded `tqdm` progress bar that I want to swap for `rich`, and I'm happy to PR it back.
I can see on GitHub it's at `trans…
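One way the feature request above could be shaped (a hypothetical sketch, not the actual `tokenizers`/`transformers` API): instead of hardcoding `tqdm`, accept an optional wrapper callable so callers can plug in `tqdm`, `rich.progress.track`, or nothing at all. The function name and parameter here are illustrative assumptions.

```python
from typing import Callable, Iterable, Iterator, Optional, TypeVar

T = TypeVar("T")

def iterate_with_progress(
    items: Iterable[T],
    progress: Optional[Callable[[Iterable[T]], Iterable[T]]] = None,
) -> Iterator[T]:
    """Yield items, optionally wrapped by a caller-supplied progress bar.

    `progress` can be any callable that wraps an iterable and yields its
    items, e.g. tqdm.tqdm or rich.progress.track; None means no bar.
    """
    wrapped = items if progress is None else progress(items)
    for item in wrapped:
        yield item

# Usage: list(iterate_with_progress(range(3)))                  -> [0, 1, 2]
#        list(iterate_with_progress(range(3), progress=iter))   -> [0, 1, 2]
```

Making the bar injectable keeps the training loop free of any dependency on a specific progress library.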