-
Looking into standard preprocessing pipelines, it would be good to add common average re-referencing.
See the background article here: https://eeglab.org/tutorials/ConceptsGuide/rereferencing_background.html
…
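For context, common average re-referencing is just a per-sample mean subtraction across channels; here is a minimal, library-agnostic NumPy sketch (the array shape is an assumption):

```python
import numpy as np

# data: EEG array of shape (n_channels, n_samples); the shape is an assumption.
data = np.random.randn(32, 1000)

# Common average reference: subtract the mean across channels at every sample.
car_data = data - data.mean(axis=0, keepdims=True)
```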
-
**Description**
I would like to shard one large LLM across multiple GPUs, but Triton wants to load a separate copy of the model onto each GPU, which results in an OOM.
**Triton Information**
Wh…
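For reference, this is not Triton's own API, but a minimal sketch of the desired behaviour (one sharded copy instead of one copy per GPU) using Hugging Face transformers with accelerate-style device mapping; the model id is a hypothetical placeholder:

```python
from transformers import AutoModelForCausalLM

# One sharded copy across all visible GPUs (requires `accelerate` installed);
# the model id is hypothetical.
model = AutoModelForCausalLM.from_pretrained(
    "my-org/my-large-llm",
    device_map="auto",   # place different layers on different GPUs
    torch_dtype="auto",
)
```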
-
I added an STN (spatial transformer network) on top of LPRNet, but this new structure seems very hard to train: the loss does not decrease. Does anyone know how to deal with this?
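Not specific to this repo, but a common fix when an STN-augmented network's loss refuses to drop is to initialize the STN's localization head to the identity transform, as in the standard PyTorch STN tutorial, so the STN initially passes inputs through unchanged; the input width (32) below is a placeholder:

```python
import torch
import torch.nn as nn

# Final layer of the STN's localization network; 32 input features is a placeholder.
loc_head = nn.Linear(32, 6)  # 6 parameters of the 2x3 affine matrix

# Initialize to the identity transform so the STN is a no-op at the start of
# training and the rest of LPRNet can learn normally.
loc_head.weight.data.zero_()
loc_head.bias.data.copy_(torch.tensor([1, 0, 0, 0, 1, 0], dtype=torch.float))
```

Training the STN branch with a lower learning rate than the backbone is another commonly reported remedy.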
-
## ❓ Questions and Help
Hi all,
I am using the pre-trained transformer for my project. I followed ['reproduce ende-wmt14'](https://github.com/facebookresearch/fairseq/issues/346) to train the tran…
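For reference, a pre-trained En-De transformer can also be loaded through torch.hub as shown in fairseq's translation examples; the hub id below is the one those examples use for the scaling-NMT En-De model, but verify it against the current fairseq docs:

```python
import torch

# Load a pre-trained En-De transformer via torch.hub (hub id per fairseq's
# translation examples; confirm against the current fairseq README).
en2de = torch.hub.load('pytorch/fairseq', 'transformer.wmt16.en-de',
                       tokenizer='moses', bpe='subword_nmt')
print(en2de.translate('Hello world!'))
```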
-
Hi, we're using the litgpt framework to train models and would then like to export them to Hugging Face format for continued tuning and evaluation.
The steps we're using after completing training ar…
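The step list above is cut off; for reference, here is a sketch of the final loading step, assuming the litgpt checkpoint has already been converted to a Hugging Face compatible state dict (litgpt ships a conversion utility, but the exact command depends on the version; the paths and base model id here are hypothetical placeholders):

```python
import torch
from transformers import AutoModelForCausalLM

# Load converted weights into a transformers model for further tuning/eval.
# The path and base model id are hypothetical placeholders.
state_dict = torch.load("out/converted/model.pth")
model = AutoModelForCausalLM.from_pretrained(
    "my-org/base-model",
    state_dict=state_dict,
)
```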
-
Hi,
from reading the UDOP paper, my understanding is that during pre-training the model is taught to predict the layout of a target (textual) sequence using special layout tokens.
I was wondering …
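For concreteness, here is a sketch of that kind of discretization: each normalized bounding-box coordinate is mapped to one of a fixed number of bins and rendered as a special token. The token format and the bin count of 500 are assumptions for illustration, not necessarily UDOP's exact vocabulary:

```python
def bbox_to_layout_tokens(bbox, num_bins=500):
    """Map an (x0, y0, x1, y1) box with coordinates in [0, 1] to layout tokens.

    The token format and bin count are assumptions for illustration.
    """
    return [f"<loc_{min(int(c * num_bins), num_bins - 1)}>" for c in bbox]

print(bbox_to_layout_tokens((0.12, 0.30, 0.48, 0.36)))
# ['<loc_60>', '<loc_150>', '<loc_240>', '<loc_180>']
```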
-
**Describe the bug**
Installation error on the first step of the tutorial when using Studio Lab (https://github.com/aws/studio-lab-examples/blob/main/connect-to-aws/Access_AWS_from_Studio_Lab_Deploy…
-
Hello, first of all, thank you for open-sourcing this training code, which is very important for many of us developers.
At present, I want to use my small personal dataset for fine-tuning under the origin…
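The sentence above is cut off, so the base model and framework are unknown; as one common route for fine-tuning on a small personal dataset, here is a minimal LoRA sketch using the peft library, assuming a Hugging Face style causal LM (the model id and target module names are placeholders, and LoRA is a suggested technique, not necessarily this repo's method):

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Model id and target module names are hypothetical placeholders.
model = AutoModelForCausalLM.from_pretrained("my-org/base-model")
lora_cfg = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                      target_modules=["q_proj", "v_proj"])
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # only the adapter weights are trainable
```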
-
Hi, dear author:
The llava-next project seems to be really insightful exploratory work. Please kindly release the training and inference code as soon as possible; thank you very much.
-
Argilla integration, dataset integration, etc.
Details to follow.