-
@LaurentMazare
How can I use candle for a cross-encoder from the sentence-transformers models (the msmarco models, e.g. msmarco-distilroberta-base-v3)?
Does it require a different implementation stack …
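For context on why the stack may differ: a bi-encoder pools each text into its own embedding and compares the two, while a cross-encoder feeds the concatenated (query, passage) pair through the encoder once and reads a single relevance logit off a classification head. A toy numpy sketch of that second path — every dimension and weight here is a random placeholder, not the real msmarco-distilroberta-base-v3 parameters:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy hidden size standing in for the real model's width; all weights
# below are random placeholders, not trained parameters.
hidden = 768

def encode_pair(query_len, passage_len):
    """A cross-encoder feeds ONE concatenated sequence through the encoder:
    [CLS] query [SEP] passage [SEP]. Here we fake the encoder output."""
    seq_len = query_len + passage_len + 3  # special tokens
    return rng.standard_normal((seq_len, hidden))

def score(hidden_states, w, b):
    """Classification head on the [CLS] token -> one relevance logit,
    not a pooled embedding."""
    cls = hidden_states[0]            # first token's hidden state
    return float(cls @ w + b)         # scalar score per (query, passage) pair

w = rng.standard_normal(hidden)
h = encode_pair(5, 40)
s = score(h, w, 0.0)
print(type(s).__name__)  # float
```

So in candle this presumably means wiring up the encoder plus a sequence-classification head, rather than the pooling-and-cosine path used for the embedding models.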
-
I finished rendering, and when I was ready to train NeRF I used only 20 datasets, yet found that it needed quite a lot of memory. What happened? I need your help.
(instantmesh1) mrguanglei@guang…
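Not a fix, but a likely explanation: NeRF training memory is dominated by rays-per-batch × samples-per-ray activations flowing through the MLP, not by the number of input images, so even 20 views can exhaust a GPU. A back-of-the-envelope sketch with illustrative numbers (not InstantMesh's actual config):

```python
# Rough activation-memory estimate for one NeRF training step.
# All numbers are illustrative defaults, not the project's real settings.
rays_per_batch = 4096
samples_per_ray = 192        # e.g. coarse + fine samples combined
feature_dim = 256            # MLP hidden width
mlp_layers = 8               # activations kept for backprop per layer
bytes_per_float = 4

points = rays_per_batch * samples_per_ray            # 786,432 sample points
activation_bytes = points * feature_dim * bytes_per_float * mlp_layers
print(f"{activation_bytes / 1024**3:.2f} GiB of activations")  # 6.00 GiB
```

Halving `rays_per_batch` (or the samples per ray) halves this figure, which is why batch-size knobs, not dataset size, are the usual lever for out-of-memory errors here.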
-
I'm encountering an issue with the dimensions of the text encoder output in a fine-tuned CLIP model. The fine-tuning output of my CLIP model based on RN50 is (1, 1024), whereas the output from CLIPTex…
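A shape walk-through that may explain the mismatch: in OpenAI's RN50 CLIP (as I recall its config — verify against your checkpoint) the text transformer width is 512 while the joint embedding size is 1024, and `CLIPTextModel` returns the pooled transformer output *before* the `text_projection` that the full `CLIPModel` (or `CLIPTextModelWithProjection`) applies:

```python
import numpy as np

# Dims from my reading of RN50 CLIP's config; check your checkpoint.
transformer_width = 512      # RN50 text tower hidden size
embed_dim = 1024             # RN50 joint embedding size

# CLIPTextModel-style pooled output: still in transformer width.
pooled = np.zeros((1, transformer_width))
# text_projection maps into the joint image/text embedding space.
text_projection = np.zeros((transformer_width, embed_dim))
text_embeds = pooled @ text_projection
print(pooled.shape, text_embeds.shape)  # (1, 512) (1, 1024)
```

If one code path reads the pooled output and the other reads the projected embeddings, you would see exactly this kind of dimension disagreement.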
-
(base) root@autodl-container-aa9a42a072-5939b670:~/autodl-tmp# conda activate llama-omni
(llama-omni) root@autodl-container-aa9a42a072-5939b670:~/autodl-tmp# python -m omni_speech.serve.model_worker…
-
# 🌟 New model addition
## Model description
Google recently published a paper titled ["Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matchin…
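For readers unfamiliar with the paper: SMITH's key idea is a two-level hierarchy — split a long document into fixed-size sentence blocks, encode each block independently, then run a document-level transformer over the block representations. A minimal sketch of the splitting step (block and document sizes here are illustrative, not the paper's exact values):

```python
def split_into_blocks(token_ids, block_size=32, max_blocks=64):
    """Greedily split a long token sequence into fixed-size blocks, as in
    SMITH's sentence-block / document two-level hierarchy. The last block
    may be short; real implementations would pad it."""
    blocks = [token_ids[i:i + block_size]
              for i in range(0, len(token_ids), block_size)]
    return blocks[:max_blocks]

doc = list(range(100))                   # stand-in for 100 token ids
blocks = split_into_blocks(doc, block_size=32)
print(len(blocks), [len(b) for b in blocks])  # 4 [32, 32, 32, 4]
```

Each block is then encoded by the first-level transformer, and only the per-block vectors — not the raw tokens — reach the second level, which is what lets the model scale past the 512-token limit.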
-
Hi!
After replacing an eight-layer Transformer encoder with Mamba, the training loss fails to decrease. Could it be that Mamba doesn't perform as effectively as the Transformer in the diffusion model…
-
Hi,
I'm interested in understanding what the code does.
```
**easyblock("model.diffusion_model.output_blocks.6.0", "P_bg208","P_bg209"),
**conv("model.diffusion_model.output_blocks.6.0…
```
-
### Describe the bug
I am using the training script documented here https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/README_flux.md to train a LoRA on my dataset.
Here is th…
-
### Current Behavior
Using the default onnx model,
**Score function**
```
def get_score(a, b):
    return evaluation.evaluation(
        {
            'question': a
        },
        {
            …
```
-
I am trying to import the aforesaid model using the following command:
```
eland_import_hub_model --url --hub-model-id dunzhang/stella_en_400M_v5 \
--task-type text_embedding --es-username e…
```