linear-transformer Search Results

1000+ results
for linear-transformer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/transformers #28441

Proposal for Adding a New Scheduler Strategy for Language Mo…

### Feature request We try to propose the addition of a new and widely-adopted scheduler strategy for language model pretraining in the Transformers repository. Upon reviewing the current schedulers …

gmftbyGMFTBY updated 8 months ago
4
facebookresearch/detr #419

Train DETR using small dataset (3k examples)

## ❓ How to fine-tune DETR using a small dataset (3k examples) Hi everyone, I'm using DETR in my master's thesis: it concerns the development of a door recognizer. This is my first experience wit…

micheleantonazzi updated 3 years ago
1
facebookresearch/audiocraft #74

Python script using the medium model doesn't work

I am trying to run the following python script, which uses the medium model: ``` import torchaudio from audiocraft.models import MusicGen from audiocraft.data.audio import audio_write model =…

vegandiet705 updated 6 months ago
11
rl-institut/multi-vector-simulator #707

Add life cycle emissions

Currently emissions in MVS are defined by an `emission_factor` and the flow of the asset. Life cycle emissions will probably be defined by an `emission_factor` and the installed capacity. These emis…

SabineHaas updated 3 years ago
1
intel-analytics/ipex-llm #12080

1xArc encounters OOM when running Qwen2-7B, load_in_bit=sym_…

### script ``` numactl -C 11-15 python ./benchmark_docker_throughput.py \ --backend vllm \ --dataset /data/ShareGPT_V3_unfiltered_cleaned_split.json \ --model Qwen2-7B-Instruct \ --tru…

aprilhu01 updated 2 weeks ago
2
NVIDIA/apex #1764

apex installation failures

command "pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./" error log: "DEPRECATION: --build-option and --global-option are depr…

momo1986 updated 6 months ago
1
WisdomShell/codeshell #71

CodeShell-7B-Chat-int4 启动wed_demo.py发送信息时报错，AttributeError: …

Exception in thread Thread-6 (generate): Traceback (most recent call last): File "/opt/conda/lib/python3.10/threading.py", line 1016, in _bootstrap_inner self.run() File "/opt/conda/lib/py…

Coooolrui updated 6 months ago
2
THUDM/ChatGLM-6B #988

GPU推理时，history不为空就报cuda error

### Is there an existing issue for this? - [X] I have searched the existing issues ### Current Behavior 用的是/chatGLM-6B-int4模型，在cpu推理时一切正常。当在gpu推理时，如果history为空，则不报错。如果history不为空，则直接报cuda错误。具体错误如下： …

wwlaoxi updated 1 year ago
2
scikit-learn/scikit-learn #22827

Improve tests by using global_random_seed fixture to make th…

## Context: the new `global_random_seed` fixture #22749 introduces a new `global_random_seed` fixture to make it possible to run the same test with any seed between 0 and 99 included. By default, w…

ogrisel updated 3 months ago
14
JohnSnowLabs/spark-nlp #14215

When Attempting to loadSavedModel, I Encountered 'java.lang.…

### Is there an existing issue for this? - [X] I have searched the existing issues and did not find a match. ### Who can help? _No response_ ### What are you working on? I fine-tuned a T5 model…

TerryLaw535 updated 4 months ago
16

上一页 1...89 90 91 92 93 94 95...100 下一页

1000+ results for linear-transformer

1000+ results
for linear-transformer