-
Why is it stated that "Please note that the RMSE and Accuracy stuffs in the training log is not reliable" for SAHP and THP?
zhjcp updated
2 months ago
-
```
Traceback (most recent call last):
  File "/root/miniconda3/envs/opensora/lib/python3.10/site-packages/gradio/queueing.py", line 541, in process_events
    response = await route_utils.call_pro…
```
-
### System Info
- `transformers` version: 4.37.1
- Platform: Linux-4.18.0-477.27.1.el8_8.x86_64-x86_64-with-glibc2.31
- Python version: 3.10.13
- Huggingface_hub version: 0.20.3
- Safetensors ver…
-
### System Info
- `transformers` version: 4.44.0
- Platform: Linux-5.15.0-91-generic-x86_64-with-glibc2.31
- Python version: 3.10.12
- Huggingface_hub version: 0.23.4
- Safetensors version: 0.4…
-
### 🚀 The feature, motivation and pitch
Torch's embedding layers only accept int32 and int64 as input. However, for sequences with a small number of distinct possible tokens (e.g., ASCII character em…
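A minimal sketch of the limitation being described (the 128-entry "ASCII vocabulary" here is illustrative, not from the original report): `nn.Embedding` accepts int32/int64 indices, narrower integer dtypes are rejected at lookup time, and the current workaround is to store indices compactly and upcast when calling the layer.

```python
import torch
import torch.nn as nn

# Illustrative ASCII-sized vocabulary: 128 tokens, so int8 indices would suffice.
emb = nn.Embedding(num_embeddings=128, embedding_dim=16)

# int64 indices work as expected.
idx64 = torch.tensor([72, 105], dtype=torch.int64)  # "H", "i"
out = emb(idx64)
print(out.shape)  # torch.Size([2, 16])

# Narrower integer dtypes are rejected by the embedding lookup.
idx8 = torch.tensor([72, 105], dtype=torch.int8)
try:
    emb(idx8)
except RuntimeError:
    print("int8 indices rejected")

# Workaround today: keep the compact dtype in storage and upcast per lookup.
out2 = emb(idx8.long())
print(torch.equal(out, out2))  # True
```

The upcast copy is what the feature request wants to avoid: for long character-level sequences, materializing an int64 view of int8 indices costs 8x the memory of the stored indices.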
-
**Context and question**
Getting an error in BIOMOD_ModelingOptions
**Code used**
```
## setup environment ----
> library(mda)
> library(gam)
> library(earth)
> library(maxnet)
> library(xg…
```
-
FAQ on which information modeling languages OSIM will use.
I propose we allow UML, ASN.1, and JADN, and potentially any other standard information modeling language TC Members propose.
I propose w…
-
I managed to finetune the mini-gemini mixtral model; however, post finetuning I am unable to run inference with the model. I tried to launch a model worker as described in the repo: `python -m minigemini.serv…
-
**Case: SQuAD task, sequence length > 512**
Does your script utilize cached memory/extended context in a segment, such that the predictions are inferred from sequences longer than 512 tokens?
If…
-
### Describe the question.
When I run the long context modeling demo from https://github.com/InternLM/InternLM/tree/main/long_context,
I get the following error:
openai.OpenAIError: The api…