-
## 🐛 Bug
RedPajama-INCITE-Chat-3B-v1-q4f16_1 crashes on iOS when using an iPhone 15 (Non Pro)
## To Reproduce
Steps to reproduce the behavior:
1. Download the MLCChat app on an iPhone 15 (Non…
-
`bash scripts/llama_7b.sh`
The source model's WikiText perplexity is 5.67702. After pruning this model at 50% sparsity, I get a WikiText perplexity of 7.09153509, but the paper reports 7.26 at 50% sparsity.
Why is there a discrepancy?
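For reference, a minimal sketch of the usual sliding-window WikiText-2 perplexity evaluation (the model path is a placeholder; the repo's own eval script may use a different window size, stride, or tokenization, any of which can shift the reported number):
```python
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "path/to/pruned-llama-7b"  # placeholder, not a real checkpoint
model = AutoModelForCausalLM.from_pretrained(
    model_path, torch_dtype=torch.float16
).cuda()
tokenizer = AutoTokenizer.from_pretrained(model_path)

test = load_dataset("wikitext", "wikitext-2-raw-v1", split="test")
enc = tokenizer("\n\n".join(test["text"]), return_tensors="pt")

seqlen = 2048  # window size; a different choice changes the final number
nlls = []
for i in range(enc.input_ids.size(1) // seqlen):
    batch = enc.input_ids[:, i * seqlen : (i + 1) * seqlen].cuda()
    with torch.no_grad():
        loss = model(batch, labels=batch).loss  # mean token NLL over the window
    nlls.append(loss.float() * seqlen)

ppl = torch.exp(torch.stack(nlls).sum() / (len(nlls) * seqlen))
print(f"WikiText-2 perplexity: {ppl.item():.5f}")
```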
-
This issue is just a place to keep track of issues filed at other GitHub projects or on the Hugging Face Hub to ask for documentation.
-
Hi, this is impressive work. However, when I tried to evaluate with the config:
- mpt-1b-redpajama-200b-dolly
- Loading the original Flamingo checkpoint from OpenFlamingo-3B-vitl-mpt1b-langinstruct/chec…
-
### Describe the bug
1. `dataset = load_dataset("togethercomputer/RedPajama-Data-1T-Sample", cache_dir=training_args.cache_dir, split='train[:1%]')`
2. `dataset = load_dataset("togeth…`
-
Code is asking me for a config name, e.g.:
```
load_dataset('togethercomputer/RedPajama-Data-1T', "default")
```
I want to use all the datasets. Is "default" the right argument?
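For what it's worth, the dataset card lists "default" as the config that combines every subset, alongside individual subset configs (arxiv, book, c4, common_crawl, github, stackexchange, wikipedia); treat these names as an assumption if the card has since changed. A minimal sketch, streaming to avoid downloading the full corpus up front:
```python
from datasets import load_dataset

# "default" loads all subsets together; pass a single subset name
# (e.g. "arxiv") to load just that slice.
ds = load_dataset(
    "togethercomputer/RedPajama-Data-1T",
    "default",
    streaming=True,  # the full corpus is ~1T tokens, so stream it
)
print(next(iter(ds["train"])))
```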
-
### 🐛 Describe the bug
Can we run both of them with the same dataset, e.g. RedPajama-Data-1T-Sample?
### Environment
_No response_
-
Language model: anas-awadalla/mpt-1b-redpajama-200b
Vision encoder: OpenAI CLIP ViT-L/14
Fine-tuned model: pretrained model checkpoint_gripper_post_hist_1_aug_10_4_traj_cons_ws_12_mpt_3b_4.pth (down…
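For context, these components match how OpenFlamingo models are usually built; a minimal sketch using `open_flamingo.create_model_and_transforms` (the `cross_attn_every_n_layers` value is an assumption and must match the checkpoint being loaded, and the checkpoint path is a placeholder):
```python
import torch
from open_flamingo import create_model_and_transforms

model, image_processor, tokenizer = create_model_and_transforms(
    clip_vision_encoder_path="ViT-L-14",          # OpenAI CLIP ViT-L/14
    clip_vision_encoder_pretrained="openai",
    lang_encoder_path="anas-awadalla/mpt-1b-redpajama-200b",
    tokenizer_path="anas-awadalla/mpt-1b-redpajama-200b",
    cross_attn_every_n_layers=1,  # assumption: must match the fine-tuned checkpoint
)

# Hypothetical path; load the fine-tuned weights on top of the base model.
ckpt = torch.load("path/to/finetuned_checkpoint.pth", map_location="cpu")
model.load_state_dict(ckpt, strict=False)
```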
-
Use a Llama 2 model and train it on all the latest and more efficient open datasets (e.g. SlimPajama vs. RedPajama)?
Just for the base model; then maybe the Open Assistant team can RLHF it.
-
### What happened + What you expected to happen
I want to use Ray for large-scale text data cleaning tasks, and I extracted 5 million records from the open-source RedPajama GitHub dataset for testing, w…
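For context, a minimal sketch of the kind of batched cleaning pipeline Ray Data supports (the input/output paths and the `clean_batch` rule are placeholders, not the pipeline from this report):
```python
import numpy as np
import ray

ray.init()

def clean_batch(batch: dict) -> dict:
    # Placeholder rule: collapse whitespace; a real pipeline would add
    # deduplication, language filtering, etc.
    batch["text"] = np.array(
        [" ".join(t.split()) for t in batch["text"]], dtype=object
    )
    return batch

ds = ray.data.read_json("/data/redpajama_sample/")  # hypothetical input path
ds = ds.map_batches(clean_batch)                    # runs in parallel across the cluster
ds.write_json("/data/redpajama_cleaned/")           # hypothetical output path
```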