-
**Suggested steps:**
* [ ] Define unsupervised learning tasks, i.e., learning tasks that don't require truth-level labels but instead rely solely on the reconstruction-level data. This is the same…
-
Hi,
I am trying to apply reward modelling to an IterableDataset and am hitting a strange failure mode that I am struggling to debug. I can replicate the same stack trace in the reward_m…
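For reference, a minimal sketch of the kind of streaming setup involved. The class and field names (`PairStream`, `chosen`/`rejected`) are illustrative, not necessarily the trainer's expected schema:

```python
from torch.utils.data import DataLoader, IterableDataset


class PairStream(IterableDataset):
    """Streams (chosen, rejected) text pairs without materializing them all in memory."""

    def __init__(self, pairs):
        self.pairs = pairs  # any iterable; could be a generator reading a file

    def __iter__(self):
        for chosen, rejected in self.pairs:
            yield {"chosen": chosen, "rejected": rejected}


# Usage: the default collate turns each string field into a list per batch.
stream = PairStream([("good answer", "bad answer")])
batch = next(iter(DataLoader(stream, batch_size=1)))
```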
-
Attached, please find the output of the WebUI and the Ollama server console. At line 1 of the WebUI output, I ask the question using llama3:latest (line 3). The result is shown in lines 4-42.
At line 45, I ask sam…
-
When doing inference on Gemma-2-2B with Flash Attention 2, I get the following error. It works just fine with Flash Attention disabled.
transformers==4.44.0
torch==2.4.0
flash-attn==2.6.3
python…
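As a debugging workaround, the attention backend can be selected at model load time via the `attn_implementation` argument; a hedged sketch (the fallback helper below is my own, not part of transformers):

```python
import importlib.util


def pick_attn_implementation(prefer_flash: bool = True) -> str:
    # Fall back to the default "eager" path when flash-attn is not installed,
    # or when you need to rule Flash Attention 2 out, as in the bug above.
    if prefer_flash and importlib.util.find_spec("flash_attn") is not None:
        return "flash_attention_2"
    return "eager"


# Hypothetical usage (not executed here; requires downloading the weights):
# model = AutoModelForCausalLM.from_pretrained(
#     "google/gemma-2-2b",
#     attn_implementation=pick_attn_implementation(),
#     torch_dtype=torch.bfloat16,
# )
```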
-
### 🚀 The feature, motivation and pitch
[GPT-2 SDPA](https://github.com/huggingface/transformers/blob/main/src/transformers/models/gpt2/modeling_gpt2.py#L182-L220) pattern is not currently being ma…
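For context, the linked GPT-2 path dispatches to `torch.nn.functional.scaled_dot_product_attention`. A minimal sketch of the call shape involved, checked against a hand-rolled causal attention (tensor shapes follow GPT-2's `(batch, heads, seq, head_dim)` layout; the sizes are arbitrary):

```python
import math

import torch
import torch.nn.functional as F

# Tiny causal-attention check: the fused SDPA call should match
# softmax(QK^T / sqrt(d)) V with an upper-triangular mask.
q = torch.randn(1, 2, 4, 8)
k = torch.randn(1, 2, 4, 8)
v = torch.randn(1, 2, 4, 8)

fused = F.scaled_dot_product_attention(q, k, v, is_causal=True)

scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
mask = torch.triu(torch.ones(4, 4, dtype=torch.bool), diagonal=1)
scores = scores.masked_fill(mask, float("-inf"))
manual = torch.softmax(scores, dim=-1) @ v
```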
-
Hello,
Thanks for creating this very helpful tool!
I am fine-tuning the **_model (GPT-J-6B)_** for question answering on private documents. I have 1000+ documents, and they are all in text f…
-
https://aclanthology.org/2024.eacl-long.105
-
# Problem
After converting the weights, the following problem appears during evaluation:
```shell
> number of parameters on (tensor, pipeline) model parallel rank (0, 0): 630167424
loading release checkpoint from /raid/LLM_train/Pai-Megatron-Patch/checkpoint…
```
-
I'm running Ollama on my Mac M1 and I'm trying to use the 7B models for processing batches of questions/answers.
I noticed that after a while Ollama just hangs and the process stays there forever.
…
-
Facing an issue while setting up the repo and installing during this step:
> pip3 install -e .
The error occurs while building the wheel for the xformers package. I am using a MacBook M1. Any leads wou…