-
```python
from bigdl.llm.transformers import AutoModelForCausalLM
import torch.nn.utils.prune as prune
model_path = r'D:\test_bigdl\model\Baichuan2-7B-Chat'
model = AutoModelForCausalLM.from_pretrained(mod…
```
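For context, a minimal sketch of what the complete load-and-prune flow might look like; the `trust_remote_code` flag, the choice of pruning target, and the 30% amount are assumptions, not taken from the truncated snippet above:

```python
import torch
import torch.nn.utils.prune as prune
from bigdl.llm.transformers import AutoModelForCausalLM

model_path = r'D:\test_bigdl\model\Baichuan2-7B-Chat'

# Baichuan2 ships custom modeling code, so trust_remote_code is usually required.
# load_in_4bit is omitted here because torch.nn.utils.prune operates on
# full-precision nn.Linear weights (an assumption about the intent).
model = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)

# Apply 30% L1 unstructured pruning to every linear layer (amount is arbitrary).
for name, module in model.named_modules():
    if isinstance(module, torch.nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.3)
        prune.remove(module, "weight")  # bake the pruning mask into the weights
```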
-
🎉Fine-tuning (VQA/OCR/Grounding/Video) for the Qwen2-VL-Chat series models is now supported; please check the documentation below for details:
# English
https://github.com/modelscope/ms-swift/blob/m…
-
If I understand correctly, the idea is that the model generates belief states, DB search results, actions, and responses conditioned on some dialog context. Then shouldn't we mask the context in between …
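For what it's worth, a minimal sketch of the usual way to do that masking, assuming HuggingFace-style causal-LM training where positions labeled -100 are ignored by the cross-entropy loss (the sequence lengths and context boundary here are made up):

```python
import torch

def mask_context(input_ids: torch.Tensor, context_len: int) -> torch.Tensor:
    """Build labels so only belief-state / DB-result / action / response
    tokens contribute to the LM loss; the dialog-context prefix is masked."""
    labels = input_ids.clone()
    labels[:, :context_len] = -100  # -100 is ignored by CrossEntropyLoss
    return labels

# hypothetical batch: the first 20 tokens are the dialog context
input_ids = torch.randint(0, 32000, (2, 64))
labels = mask_context(input_ids, context_len=20)
```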
-
Hi, thanks for sharing the code! I am trying to reproduce the results and have run into some problems.
Below are my steps to train a model on the _DailyDialog_ corpus and evaluate it on the _DialSeg_711_ da…
-
- [ ] [Prompt engineering - OpenAI API](https://platform.openai.com/docs/guides/prompt-engineering/strategy-write-clear-instructions)
# Prompt Engineering - OpenAI API
## Six strategies for getting …
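As a quick illustration of the "write clear instructions" strategy linked above, a minimal sketch using the `openai` Python client; the model name and the prompt wording are assumptions, not taken from the guide:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Spell out the role, the task, the constraints, and the output format
# instead of a vague one-liner like "summarize this".
response = client.chat.completions.create(
    model="gpt-4o-mini",  # hypothetical model choice
    messages=[
        {"role": "system",
         "content": ("You are a technical editor. Summarize the user's text "
                     "in exactly three bullet points, each under 15 words.")},
        {"role": "user", "content": "<text to summarize>"},
    ],
)
print(response.choices[0].message.content)
```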
-
![image](https://github.com/lm-sys/FastChat/assets/131651962/4d672786-2818-4ee7-89bf-22c0ae24daae)
Using the command `python3 -m fastchat.serve.cli --model-path` to call the dialogue bot opened by the mode…
-
```bash
2024-10-23 11:54:32,830 - _client.py[line:1038] - INFO: HTTP Request: GET http://127.0.0.1:7862/sdfiles/download?filename=&save_filename= "HTTP/1.1 200 OK"
2024-10-23 11:54:32.832 | DEBUG …
-
I am trying to attack llama-2-13b, and I ran run_gcg_multiple.sh; the command is:
```bash
export model=llama2_13
export set_name=merged
export CUDA_VISIBLE_DEVICES=7
export batch_size=512
python -…
-
- [ ] [[2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment](https://arxiv.org/abs/2303.16634)
# [2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
…
-
@kadirnar
I have received the following error when running the diarization pipeline:
```python
ValueError: attempt to get argmin of an empty sequence
```
I ran the diarization example exactl…