safety-evaluation Search Results

1000+ results
for safety-evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

stanford-crfm/helm #1595

Safety Evaluation

Evaluate safety issue of LLMs from various aspects (To be continued, open by JingFeng, Hongye, ...)

Mooler0410 updated 1 year ago
1
confident-ai/deepeval #881

Enhancing DeepEval's Vertex AI implementation sample

DeepEval docs have a Google Vertex AI implementation sample here: https://docs.confident-ai.com/docs/metrics-introduction#google-vertexai-example However, it has a number of issues: 1. It doesn…

meteatamel updated 6 days ago
1
FYYFU/safety-defense #1

Doubt about the existence of malicious documents

I noticed in the paper that 'In total, we collected 2,000 malicious documents for training with an average number of tokens of 702.79'. However, I do not find these malicious documents within the repo…

zwenjing95 updated 1 day ago
1
open-feature/python-sdk #96

Review thread safety of client/API/Evaluation context

Review the thread safety of the SDK to make sure there's no potential concurrency issues, particularly around state maintained in the global API object, clients, and evaluation context objects. See…

toddbaert updated 3 months ago
3
microsoft/promptflow #3512

[BUG]calling pf_client.run inside the callable target functi…

**Describe the bug** A clear and concise description of the bug. **How To Reproduce the bug** Steps to reproduce the behavior, how frequent can you experience the bug: I am trying to hook up the…

yanggaome updated 1 week ago
2
dynamic-superb/dynamic-superb #144

[Task] Vehicle sounds classification

# Task Name Vehicle sounds classification ## Task Objective The primary goal of this task is to evaluate the audio language model's capability to accurately recognize and classify different …

bunbun221008 updated 1 month ago
3
dynamic-superb/dynamic-superb #164

[Task] Industrial Machine situation

# Industrial Machine situation This dataset is a sound dataset for malfunctioning industrial machine investigation and inspection ## Task Objective In this task, we consider false positives and …

ChenWils updated 1 month ago
5
JailbreakBench/jailbreakbench #29

How to submit the attack results for the Claude-3 and gpt-4-…

Sorry for creating another issue, I am quite lost on how to submit the results of the attack for Claude-3 and GPT-4-0613. Another question I have concerns the use of a benign dataset. I observed…

iBibek updated 2 weeks ago
2
HowieHwong/TrustLLM #25

scripts to reproduce the results in the paper

Hi Could you share scripts that may reproduce the results in the paper ? Thanks. I tried the generation and evaluation for safety using the following script on an Nvidia GPU. The results are …

jinz2014 updated 1 month ago
1
RLHFlow/RLHF-Reward-Modeling #21

Training and evaluating for pair_pm model.

Hi, I have replicated the training and evaluation for the pair_rm model, but I haven't achieved the results reported in Table 2 of the paper. The best results I obtained were with pm_models/llama3-…

t-sifanwu updated 1 week ago
5

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for safety-evaluation

1000+ results
for safety-evaluation