safety-evaluation Search Results

microsoft/aitour-operate-ai-responsibly-with-ai-studio #23

Title & Description Update

**New Title**: Trustworthy AI: Advanced AI risk evaluation and mitigation **New Description**: New challenges and threats are emerging as AI evolves. Learn how Azure AI’s advanced responsible AI tooli…

aprilspeight updated 17 hours ago

stanford-crfm/helm #1595

Safety Evaluation

Evaluate safety issue of LLMs from various aspects (To be continued, open by JingFeng, Hongye, ...)

Mooler0410 updated 1 year ago

opea-project/GenAIEval #75

Restructure genAI Eval to address evaluation of multiple cat…

Restructure the genAI_Eval to categorize the various Evaluation criteria. Need to have different Repos for each category so it makes it easier to users to find what they are looking for. Example: A…

Padmaapparao updated 2 weeks ago

GSTT-CSC/project-template #88

PTD - Test updated documentation templates and data files

Create the list of the order of documents to fill in. this should follow the Design and development SOP to start with, with other QMS SOP following. **Order to fill out Templates** 1. Literature …

AnilMistry updated 1 month ago

patronus-ai/llama-3-toxicity-experiments #6

Questions/Issues in the Llama Guard Evaluation and Clarifica…

Hi there, Thank you for benchmarking Llama Guard 3 and sharing the learnings with us. We will continue to improve the Llama Guard series and Llama safety overall. However, I notice that there might…

JFChi updated 1 month ago

coin-or/ADOL-C #35

ADOL-C thread safety

I am working on a project which involves the evaluation of several functions and their derivatives. At each step, the derivatives of several different functions have to be evaluated (rather than sever…

chrhansk updated 2 weeks ago

huggingface/lighteval #324

[EVAL] Add RewardBench

## Evaluation short description - Why is this evaluation interesting? RewardBench is perhaps the _only_ evaluation suite to provide broad coverage of the strengths and weaknesses of reward models …

lewtun updated 2 months ago

United-Home/2405-45th-St #14

Assessment of Current Electrical Panel Box

#### Objective To conduct a comprehensive evaluation of the existing electrical panel box to identify its current capabilities, limitations, and any necessary upgrades for the planned expansion to 42…

SuzyQuant updated 3 weeks ago

openfoodfacts/openfoodfacts-server #595

Add a link to Food Authorities opinions in the Additive page

### What - WHO/FAO JECFA Toxicity evaluation - WHO/FAO JECFA Safety evaluation - EFSA opinions - FSANZ Hazard Assessment The taxonomy has a few examples already: https://en.wiki.openfoodfacts.…

teolemon updated 1 month ago

JailbreakBench/jailbreakbench #34

GPT4 as judge/classifier BadRequestError.

Hi team, I have tried to use GPT4 as a classifier to classify the model responses but am getting content moderation filter trigger. Changes made -- Instead of Llama70BJudge, I have a similar cla…

NamburiSrinath updated 1 day ago

1000+ results for safety-evaluation

1000+ results
for safety-evaluation