-
Appreciate your excellent work!
Recently, we used AutoDAN-Turbo, whose strategies were trained on GPT-4o-vs-GPT-4o adversarial attacks, to jailbreak GPT-o1-preview and ultimately achieved a 74.6% ASR, which is a little…
-
### What's the name of your attack?
AIR
### What's the title of the paper where you present your attack?
You Know What I'm Saying - Jailbreak Attack via Implicit Reference
### What's the URL of th…
-
-
Hello,
When using OpenAI models as the attack model in TAP & PAIR, an error occurs:
```
Traceback (most recent call last):
  File "PATH/easyjailbreak/run_PAIR.py", line 43, in <module>
    attacker.attack(s…
-
```
from easyjailbreak.models.huggingface_model import (HuggingfaceModel,
                                                    from_pretrained)
from easyjailbreak.models.openai_model import OpenaiModel
```
…
-
I tried to follow your paper, but it did not work in any case. The setting described in the paper is vague and hard to follow, which makes it difficult to reproduce your results. There is no way to benchmark against other research or deve…
-
Hi,
I mentioned this on Discord:
https://discord.com/channels/1156626270772269217/1272542948848042086
**Describe the bug**
I'm using AzureOpenAiChatModel to connect to Azure OpenAI and …
-
This seems like a very important finding mentioned in your [blog](https://huggingface.co/blog/leaderboard-decodingtrust) and something deserving of further exposition.
Submitting your paper to Gemi…
-
### Did you check docs and existing issues?
- [x] I have read all the NeMo-Guardrails docs
- [x] I have updated the package to the latest version before submitting this issue
- [x] I have searched th…
-
✅ I confirm that I have searched existing issues and couldn't find a similar bug.
✅ I have read the documentation carefully, and it couldn't resolve my problem.
### Describe the bug
Apparently th…