Closed by panuthept 1 week ago
Modification scope:
src/iris/augmentations/instruction_augmentations
You can use GPT-4o as an attack model for the sake of simplicity.
import os

from llama_index.llms.openai import OpenAI
from iris.model_wrappers.generative_models import APIGenerativeLLM

# The API key is read from the OPENAI_API_KEY environment variable.
attack_model=APIGenerativeLLM(
    llm=OpenAI(
        model="gpt-4o",
        api_key=os.environ.get("OPENAI_API_KEY"),
    ),
)
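Note that `os.environ.get` returns `None` when the variable is unset, so a missing key may only surface later as an authentication error. A minimal sketch of failing early, assuming a hypothetical helper named `require_api_key` (it is not part of IRIS or llama_index):

```python
import os


def require_api_key(name: str = "OPENAI_API_KEY") -> str:
    """Return the named environment variable, or raise if it is unset or empty.

    Hypothetical helper for illustration only.
    """
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(f"Environment variable {name} is not set")
    return value
```

With this in place, `api_key=require_api_key()` could be passed instead of `api_key=os.environ.get("OPENAI_API_KEY")`.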
0%| | 0/100 [00:00<?, ?it/s]
Traceback (most recent call last):
  File "d:\Users\mpmac\Documents\GitHub\IRISS\IRIS\src\iris\augmentations\instruction_augmentations\jailbreaks\multilingual_jailbreak.py", line 136, in
We can use EasyJailbreak (https://github.com/EasyJailbreak/EasyJailbreak). The interface should look something like this: