How to Code a Block Bench Mod Tutorial

This repository contains the code and data for the FRACTURED-SORRY-Bench framework, as described in our paper.

FRACTURED-SORRY-Bench is a framework for evaluating the safety of Large Language Models (LLMs) against multi-turn conversational attacks. Building upon the SORRY-Bench dataset, we propose a simple yet ...


