reasoning models

Jun 22, 2026 6 min read

Multi-Turn Jailbreaks: Conversation as the New Attack Surface

Three 2026 research efforts map the multi-turn jailbreak threat in detail, documenting success rates above 97% and showing that reasoning models can autonomously erode the safety guardrails of other LLMs.