0
Applied AI·June 26, 2026·1 min read

OpenAI says GPT-5.6 Sol and Terra were capable of identifying vulnerabilities but were unable to execute autonomous, end-to-end attacks against hardened targets

Share

Top-tier models can now reliably spot vulnerabilities but still struggle to chain them into autonomous, end-to-end attacks against hardened systems — the offense/defense gap is narrower but not closed. Security teams should treat LLMs as powerful recon and red-teaming tools while still assuming human-led orchestration for serious threats.