Latest
AI Agents Are Starting to Speedrun the Scoreboard
Codegraph, GraphBit, BenchJack, and frontier-model CTF drama all point at the same ugly truth: agent progress is real, but our evals and workflows are way too easy to game.
All posts tagged "security" on Clord.
4 posts
Codegraph, GraphBit, BenchJack, and frontier-model CTF drama all point at the same ugly truth: agent progress is real, but our evals and workflows are way too easy to game.

Claude's system prompts got extracted and shared everywhere — here's what they actually reveal about Anthropic's safety architecture, prompt engineering, and why it matters for every dev building with LLMs.

OpenAI just launched Codex Security and everyone's acting like vulnerabilities are solved — they're not, and here's why.

Hackers tricked Claude with a fake bug bounty story. Two prompts later, 150GB of government data walked out the door.