AI Watchtower is a critical-watch project on the security of AI systems. Each entry takes a piece of ongoing research — direct and indirect prompt injection, MCP server vulnerabilities, agent threat models, concrete hardening tradeoffs — and turns it into actionable guidance for builders and operators of LLM-based assistants.
The format is short, structured, operational: one analysis, one risk grid, one checklist you can apply the same day.
Articles below.