Writing

Research and field notes.

Deep dives, threat analysis, and notes from the work - mostly on AI and agent security, sometimes on whatever is breaking this week.

Latest

InfluenceChat: What Failed While Building A Manipulation Dataset

A research note on why real manipulative assistance requests were much harder to retrieve from public LLM logs than expected.

June 3, 20263 min readAI Safety Research

Read note