Writing
Research and field notes.
Deep dives, threat analysis, and notes from the work - mostly on AI and agent security, sometimes on whatever is breaking this week.
Latest
InfluenceChat: What Failed While Building A Manipulation Dataset
A research note on why real manipulative assistance requests were much harder to retrieve from public LLM logs than expected.
June 3, 20263 min readAI Safety Research
Read note