Anthropic and ETH Zurich showed a fully automated deanonymization attack with 90% precision.

LLMs autonomously matched Hacker News and LinkedIn profiles using cross-platform references.

Simon Lermen and the team from MATS, Anthropic, and ETH Zurich published "Large-scale online deanonymization with LLMs" and showed how Hacker News and Reddit accounts can be deanonymized at scale.

Highlights:

My take:

  1. LLMs are effectively advanced deanonymizers.
  2. Pseudonymity has relied on the assumption that deanonymization is expensive. Not anymore. We need to rethink what "sufficiently anonymized" means in the LLM era. E.g., HIPAA Safe Harbor strips 18 identifier types from clinical data, but soft identifiers in clinical notes (rare diagnoses, injury circumstances, social history) can reveal identity.
  3. Expect better targeting and recon by threat actors at scale. An everyday Joe in your company gets the same level of profiling that was previously justified only for high-value targets.

Large-scale online deanonymization with LLMs

LLM deanonymization pipeline: extract, search, reason, and calibrate across platforms Deanonymization agent matching Hacker News accounts to LinkedIn profiles at 90% precision Precision-recall tradeoff results for automated identity matching at $1-$4 per target