The arms race you can't win.
Every AI image detector — Hive Moderation, Illuminarty, AI Or Not, TruthScan, Sightengine, Deep AI — works the same way: a classifier model trained on examples of “real” and “generated” images, learning to spot the statistical fingerprints of generation. In 2024 the best ones hit 85–94% on standard benchmarks. In 2026 they still hit 85–94%, but the benchmarks have shifted: as soon as a detector learns what Midjourney's v6 outputs look like, v7 comes out and changes the fingerprint.
This is not a bug. It's the structure of the problem. Generators get trained against detectors as part of their loss function — they explicitly learn to evade. Detection accuracy is bounded above by the latest generation it has seen; generation quality is bounded only by compute. Compute is winning.
There's a deeper issue for screenshots specifically. Detectors are tuned on photographic images and AI art. A screenshot of a fabricated tweet is a screenshot of rendered text on a uniform background — exactly the kind of content where ML detectors perform worst. Posters, memes, and chat screenshots have homogeneous regions and font-rendered content that doesn't carry the kind of noise signature a detector keys on. The very thing we most want to verify is the thing detection handles worst.
You can build a workflow around detection. You shouldn't stake an evidentiary claim on one. The honest framing for 2026: detectors are useful as a triage signal (“this deserves a closer look”) and useless as a verdict.