The detector arrived as an answer to a panic. When generative writing tools became something any undergraduate could open in a browser tab, the institutional reflex was not to ask what the technology meant for the assignment but to find a machine that could police the old assignment unchanged — and the plagiarism vendors, Turnitin foremost among them, shipped one within months. The promise was clean: paste in the essay, receive a percentage, know who cheated. What the promise concealed is the subject of this week's column. A classifier that decides whether a sentence was written by a person or a model does not return the truth; it returns a guess, expressed with a confidence the underlying mathematics does not earn. And every guess of that kind comes with a second number that the marketing does not print on the box — the rate at which it calls the innocent guilty.
That second number is where the cost lives. The arc this column traces runs between two things that have moved at different speeds: a conversation that spent most of 2025 talking about AI detection as a manageable problem, a tool to be tuned and trusted, and a body of statistical reality that has not budged at all. The conversation flipped from early skepticism to a long season of solution-optimism and only lately began to sour again. The reality — that rare events misclassified at scale produce far more false alarms than true catches, that detectors learn to flag surface features rather than authorship, that running every student through the same test guarantees you will accuse some of them wrongly — was true before the first detector launched and will be true after the last one is switched off. This is a piece about why those two stories took so long to meet, and about who pays the interest while they stay apart.
The talk arrived in a recognizable shape. At the close of 2024 the conversation about AI's failure modes was still faintly skeptical in tone — the few voices addressing detection and its costs leaned critical, treating the new tools as suspect before they had been adopted. That posture did not last. Across the first two quarters of 2025 the framing inverted hard toward optimism, and the optimism took a specific form: not "this works" so much as "this can be made to work." The risk was real, the genre conceded, but it was a risk to be governed. 10 AI dangers and risks and how to manage them, IBM's contribution to the genre, is built entirely on that grammar — name the pitfall, then hand over the management plan, the danger always safely upstream of a procedure. How to build safe, secure and trustworthy AI capabilities makes the same move at the level of institutional process: caution is invoked precisely so that deployment can proceed. The AI Dilemma: Powering the Future or Fueling Our Fears? stages the fear in its title and then resolves it, the way the genre almost always does, on the side of the future.