Why Robots.txt and Metadata Aren't Enough: The Technical Case for a Text and Data Mining (TDM) Registry
Understanding the gap between web protocols and AI licensing needs When publishers first learned that AI companies were training models on copyrighted content, the response was predictable: "Can't we just use robots.txt?" After all, search engines have respected these simple text files for decades, telling