The Data That Powers A.I. Is Disappearing Fast

The article discusses the emerging challenges faced by AI developers due to increasing restrictions on data used for training AI models. A study by the Data Provenance Initiative highlights that many websites, through the Robots Exclusion Protocol and altered terms of service, are now blocking their data from being harvested, resulting in a significant reduction in available high-quality data. The study emphasizes the need for better tools to allow content owners to control data usage more precisely, reflecting the tensions between AI and content.

https://www.nytimes.com/2024/07/19/technology/ai-data-restrictions.html