Crawlers, which now account for half of all internet traffic, are essential for search engines, price comparison, and data aggregation. They also support web security, accessibility, and historical archives. However, the rise of AI has disrupted this balance. AI companies like OpenAI use web-crawled data to train systems like ChatGPT, leading to a backlash from websites fearing economic displacement. Since mid-2023, over 25% of high-quality data has been restricted. Websites are employing anti-crawling technologies, potentially limiting access for all crawlers, including those from academics and journalists. This could lead to a more closed web, with increased logins, paywalls, and access restrictions, reducing the open access that has defined the internet.
Source: www.technologyreview.com















