
Alex Reisner / The Atlantic:
A profile of nonprofit Common Crawl, which scraped billions of web pages since 2013, including paywalled articles, to build an archive used by OpenAI and others — Common Crawl claims to provide a public benefit, but it lies to publishers about its activities.

Alex Reisner / The Atlantic:
A profile of nonprofit Common Crawl, which scraped billions of web pages since 2013, including paywalled articles, to build an archive used by OpenAI and others — Common Crawl claims to provide a public benefit, but it lies to publishers about its activities.
Source: TechMeme
Source Link: http://www.techmeme.com/251104/p10#a251104p10