
Alex Reisner / The Atlantic:
A profile of nonprofit Common Crawl, which has scraped billions of webpages since 2013, including paywalled ones, to build an archive used by OpenAI and others — Common Crawl claims to provide a public benefit, but it lies to publishers about its activities.

Source: TechMeme
Source Link: http://www.techmeme.com/251104/p10#a251104p10