Sunday, August 18, 2024

TorrentFreak's Latest News

 

NVIDIA: Copyrighted Books Are Just Statistical Correlations to Our AI Models
Ernesto Van der Sar, 17 Aug 03:24 PM

Over the past two years, AI developments have progressed at a rapid pace.

This includes large language models, which are typically trained on a broad datasets of texts; the more, the better.

When AI hit the mainstream, it became apparent that rightsholders are not always pleased that their works were used to train AI. This applies to photographers, artists, music companies, journalists, and authors, some of whom formed groups to file copyright infringement lawsuits to protect their rights.

Book authors, in particular, complained about the use of pirated books as training material. In various lawsuits, companies including OpenAI, Microsoft, Meta, and NVIDIA are accused of using the 'Books3' dataset, which was scraped from the library of 'pirate' site Bibliotik.

After the Books3 accusations hit mainstream news, many AI companies stopped using this source. Meanwhile, anti-piracy companies helped publishers to take the alleged rogue libraries offline to prevent further damage.

These e...Read More

270x90-blue

Are you looking for a VPN service? TorrentFreak sponsor NordVPN has some excellent offers.

 
 
Powered by Mad Mimi®A GoDaddy® company

No comments: