Besides selling the most sought-after hardware, NVIDIA is also developing its own models, including NeMo Megatron models. These were trained using NVIDIA’s own hardware and with help from large text libraries, much like other tech giants do.
…
As the case progressed, the authors also brought up NVIDIA’s contacts with Anna’s Archive, inquiring about “high-speed access” to the shadow library’s massive collection of pirated books.
This is probably why Anna’s Archive hasn’t been taken down yet - the big fish are pirating, too.
these guys gonna lose to china. already chinese coding models almost the same and 1/10 the price. check out z.ai and others
This is unrelated to the post, China is using the same source material, they are just never going to be sued for using it 😁
also china can use our shit, but because of the firewal - we cannot use their shit. they are in a lot better position.
ya all dont think we should get our own freedom firewall going ?
Authoritarian means will not generate the anarchist ends.
Tldr? What is shadow library scripts?
scripts that NVIDIA distributed to clients so they could automatically download and preprocess The Pile dataset.
sounds like they allegedly wrote some stuff to get faster downloads/avoid throttling while they were allegedly pirating books from shadow libraries for their AI
In addition, the motion also targets the contributory copyright infringement allegations, which center on scripts and tools NVIDIA allegedly distributed so corporate customers could automatically download ‘The Pile,’ the dataset that contains Books3.





