Lawyers for The New York Times and Daily News, which are suing OpenAI for allegedly scraping their works to train its AI models without permission, say OpenAI engineers accidentally deleted data potentially relevant to the case. Earlier this fall, OpenAI agreed to provide two virtual machines so that counsel for The Times and Daily News could perform searches for their copyrighted content in its AI training sets. In a letter, attorneys for the publishers say that they and experts they hired have spent over 150 hours since November 1 searching OpenAI’s training data. In this case and others, OpenAI has maintained that training models using publicly available data — including articles from The Times and Daily News — is fair use. OpenAI has neither confirmed nor denied that it trained its AI systems on any specific copyrighted works without permission.
Source: New York Times November 21, 2024 15:06 UTC