Microsoft AI CEO: Everything in the fair use of the open web for training


So that writeLEAD advertising campaignsAND haste on the part of the authorities AI needs training materials. ChatGPT needed approx 300 billion words to get off the ground and continues to train based on how users interact with it.

However, human beings are not credited or compensated for creating the content that the AI ​​is consuming. authors, the artistsAND news organizations have already filed countless copyright lawsuits against AI giants like OpenAI and Microsoft after discovering that AI bots can talk about their copyrighted work”very accurate” β€” indicating that the works are in the UA training records.

That's why Microsoft's AI CEO Mustafa Suleyman was asked about it Aspen Festival of Ideas in late June whether AI companies have essentially stolen the world's intellectual property.

Suleiman's response? Almost all content on the Internet, with one possible exception, is fair game for AI training.

Connected: An artificial intelligence startup partnered with Microsoft is being sued by the world's largest recording companies

“I think with regard to content that's already on the open web, the social contract of that content since the '90s has been that it's fair use,” Sulejman said.

Suleiman stated that “anyone” can copy or recreate content on the open web.

“This has been the highway,” he said. “That was the understanding.”

However, some news sites and publishers have asked not to be scratched or dragged.

“This is a gray area and I think it will work through the courts,” Sulejman said.

Mustafa Sulejman. Photo: Stefan Wermuth/Bloomberg via Getty Images

Suleyman leads Microsoft AI at a time when Microsoft has invested billions in technology. Its position on what is and isn't fair use reflects how AI companies can defend intellectual property claims in court.

OpenAI, for example, is alleged to have used more than one million hours of YouTube videos to train ChatGPT. When asked Whether YouTube or social media videos were used to create OpenAI's video generator, Sora, the company's chief technology officer Mira Murati said, “We used publicly available data and licensed data,” and would not specify further beyond.

HE also seems to be eating up work generated by other AI, resulting in lower quality output. Experts estimate that 90% of the content on the Internet will be generated by AI within the next two years.

Connected: The most downloaded news app in the US may have published dozens of fake, AI-written stories



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *