cross-posted from: https://programming.dev/post/8121669
Japan determines copyright doesn’t apply to LLM/ML training data.
On a global scale, Japan’s move adds a twist to the regulation debate. Current discussions have focused on a “rogue nation” scenario where a less developed country might disregard a global framework to gain an advantage. But with Japan, we see a different dynamic. The world’s third-largest economy is saying it won’t hinder AI research and development. Plus, it’s prepared to leverage this new technology to compete directly with the West.
I am going to live in the sea.
www.biia.com/japan-goes-all-in-copyright-doesnt-apply-to-ai-training/



So if AI companies pay for a book or music (like a consumer) it’s no problem? Because I don’t think this is about paying for content, it’s that content holders refuse to work with AI companies.
Unironically yes, if AI companies paid for training data everyone would be much happier.
I sincerely doubt that NOBODY is willing to sell data to them. It’s far more likely that they have not offered anyone a fair price yet, which makes sense because that would set a precedent.
Even then, if people don’t want to sell them their copyrighted work then tough. You can’t compel people to take customers they don’t want.
So if I go on a free website that hosts art (ArtStation, DeviantArt, etc.) and get training data that I could have legally accessed for free…
They’ve all already done that haha. You could argue that a human has only one life in which to remix that art but an AI is theoretically immortal, so it’s a different category of customer.
At any rate, it’s clear that AI should not have free access to copyrighted works, like news articles, academic papers, stock images, and various kinds of non deviantart art.