this post was submitted on 27 Dec 2023
15 points (100.0% liked)
Technology
191 readers
1 users here now
This magazine is dedicated to discussions on the latest developments, trends, and innovations in the world of technology. Whether you are a tech enthusiast, a developer, or simply curious about the latest gadgets and software, this is the place for you. Here you can share your knowledge, ask questions, and engage in discussions on topics such as artificial intelligence, robotics, cloud computing, cybersecurity, and more. From the impact of technology on society to the ethical considerations of new technologies, this category covers a wide range of topics related to technology. Join the conversation and let's explore the ever-evolving world of technology together!
founded 2 years ago
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
How so?
The trained model includes vast swathes of copyrighted material. It's the rights holders who get to decide whether someone can use it.
Just because it makes it inconvenient or harder for someone to train an AI model does not justify wholesale stealing.
A lot of models are even trained on large numbers of pirated material like books downloaded from pirate sites etc. I guarantee you OpenAI and others didn't even buy a lot of the material they use to train the AI models on.
My hunch is that if they did actually buy or properly license that material, they would have been bankrupt before the first version of ChatGPT came online. And if that's true, then OpenAI owes it's entire existence to it's piracy.
Its not piracy to just webscrap everything for data...
There isn't a person sitting around and pirating shit, its a Algorithm that takes everything from the internet it can reach.
Yes it is.
No. It's publicly available, piracy would be to use stuff that isn't publicly available.
Publicly available =/= public domain.