Anakin: “Judge backs AI firm over use of copyrighted books”
Padme: “But they’ll be held accountable when they reproduce parts of those works or compete with the work they were trained on, right?”
Anakin: “…”
Padme: “Right?”
Pirate everything!
I'm not pirating. I'm building my model.
To anyone reading this comment without having read the article: this ruling doesn't mean it's okay to pirate for building a model. Anthropic will still need to go through trial for that:
But he rejected Anthropic's request to dismiss the case, ruling the firm would have to stand trial over its use of pirated copies to build its library of material.
I also read through the judgment, and I think it's better for Anthropic than you describe. He distinguishes three issues:
A) Using any written material they can get their hands on to train the model (where the resulting model doesn't just reproduce the works).
B) Buying a single copy of a print book, scanning it, and retaining the digital copy for a company library (for all sorts of future purposes).
C) Pirating a book and retaining that copy for a company library (for all sorts of future purposes).
A and B were found to be fair use on summary judgment, meaning this judge thinks it's clear-cut in Anthropic's favor. C will go to trial.
C could still bankrupt the company depending on how trial goes. They pirated a lot of books.
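For a rough sense of scale (a back-of-envelope sketch, not figures from the case): US statutory damages under 17 U.S.C. § 504(c) run $750 to $30,000 per infringed work, and up to $150,000 per work if the infringement is found willful, so even a conservative count of pirated books multiplies out fast:

```python
# Back-of-envelope only: the library size below is a made-up illustration,
# not a number from the ruling. Per-work figures are from 17 U.S.C. § 504(c).
works = 100_000  # hypothetical count of pirated books
per_work_damages = {
    "statutory minimum": 750,
    "statutory maximum": 30_000,
    "willful maximum": 150_000,
}

for label, amount in per_work_damages.items():
    print(f"{label}: ${works * amount:,}")
# statutory minimum: $75,000,000
# statutory maximum: $3,000,000,000
# willful maximum: $15,000,000,000
```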
As a civil matter, the publishing houses are more likely to get the full money if Anthropic stays in business (and does well). So it might be bad, but I'm really skeptical about bankruptcy (and I'm not hearing anyone seriously floating it?).
Depending on the type of bankruptcy, the business can still operate; all their profits would just go toward paying off their debts.
An 80-year-old judge on their best day couldn't be trusted to make an informed decision. This guy was either bought or confused into his decision. Old people gotta go.
Did you read the actual order? The detailed conclusions begin on page 9. What specific bits did he get wrong?
I'm on page 12 and I already saw a false equivalence between human learning and AI training.
Is it this?
First, Authors argue that using works to train Claude’s underlying LLMs was like using works to train any person to read and write, so Authors should be able to exclude Anthropic from this use (Opp. 16).
That's the judge addressing an argument that the Authors made. If anyone made a "false equivalence" here, it's the plaintiffs; the judge is simply saying "okay, let's assume their claim is true," which is the usual approach for a preliminary judgment like this.
Wait, the authors argued that? Why? That's literally the opposite of the thing they needed to argue.
IMO the focus should have always been on the potential for AI to produce copyright-violating output, not on the method of training.
Plaintiffs made that argument and the judge shoots it down pretty hard: competition isn't what copyright protects against. He makes an analogy to teachers teaching children to write fiction: they are using existing fiction to create MANY more competitors on the fiction market. Could an author use copyright to challenge that use?
Would love to hear your thoughts on the ruling itself (it's linked in the Reuters article).
If you try to sell "The New Adventures of Doctor Strange, Jonathan Strange and Magic Man," existing copyright law is sufficient and will stop it. Really, training should be regulated by the same laws as reading. If they can get the material through legitimate means, it should be fine, but pulling data that is not freely accessible should be theft, as it is already.
I have a freely accessible document under a CC license that states it is not to be used for commercial purposes. This is commercial use. Your policy would still allow that document to be used, simply because it is accessible. That kind of policy discourages me from sharing my work at all, since others profit from my effort and my work is more likely to be attributed to a corporate beast I want nothing to do with than to me.
I'm all for copyright reform and simpler copyright law, but these companies need to be held to standard copyright rules, not just made-up modifications. I'm convinced a perfectly decent LLM could be built without violating copyright.
I'd also be OK sharing my works with a not-for-profit, open-source LLM, and I think others might be as well.
as it is already
Copies of copyrighted works cannot be regarded as "stolen property" for the purposes of a prosecution under the National Stolen Property Act of 1934.
https://en.m.wikipedia.org/wiki/Dowling_v._United_States_(1985)
That "freely" there really does a lot of hard work.
It means what it means: "freely" pulls its own weight. I didn't say "readily" accessible. Torrents could be viewed as "readily" accessible, but not as "freely" accessible, because at the very least you bear the guilt of theft. Library books are "freely" accessible, and if somehow the training involved checking out books and returning them digitally, it should be fine. If it is free to read into neurons, it is free to read into neural systems. If payment for reading is expected, then it isn't free.
Civil cases of copyright infringement are not theft, no matter what the MPAA has trained you to believe.
But they are copyright infringement, which costs more than theft.
I hate AI with a fire that keeps me warm at night. That is all.