this post was submitted on 28 Jan 2025
865 points (94.4% liked)

memes

11276 readers
3336 users here now

Community rules

1. Be civilNo trolling, bigotry or other insulting / annoying behaviour

2. No politicsThis is non-politics community. For political memes please go to [email protected]

3. No recent repostsCheck for reposts when posting a meme, you can only repost after 1 month

4. No botsNo bots without the express approval of the mods or the admins

5. No Spam/AdsNo advertisements or spam. This is an instance rule and the only way to live.

A collection of some classic Lemmy memes for your enjoyment

Sister communities

founded 2 years ago
MODERATORS
 

Office space meme:

"If y'all could stop calling an LLM "open source" just because they published the weights... that would be great."

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 0 points 2 days ago (6 children)

If we're specifically saying that open source means you can recreate the binaries, then data is fundamentally not able to be open source

lol, are you claiming data isn't reproducable? XD

[–] [email protected] -1 points 2 days ago (3 children)

... Did you not read the litteral next phrase in the sentence?

since it distinctly lacks any form of executable content.

Your definition of open source specified reproducible binaries. From context it's clear that I took issue with your definition, not with the the notion of reproducing data.

[–] [email protected] 0 points 1 day ago (2 children)

Ok, then my definition givenwas too narrow, when I said "reproducable binaries". If data claims to be "open source", then it needs to supply information on how to reproduce it.

Open data has other criteria, I'm sure.

[–] [email protected] 1 points 1 day ago (1 children)

my definition givenwas too narrow

Yes, that's what I said when you opted to take the first half of a sentence out of context.

https://en.wikipedia.org/wiki/Open_data

The common usage of open data is just that it's freely shareable.
Like I said in my initial comment, people frequently use "open source" to refer to it, but it's such a pervasive error that it hardly worth getting too caught up on and practically doesn't count as an error anymore.

Some open data can't be reproduced by anyone who has access to the data.

[–] [email protected] 0 points 1 day ago

I was specifically addressing the use of the phrase "open source". And the term "open data" doesn't apply either, since it's not a dataset that's distributed, but rather weights of an LLM with data baked into it. That's neither open source nor open data.

load more comments (2 replies)