overview for cyd

Why Mark Zuckerberg wants to redefine open source so badly in c/[email protected]

[–] [email protected] 11 points 13 hours ago* (last edited 13 hours ago)

Aww come on. There's plenty to be mad at Zuckerberg about, but releasing Llama under a semi-permissive license was a massive gift to the world. It gave independent researchers access to a working LLM for the first time. For example, Deepseek got their start messing around with Llama derivatives back in the day (though, to be clear, their MIT-licensed V3 and R1 models are not Llama derivatives).

As for open training data, its a good ideal but I don't think it's a realistic possibility for any organization that wants to build a workable LLM. These things use trillions of documents in training, and no matter how hard you try to clean the data, there's definitely going to be something lawyers can find to sue you over. No organization is going to open themselves up to the liability. And if you gimp your data set, you get a dumb AI that nobody wants to use.

Researchers trained an OpenAI rival in half an hour for less than $50 in c/[email protected]

[–] [email protected] 2 points 13 hours ago* (last edited 13 hours ago)

The underlying research story is interesting, but the way it's written up actively makes it worse.

The researchers based s1 on Qwen2.5, an open-source model from Alibaba Cloud.

Watch me create a racing car for less than $50. Step 1: start with a Mercedes F1 racer...

DeepSeek’s rise shows why China’s top AI talent is skipping Silicon Valley. in c/[email protected]

[–] [email protected] 23 points 2 days ago (3 children)

It's definitely a trend. More and more top Chinese students are also opting to stay in China for university, rather than going to the US or Europe to study. It's in part due to a good thing, i.e. the improving quality of China's universities and top companies. But I think it's a troubling development for China overall. One of China's strengths over the past few decades has been their people's eagerness to engage with the outside world, and turning inward will not be beneficial for them in the long run.

The EU is to create its own open-source AI, that supports 30 European languages, and has EU values 'baked in.' in c/[email protected]

[–] [email protected] 1 points 2 days ago

But Mistral could do all that, with a far lower chance of pissing away the money...

The EU is to create its own open-source AI, that supports 30 European languages, and has EU values 'baked in.' in c/[email protected]

[–] [email protected] 1 points 3 days ago* (last edited 3 days ago) (2 children)

Why not just put the money into Mistral? Mistral seems to be pretty cash-strapped and they're just about the only EU entity doing anything interesting with LLMs, and they've released open models before. Commissioning a bunch of models from them would be a better use of money than spreading it among a bunch of randos.

Focus: DeepSeek gives Europe's tech firms a chance to catch up in global AI race in c/[email protected]

[–] [email protected] 5 points 3 days ago

Chinese or not, it's MIT licensed. A world where any company can spend ~$10k to locally deploy a frontier reasoning model is very different from one where you can only get AI via API access to a handful of US tech giants.

Trump says he wants Ukraine's rare earth elements as a condition of further support in c/[email protected]

[–] [email protected] 12 points 3 days ago (5 children)

Rare earths to begin with. There will be more demands.

US Bill proposed to jail people who download Deepseek in c/[email protected]

[–] [email protected] 20 points 3 days ago* (last edited 3 days ago) (6 children)

Base models are general purpose language models, mainly useful for AI researchers and people who want to build on top of them.

Instruct or chat models are chatbots. They are made by fine-tuning base models.

The V3 models linked by OP are Deepseek's non-reasoning models, similar to Claude or ChatGPT4o. These are the "normal" chatbots that reply with whatever comes to their mind. Deepseek also has a reasoning model, R1. Such models take time to "think" before supplying their final answer; they tend to give better performance for stuff like math problems, at the cost of being slower to get the answer.

It should be mentioned that you probably won't be able to run these models yourself unless you have a data center style rig with 4-5 GPUs. The Deepseek V3 and R1 models are chonky beasts. There are smaller "distilled" forms of R1 that are possible to run locally, though.

Canada and Mexico hit back after Trump signs order for punishing tariffs in c/[email protected]

[–] [email protected] 7 points 5 days ago (1 children)

Going after US tech is an obvious move. Digital services taxes, etc.

Canada and Mexico hit back after Trump signs order for punishing tariffs in c/[email protected]

[–] [email protected] 4 points 5 days ago (8 children)

"Via Greenland" makes no sense. The trouble with Canada-Europe trade is that Canada unfortunately lacks a good port on its east coast (certainly nothing comparable to Vancouver in the west). For the foreseeable future, if the trade dispute with the US drags on, Canada's best bet is to expand its trade with Asia.

OpenAI hits back at DeepSeek with o3-mini reasoning model in c/[email protected]

[–] [email protected] 6 points 5 days ago

Intriguingly, there's reason to believe the R1 distills are nowhere close to their peak performance. In the R1 paper they say that the models are released as proofs of concept of the power of distillation, and the performance can probably be improved by doing an additional reinforcement learning step (like what was done to turn V3 into R1). But they said they basically couldn't be bothered to do it and are leaving it for the community to try.

2025 is going to be very interesting in this space.

‘There will be many casualties’: Panama girds for war as Rubio opens talks in c/[email protected]

[–] [email protected] 34 points 5 days ago (2 children)

They have no armed forces. Panama always assumed that because of the importance of the canal, in case of external aggression the US will step in to defend them. LOL.

132

Orban tells EU leaders Trump would act as "Russia-Ukraine peace broker" (www.straitstimes.com)

submitted 6 months ago* (last edited 6 months ago) by [email protected] to c/[email protected]

43 comments fedilink

He claims Trump would act immediately upon winning the election, before taking office. Which sounds legally dubious, but not that that's ever stopped Trump....

91

Nate Silver's model for the 2024 election is out, currently predicts a 65% chance of a Trump victory. (www.natesilver.net)

submitted 7 months ago by [email protected] to c/[email protected]

77 comments fedilink

14

America’s assassination attempt on Huawei is backfiring (www.economist.com)

submitted 7 months ago by [email protected] to c/[email protected]

28 comments fedilink

Archive link: https://archive.is/vGKin

20

Spider-Inspired Microphone Detects Tiny Gusts of Sound (physics.aps.org)

submitted 8 months ago by [email protected] to c/[email protected]

0 comments fedilink

61

France imposes state of emergency in New Caledonia as unrest continues (www.npr.org)

submitted 8 months ago by [email protected] to c/[email protected]

4 comments fedilink

Always weird to me how France is so insistent on clinging to its colonial empire, two decades into the 21st century, despite the headaches that causes.

49

Sonia Sotomayor's retirement is a political IQ test (www.natesilver.net)

submitted 10 months ago by [email protected] to c/[email protected]

16 comments fedilink

27

Japan sorely needs separate surnames (eastasiaforum.org)

submitted 10 months ago by [email protected] to c/[email protected]

0 comments fedilink

1

RSAF will be conducting airdrops of humanitarian aid into Gaza (www.channelnewsasia.com)

submitted 10 months ago by [email protected] to c/[email protected]

0 comments fedilink

1

Foreign interference law invoked for the first time against naturalised Singaporean businessman (www.straitstimes.com)

submitted 1 year ago by [email protected] to c/[email protected]

0 comments fedilink

Guess which country is doing the alleged interference...

"Mr Chan, the managing director of several real estate investment firms, was invited to attend China’s annual Two Sessions parliamentary meetings in March 2023 as an “overseas Chinese representative”."

1

S’pore congratulates Taiwan’s President-elect Lai Ching-te on his victory (www.straitstimes.com)

submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]

0 comments fedilink

I'm somewhat surprised that Singapore chose to stick its neck out with a statement, since you-know-who won't like this...

172

China is banning dailies, first time bonuses, and other gacha practices used in Genshin. (gamerant.com)

submitted 1 year ago by [email protected] to c/[email protected]

15 comments fedilink

45

China is banning dailies, first time bonuses, and other gacha practices found in HSR and Genshin. (gamerant.com)

submitted 1 year ago by [email protected] to c/[email protected]

2 comments fedilink