this post was submitted on 27 Jan 2025
48 points (90.0% liked)
World News
you are viewing a single comment's thread
OK, hold on, so I went over to huggingface and took a look at this.
DeepSeek is huge. Like Llama 3.3 huge. I haven't done any benchmarking myself (I'm guessing that's out there already), but surely it would take as much Nvidia muscle to run this at scale as ChatGPT, even if it was much, much cheaper to train, right?
So is the rout based on the idea that the need for training hardware is much smaller than suspected, even if the operating cost is the same... or is the stock market just clueless and dumb and everyone's running on vibes at all times anyway?
It does not take an entire Nvidia datacenter to serve one customer. The largest model appears to run on a high-end rig.
The largest model, the only one that beats GPT, is like 700 GB.
It's not running on a high-end rig; it's running on a server.
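For a rough sense of why that rules out a single gaming rig, here's a back-of-envelope sketch. It assumes the publicly listed 671B total parameter count for DeepSeek's largest model and just multiplies by bytes per weight; KV cache and activations would only add more on top:

```python
# Back-of-envelope estimate of the weight footprint of a 671B-parameter
# model at different precisions. Assumption: 671B total parameters
# (the figure on DeepSeek's model card); runtime overhead is ignored.
PARAMS = 671e9  # total parameters

BYTES_PER_PARAM = {
    "fp16": 2.0,  # half precision
    "fp8": 1.0,   # the precision the weights were released in
    "q4": 0.5,    # aggressive 4-bit quantization
}

for fmt, nbytes in BYTES_PER_PARAM.items():
    gb = PARAMS * nbytes / 1e9
    print(f"{fmt:>4}: ~{gb:,.0f} GB of weights")

# Output:
#   fp16: ~1,342 GB  -> far beyond any single machine
#    fp8:   ~671 GB  -> roughly the ~700 GB cited above; even an
#                       8x80 GB GPU server (640 GB) is tight
#     q4:   ~336 GB  -> still several datacenter GPUs, not a gaming rig
```

So even quantized down to 4 bits, the full model wants hundreds of gigabytes of fast memory, which is multi-GPU server territory.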