this post was submitted on 28 Jan 2025
323 points (96.3% liked)

Technology

67077 readers
5095 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 36 points 1 month ago (9 children)

The big win I see here is the amount of optimisation they achieved by moving from the high-level CUDA to lower-level PTX. This suggests that developing these models going forward can be made a lot more energy-efficient, something I hope can be extended to their execution as well. As it stands currently, "AI" (read: LLMs and image generation models) consumes way too many resources to be sustainable.

[–] [email protected] -4 points 1 month ago (6 children)

PTX also removes NVIDIA lock-in.

[–] [email protected] 11 points 1 month ago (1 children)

Wtf, this is literally the opposite of true. PTX is nvidia only.

[–] [email protected] 4 points 1 month ago (1 children)

Google was giving me bad search results about PTX so I just posted am opinion and hoped Cunningham's Law would work.

[–] [email protected] 4 points 1 month ago
load more comments (4 replies)
load more comments (6 replies)