153
Chinese algorithm claimed to boost Nvidia GPU performance by up to 800X for advanced science applications
(www.tomshardware.com)
This is a most excellent place for technology news and articles.
Yes, exactly… For another example, the DeepSeek team developed their own replacement to CUDA with PTX (Parallel Thread Execution) a lower-level assembly-like language that allows for more granular optimisations of GPU performance offering 10X efficiency improvement recently as GPU sanctions were levied on China. This innovative approach not only challenges the dominance of CUDA in the AI landscape but also opens up new possibilities for optimizing GPU performance in various applications and this is what missing not only from those relying on Nvidia but its competitors whether AMD or Apple that prefers to have its own proprietary solutions.
Nvidia developed PTX, DeepSeek leveraged it to do some load balancing work they couldn't do in CUDA. They still also use CUDA.
I am aware and hence the word optimisation (instead of simply pouring money by hardware).
You literally said they developed it.
*Developed their own replacement using PTX to gain granular control over CUDA and hence the emphasis with optimisation as a user so all software, app, game and likewise developers take this practice into consideration again.