DeepSeek's AI breakthrough bypasses industry-standard CUDA, uses assembly-like PTX programming instead
-
mesamunefire@lemmy.world replied to Guest 28 days ago
Reminds me of Bitcoin mining and how ASIC miners overtook graphics-card mining practically overnight. It would not surprise me if this goes the same way.
-
sinceasdf@lemmy.world replied to Guest 28 days ago
This is why NVIDIA stock has been hit so hard. CUDA is their moat.
-
capsicones@lemmy.blahaj.zone replied to Guest 28 days ago
There seems to be some confusion here about what PTX is -- it does not bypass the CUDA platform at all, nor does it diminish NVIDIA's monopoly here. CUDA is a programming environment for NVIDIA GPUs, but many people say "CUDA" to mean the C/C++ extension in CUDA (CUDA can be thought of as a C/C++ dialect here). PTX is NVIDIA-specific and sits at a similar level to LLVM's IR. If anything, DeepSeek is more dependent on NVIDIA than everyone else, since PTX is tightly coupled to their specific GPUs. Things like ZLUDA (an effort to run CUDA code on AMD GPUs) won't work. This is not a feel-good story here.
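To illustrate the point (a minimal sketch, not DeepSeek's actual code -- the kernel name and the fused multiply-add are made up for this example): hand-written PTX normally lives inside an ordinary CUDA C++ kernel as inline assembly and is compiled with nvcc like any other CUDA program, so using PTX means using the CUDA toolchain, not bypassing it.

```cuda
// Sketch: inline PTX embedded in a regular CUDA C++ kernel.
// Hypothetical example kernel, compiled with nvcc as usual.
__global__ void fma_kernel(const float* a, const float* b, float* out) {
    int i = threadIdx.x;
    float r;
    // Hand-written PTX: fused multiply-add, r = a[i] * b[i] + 1.0f.
    // This still goes through nvcc and the CUDA stack -- PTX is
    // CUDA's NVIDIA-specific intermediate representation,
    // analogous to LLVM IR, not an alternative to CUDA.
    asm("fma.rn.f32 %0, %1, %2, %3;"
        : "=f"(r)
        : "f"(a[i]), "f"(b[i]), "f"(1.0f));
    out[i] = r;
}
```

Because the asm string is raw PTX, this code is even more tied to NVIDIA hardware than plain CUDA C++ would be.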
-
toffi@feddit.org replied to Guest 28 days ago
Never forget, kids: the market can stay irrational much longer than you can stay solvent.
-
demesisx@infosec.pub replied to Guest 28 days ago
True.
That's why I tend to make small plays instead of being an absolute degenerate gambler.
-
eager_eagle@lemmy.world replied to Guest 28 days ago
I don't think anyone is saying CUDA as in the platform, but as in the API for higher-level languages like C and C++.
-
capsicones@lemmy.blahaj.zone replied to Guest 28 days ago
Some commenters on this post are clearly not aware of PTX being a part of the CUDA environment. If you know this, you aren't who I'm trying to inform.
-
eager_eagle@lemmy.world replied to Guest 28 days ago
aah I see them now
-
eager_eagle@lemmy.world replied to Guest 28 days ago
I wish that was true, but this doesn't threaten any monopoly
-
demesisx@infosec.pub replied to Guest 28 days ago
It certainly does.
Until last week, you absolutely NEEDED an NVIDIA GPU equipped with CUDA to run all AI models.
Today, that is simply not true. (Watch the video at the end of this comment.) I watched this video and my initial reaction to this news was validated and then some: it made me even more bearish on NVDA.
-
pr06lefs@lemmy.ml replied to Guest 28 days ago
This specific tech is, yes, NVIDIA-dependent. The game changer is that a team was able to beat the big players with less than 10 million dollars. They did it by operating at a low level of NVIDIA's stack, practically machine code. What this team has done, another could do. Building for the AMD GPU ISA would be tough but not impossible.
-
eager_eagle@lemmy.world replied to Guest 28 days ago
mate, that means they are using PTX directly. If anything, they are more dependent on NVIDIA and the CUDA platform than anyone else.
-
eager_eagle@lemmy.world replied to Guest 28 days ago
you absolutely NEEDED an NVIDIA GPU equipped with CUDA
-
demesisx@infosec.pub replied to Guest 28 days ago
Ahh. Thanks for this insight.
-
demesisx@infosec.pub replied to Guest 28 days ago
Thanks for the corrections.
-
paraphrand@lemmy.world replied to Guest 28 days ago
I thought everyone liked to hate on Metal.
-
pieisawesome@lemmy.world replied to Guest 28 days ago
It's written in NVIDIA's PTX instruction set, which is part of the CUDA ecosystem.
Hardly going to affect NVIDIA.
-
gsus4@mander.xyz replied to Guest 28 days ago
I thought CUDA was NVIDIA-specific too; for a general version you had to use OpenACC or something.
-
remotelove@lemmy.ca replied to Guest 28 days ago
CUDA is NVIDIA-proprietary, but NVIDIA may be open to licensing it? I think?
-
australis13@fedia.io replied to Guest 28 days ago
The big win I see here is the amount of optimisation they achieved by moving from the high-level CUDA to lower-level PTX. This suggests that developing these models going forward can be made a lot more energy-efficient, something I hope can be extended to their execution as well. As it stands currently, "AI" (read: LLMs and image generation models) consumes way too many resources to be sustainable.