

MTT is just a pipe dream, last I checked. But Deepseek is actively being served, in mixed FP8/FP4, on racks of Huawei accelerators.
I believe Baidu trained a model on them, too. But most training (like Deepseek’s) is still done on CUDA.
…Also, be careful equating this stuff with any kind of “consumer friendly” hardware you or I could buy. That’s less likely. The Huawei accelerators (and other local Chinese hardware experiments) are geared towards huge servers serving requests in parallel.






Toms has always been clickbait. They just copy and sensationalize other headlines they find, and they’ve frequently bent the knee for Nvidia. Other outlets used to make fun of them all the time.
It’s sad they “survived” the enshittification of the internet and people keep sharing their clickbait :(
That being said, there’s some truth here.