Claude and ChatGPT too expensive, Chinese AI models surge in use due to low cost

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 22 days ago

brucethemoose@lemmy.world · edit-2 21 days ago

Doesn’t matter (for this, specifically) if it’s not performant on LLM inference engines.

And I’m not just talking about CUDA. Even GGUF Vulkan (for example) has all sorts of vendor quirks that can absolutely trash performance. vLLM is often a joke on AMD, with certain models, on certain cards, even with dev support.

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 21 days ago

Sure, but try extrapolating 2 or 3 years into the future here. Models are going to become more efficient and hardware is going to improve. Right now Chinese companies are just starting to put out GPUs, but once that process is ironed out, I don’t see why they wouldn’t put out chips that work well with Chinese models. This kind of stuff is happening already; it’s only a matter of time till it makes it to the consumer market too. https://lushbinary.com/blog/deepseek-v4-huawei-ascend-ai-infrastructure-strategy