DeepSeek ditches Nvidia for Huawei chips in V4 launch

inari@piefed.zip · edit-2 1 month ago

DeepSeek ditches Nvidia for Huawei chips in V4 launch

brucethemoose@lemmy.world · 1 month ago

I just meant for mass inference serving.

Yeah, I haven’t seen much in the way of bitnet training savings yet, like regular old QAT. It does appear that Deepseek is finetuning their MoEs in a 4-bit format now, though.

DeepSeek ditches Nvidia for Huawei chips in V4 launch

DeepSeek ditches Nvidia for Huawei chips in V4 launch

Attention Required! | Cloudflare