DeepSeek’s distilled new R1 AI model can run on a single GPU

Posted by:

|

On:

|

DeepSeek’s updated R1 reasoning AI model might be getting the bulk of the AI community’s attention this week. But the Chinese AI lab also released a smaller, “distilled” version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably-sized models on certain benchmarks. The smaller updated R1, which was built using the Qwen3-8B model Alibaba […]

Posted by

in

Leave a Reply

Your email address will not be published. Required fields are marked *