DeepSeek Unveils Powerful, Efficient AI Model to Rival OpenAI, Optimized for Consumer Hardware
April 6, 2025
DeepSeek's model is more cost-effective, with token processing costs significantly lower than those of GPT-4.5, making it an attractive option for businesses.
Chinese AI startup DeepSeek has released an upgraded model, V3-0324, on Hugging Face, strengthening its position against major players like OpenAI.
In separate research with Tsinghua University, DeepSeek combines generative reward modeling (GRM) with self-principled critique tuning, a dual approach intended to improve the reasoning capabilities of large language models (LLMs).
The model uses a mixture-of-experts (MoE) architecture with 671 billion total parameters, activating only a subset of the network for each token to reduce the compute needed per token.
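The selective activation described above can be illustrated with a toy top-k router. This is a minimal sketch of the general MoE idea, not DeepSeek's implementation: the expert count, the scalar "experts", the value of `k`, and the function names `top_k_route` and `moe_forward` are all assumptions made for illustration.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so that the selected experts sum to 1.
    """
    probs = softmax(router_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]


def moe_forward(x, experts, router_logits, k=2):
    """Run only the selected experts and mix their outputs.

    Experts that are not selected are never called, which is where
    the compute saving comes from.
    """
    routes = top_k_route(router_logits, k)
    output = sum(weight * experts[i](x) for i, weight in routes)
    return output, [i for i, _ in routes]


# Toy setup (hypothetical): 8 experts, each a simple scaling function.
NUM_EXPERTS = 8
experts = [lambda x, scale=scale: scale * x for scale in range(1, NUM_EXPERTS + 1)]

# In a real model the router logits come from a learned layer;
# here they are random numbers with a fixed seed.
rng = random.Random(0)
router_logits = [rng.gauss(0, 1) for _ in range(NUM_EXPERTS)]

output, active = moe_forward(3.0, experts, router_logits, k=2)
print(f"active experts: {active} of {NUM_EXPERTS}, output: {output:.3f}")
```

In this sketch only 2 of the 8 experts run for the input, so the work per input scales with `k` rather than with the total number of experts.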
Notably, V3 inference runs at under 200 watts, a significant gain in power efficiency over traditional setups that consume several kilowatts.
DeepSeek-V3-0324 can run on consumer-grade hardware, specifically the Apple Mac Studio equipped with the M3 Ultra chip, where it generates over 20 tokens per second.
Awni Hannun, an Apple machine learning researcher, demonstrated the model's capabilities on the Mac Studio, underscoring its operational efficiency and potential for edge AI applications.
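The power and throughput figures quoted above imply a rough energy cost per generated token. The sketch below works through that arithmetic, assuming the "under 200 watts" and "over 20 tokens per second" figures describe the same Mac Studio setup, which the summary does not state explicitly. The function names are hypothetical.

```python
def joules_per_token(power_watts, tokens_per_second):
    """Energy per generated token: watts are joules per second,
    so dividing by tokens per second gives joules per token."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return power_watts / tokens_per_second


def watt_hours_per_million_tokens(power_watts, tokens_per_second):
    """Same quantity scaled to one million tokens, in watt-hours (1 Wh = 3600 J)."""
    return joules_per_token(power_watts, tokens_per_second) * 1_000_000 / 3600


# Figures from the summary, treated as bounds:
# power is *under* 200 W and throughput is *over* 20 tokens/s,
# so the result is an upper bound on energy per token.
upper_bound_joules = joules_per_token(200, 20)
upper_bound_wh = watt_hours_per_million_tokens(200, 20)

print(f"at most {upper_bound_joules:.1f} J per token")
print(f"at most {upper_bound_wh:.0f} Wh per million tokens")
```

Because the power figure is a ceiling and the throughput figure is a floor, the 10 joules per token computed here is a ceiling as well. The actual cost would be lower.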
Users can customize their GPU configurations and authentication methods; SSH keys are recommended for secure access.
The evolution of AI inference at the edge has been driven by advances in hardware and software, allowing for real-time applications like language translation and predictive maintenance on less powerful devices.
Creating robust AI models can be costly, ranging from thousands to millions of dollars, influenced by their complexity and the specialized resources required.
Despite these advancements, the immediate impact of DeepSeek's model on daily operations may be limited, indicating a gradual evolution in AI demand and capabilities.
Nvidia remains a dominant force in the AI infrastructure market, with a market value of approximately $2.72 trillion, although its stock has recently declined slightly.
Summary based on 34 sources
Sources

Reuters • Mar 25, 2025
DeepSeek's V3 upgrade challenges OpenAI and Anthropic in global AI race
Yahoo Finance • Mar 18, 2025
AudioCodes (AUDC) Strengthens AI Services with New Communication Tools
Yahoo Finance • Apr 7, 2025
China's DeepSeek Teams Up With Tsinghua University To Raise AI Bar, Boost Reasoning Capabilities
VentureBeat • Mar 24, 2025
DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI