TinyZero – Researchers Replicated DeepSeek’s R1-Zero Model for Just $30

In an impressive demonstration of cost-effective AI research, a group of researchers has successfully replicated DeepSeek’s R1-Zero model for just $30. Dubbed TinyZero, this project focuses on countdown and multiplication tasks, leveraging reinforcement learning (RL) to enable a 3-billion-parameter (3B) base language model (LM) to develop self-verification and search abilities autonomously. Built on the veRL […]

The post TinyZero – Researchers Replicated DeepSeek’s R1-Zero Model for Just $30 appeared first on Cyber Security News.

This article has been indexed from Cyber Security News

Read the original article: