DeepSeek, a Chinese firm, released a large language model named R1, which matched the performance of top US models but was developed at a significantly lower cost and distributed for free. This led to a $1 trillion loss in the US stock market. R1’s development involved innovative techniques, including automated reinforcement learning, which reduced the need for human input in training. DeepSeek’s approach has prompted competitors like Alibaba and the Allen Institute for AI to update their models, claiming superiority over R1. Despite the hype, DeepSeek’s innovations reveal that building reasoning models might be less complex and costly than previously thought, potentially leading to a surge in capable, freely available AI models.
Source: www.technologyreview.com















