OpenAI Reinforcement learning

DeepSeek-R1’s bold bet on reinforcement learning: How it outpaced OpenAI at 3% of the cost

DeepSeek-R1’s Monday release has sent shockwaves through the AI community, disrupting assumptions about what’s required to achieve cutting-edge AI performance. This story focuses on exactly how DeepSeek managed this feat,

OpenAI, DeepSeek

· 11h

OpenAI finds DeepSeek used its data to train R1 reasoning model

· 20h · on MSN

DeepSeek used OpenAI’s model to train its competitor using ‘distillation,’ White House AI czar says

· 1dunite

DeepSeek vs. OpenAI: The Battle of Open Reasoning Models

unite2d

DeepSeek-R1: Transforming AI Reasoning with Reinforcement Learning

DeepSeek-R1 is the groundbreaking reasoning model introduced by China-based DeepSeek AI Lab. This model sets a new benchmark in reasoning capabilities for open-source AI. As detailed in the accompanying research paper,

Open-source DeepSeek-R1 uses pure reinforcement learning to match OpenAI o1 — at 95% less cost

The company developed DeepSeek-R1 by using pure reinforcement learning on top of DeepSeek-V3-Base, and matched or beat o1 on some benchmarks.

Hosted on MSN16h

China’s DeepSeek Model Outpaces OpenAI—Sam Altman Says OpenAI Data Was Used ‘Unfairly’

DeepSeek, a Chinese AI research lab, has released an advanced AI model which rivals leading models from OpenAI. The DeepSeek-R1 model can perform complicated mathematical reasoning, code generation, and more with fewer resources than its American competitors.

New Qwen-2.5 Max Open Source AI Beats Deepseek and OpenAI

Qwen-2.5 Max AI model by Alibaba outperforms DeepSeek-v3 and rivals GPT-4. Offering advanced coding, math, and vision-language solutions with

Biometric Companies2d

OpenAI launches new AI agent Operator that can perform tasks independently

The AI agent is powered by Computer-Using Agent (CUA), a model combining GPT-4’s vision capabilities with advanced reasoning through reinforcement learning.