MATLAB Reinforcement Learning Tutorial

AgiBot Achieves First Real-World Deployment of Reinforcement Learning in Industrial Robotics

SHANGHAI, Nov. 2, 2025 /PRNewswire/ -- AgiBot, a robotics company specializing in embodied intelligence, announced a key milestone with the successful deployment of its Real-World Reinforcement ...

IEEE

Proof-of-Concept of a Reinforcement-Learning Based RT Shimming Technique for HTS Magnets

Abstract: We report a newly developed room-temperature (RT) shimming method for high-temperature superconducting (HTS) magnets employing a deep Q-network (DQN), a type of reinforcement learning theory ...

GitHub

reinforcement-learning-papers

The proceedings of top conferences in 2021 on the topic of Reinforcement Learning (RL), including: AAAI, IJCAI, NeurIPS, ICML, ICLR, ICRA, AAMAS and more. The proceedings of top conferences in 2018 on ...

Morningstar

CoreWeave to Acquire OpenPipe, Leader in Reinforcement Learning

CoreWeave, Inc. (NASDAQ: CRWV), the AI Hyperscaler™, today announced a definitive agreement to acquire OpenPipe Inc, a leading platform for training AI agents with reinforcement learning (RL).

Hosted on MSN

DenseNet Architecture Explained | Deep Learning Tutorial for Beginners

Learn how DenseNet works and why it’s a powerful architecture in deep learning. This tutorial breaks down DenseNet’s key concepts, including dense connections, feature reuse, and parameter efficiency ...

Hosted on MSN

Smooth Cardistry Tutorial for Learning Basic Moves

This video teaches a clean and fluid cardistry sequence focused on foundational movements. It’s designed to help you build flow and control while handling cards with style. Each move is broken down ...

marktechpost

Qwen Researchers Propose QwenLong-L1: A Reinforcement Learning Framework for Long-Context Reasoning in Large Language Models

While large reasoning models (LRMs) have shown impressive capabilities in short-context reasoning through reinforcement learning (RL), these gains do not generalize well to long-context scenarios.

marktechpost

This AI Paper from Tsinghua University Proposes T1 to Scale Reinforcement Learning by Encouraging Exploration and Understand Inference Scaling

Large language models (LLMs) are developed specifically for math, programming, and general autonomous agents and require improvement in reasoning at test time. Various approaches include producing ...

VentureBeat

DeepSeek-R1’s bold bet on reinforcement learning: How it outpaced OpenAI at 3% of the cost

DeepSeek-R1's release last Monday has sent shockwaves through the AI community, disrupting assumptions about what’s required to achieve cutting-edge AI performance. Matching OpenAI’s o1 at just 3%-5% ...
