Mainframe RL - Search News

Bimanual Long-Horizon Manipulation Via Temporal-Context Transformer RL

Abstract: Dual-arm robots can perform bimanual long-horizon (LH) manipulation, surpassing the capabilities of single-arm robots. However, bimanual LH tasks are challenging for robot intelligence due ...

GitHub

Demystifying Reinforcement Learning in Agentic Reasoning

An overview of our research on agentic RL. In this work, we systematically investigate three dimensions of agentic RL: data, algorithms, and reasoning modes. Our findings reveal: Real end-to-end ...

GitHub

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

We propose TraceRL, a trajectory-aware reinforcement learning method for diffusion language models, which demonstrates the best performance among RL approaches for DLMs. We also introduce a ...

IEEE

Reinforcement Learning-Based Control of DC-DC Buck Converter Considering Controller Time Delay

Abstract: Non-linearities and unmodeled dynamics in the control system inevitably degrade the quality and reliability of voltage stabilization performance in DC-DC buck converters. Reinforcement ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results