Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025) ...
Google researchers introduce ‘Internal RL,’ a technique that steers an models' hidden activations to solve long-horizon tasks ...
Apple's researchers continue to focus on multimodal LLMs, with studies exploring their use for image generation, ...
Abstract: This study investigates the design of reward functions for deep reinforcement learning-based source term estimation (STE). Estimating the properties of unknown hazardous gas leakage using a ...
Aran Nayebi, an assistant professor in the Machine Learning Department in CMU's School of Computer Science, is part of a research team that created a virtual zebrafish that acted like a real zebrafish ...
By studying large language models as if they were living things instead of computer programs, scientists are discovering some ...
Abstract: Multi-objective reinforcement learning (MORL) is a structured approach for optimizing tasks with multiple objectives. However, it often relies on pre-defined reward functions, which can be ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results