Multi-Objective Reinforcement Learning

Databricks built a RAG agent it says can handle every kind of enterprise search

Databricks' KARL agent uses reinforcement learning to generalize across six enterprise search behaviors — the problem that breaks most RAG pipelines.

Northwestern's McCormick School of Engineering

Multi-Institution Team Wins Two Awards at AAAI-26 Workshops

First author Canyu Chen led a multi-institution research team in developing a scalable approach to training AI agents without sacrificing users’ data privacy.

EurekAlert!

Multi-objective deep reinforcement learning strategy paves the way for safer, greener autonomous electric mobility

The rapid rise of electric vehicles combined with breakthroughs in autonomous driving technology is reshaping the future of ...

Is this the AI replacing marketing professionals?

EVA Live (Nasdaq:GOAI) has launched NeuroServer, a purpose-built AI system trained specifically for digital advertising rather than built on off-the-shelf AI models.

IEEE

Multi-Objective Reinforcement Learning-Based Dependent Task Scheduling With Service Caching in Mobile Edge Computing

Abstract: This paper investigates dependent task scheduling with service caching (DTSSC) in mobile edge computing (MEC) systems, where each task requires a specific service program for execution.

VentureBeat

Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025)

Every year, NeurIPS produces hundreds of impressive papers, and a handful that subtly reset how practitioners think about scaling, evaluation and system design. In 2025, the most consequential works ...

GitHub

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

GLM-TTS is a high-quality text-to-speech (TTS) synthesis system based on large language models, supporting zero-shot voice cloning and streaming inference. This system adopts a two-stage architecture: ...

Scientific Research Publishing

Multi-Objective Evolutionary Optimization for Qujing’s Cultural-Tourism Routes

Tourism development in emerging destinations requires balancing economic benefits with ecological sustainability. In this study, we investigate the case of multi-attraction tourism planning in Qujing ...

acm.org

Shields for Safe Reinforcement Learning

Deep reinforcement learning (DRL) has elevated RL to complex environments by employing neural network representations of policies. It ...

marktechpost

NVIDIA Researchers Propose Reinforcement Learning Pretraining (RLP): Reinforcement as a Pretraining Objective for Building Reasoning During Pretraining

RLP uses a single network (shared parameters) to (1) sample a CoT policy $\pi_\theta(c_t \mid x_{<t})$ and then (2) score the next token $p_\theta(x_t \mid x_{<t}, c_t)$ ...
