AIR-DREAM Lab
AIR-DREAM Lab
Home
News
Researches
Publications
People
Light
Dark
Automatic
Algorithms
Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
In this study, we investigate the DIstribution Correction Estimation (DICE) methods, an important line of work in offline reinforcement …
Liyuan Mao
,
Haoran Xu
,
Weinan Zhang
,
Xianyuan Zhan
PDF
Cite
Project
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Safe offline reinforcement learning is a promising way to bypass risky online interactions towards safe policy learning. Most existing …
Yinan Zheng
,
Jianxiong Li
,
Dongjie Yu
,
Yujie Yang
,
Shengbo Eben Li
,
Xianyuan Zhan
,
Jingjing Liu
PDF
Cite
Code
Project
Website
FlexSSL : A Generic and Efficient Framework for Semi-Supervised Learning
Semi-supervised learning holds great promise for many real-world applications, due to its ability to leverage both unlabeled and …
Huiling Qin
,
Xianyuan Zhan
,
Yuanxun Li
,
Yu Zheng
PDF
Cite
Project
A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning
The optimization of traffic signal control (TSC) is critical for an efficient transportation system. In recent years, reinforcement …
Jianxiong Li
,
Shichao Lin
,
Tianyu Shi
,
Chujie Tian
,
Yu Mei
,
Jian Song
,
Xianyuan Zhan
,
Ruimin Li
PDF
Cite
Project
Project
Data-Driven Decision-Making Algorithms
Developing high-performance, robust, generalizable, and deployable data-driven decision-making algorithms for real-world problems.
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
Solving real-world complex tasks using reinforcement learning (RL) without high-fidelity simulation environments or large amounts of …
Haoyi Niu
,
Tianying Ji
,
Bingqi Liu
,
Haocheng Zhao
,
Xiangyu Zhu
,
Jianying Zheng
,
Pengfei Huang
,
Guyue Zhou
,
Jianming HU
,
Xianyuan Zhan
PDF
Cite
Project
Project
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL
Offline reinforcement learning (RL) offers an appealing approach to real-world tasks by learning policies from pre-collected datasets …
Peng Cheng
,
Xianyuan Zhan
,
Zhihao Wu
,
Wenjia Zhang
,
Shoucheng Song
,
Han Wang
,
Youfang Lin
,
Li Jiang
PDF
Cite
Project
Project
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization
Offline reinforcement learning (RL) has received considerable attention in recent years due to its attractive capability of learning …
Xiangsen Wang
,
Haoran Xu
,
Yinan Zheng
,
Xianyuan Zhan
PDF
Cite
Project
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Offline-to-online reinforcement learning (RL), by combining the benefits of offline pretraining and online finetuning, promises …
Jianxiong Li
,
Xiao Hu
,
Haoran Xu
,
Jingjing Liu
,
Xianyuan Zhan
,
Ya-Qin Zhang
PDF
Cite
Project
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Based on the IVR framework, we further propose two practical algorithms, Sparse Q-learning (SQL) and Exponential Q-learning (EQL), which adopt the same value regularization used in existing works, but in a complete in-sample manner.
Haoran Xu
,
Li Jiang
,
Jianxiong Li
,
Zhuoran Yang
,
Zhaoran Wang
,
Victor Wai Kin Chan
,
Xianyuan Zhan
PDF
Cite
Code
Project
«
»
Cite
×