AIR-DREAM Lab
AIR-DREAM Lab
Home
News
Researches
Publications
People
Light
Dark
Automatic
Algorithms
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Safe offline reinforcement learning is a promising way to bypass risky online interactions towards safe policy learning. Most existing …
Yinan Zheng
,
Jianxiong Li
,
Dongjie Yu
,
Yujie Yang
,
Shengbo Eben Li
,
Xianyuan Zhan
,
Jingjing Liu
PDF
Cite
Code
Project
Website
FlexSSL : A Generic and Efficient Framework for Semi-Supervised Learning
Semi-supervised learning holds great promise for many real-world applications, due to its ability to leverage both unlabeled and …
Huiling Qin
,
Xianyuan Zhan
,
Yuanxun Li
,
Yu Zheng
PDF
Cite
Project
A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning
The optimization of traffic signal control (TSC) is critical for an efficient transportation system. In recent years, reinforcement …
Jianxiong Li
,
Shichao Lin
,
Tianyu Shi
,
Chujie Tian
,
Yu Mei
,
Jian Song
,
Xianyuan Zhan
,
Ruimin Li
PDF
Cite
Project
Project
Data-Driven Decision-Making Algorithms
Developing high-performance, robust, generalizable, and deployable data-driven decision-making algorithms for real-world problems.
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
Solving real-world complex tasks using reinforcement learning (RL) without high-fidelity simulation environments or large amounts of …
Haoyi Niu
,
Tianying Ji
,
Bingqi Liu
,
Haocheng Zhao
,
Xiangyu Zhu
,
Jianying Zheng
,
Pengfei Huang
,
Guyue Zhou
,
Jianming Hu
,
Xianyuan Zhan
PDF
Cite
Project
Project
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL
Offline reinforcement learning (RL) offers an appealing approach to real-world tasks by learning policies from pre-collected datasets …
Peng Cheng
,
Xianyuan Zhan
,
Zhihao Wu
,
Wenjia Zhang
,
Shoucheng Song
,
Han Wang
,
Youfang Lin
,
Li Jiang
PDF
Cite
Project
Project
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization
Offline reinforcement learning (RL) has received considerable attention in recent years due to its attractive capability of learning …
Xiangsen Wang
,
Haoran Xu
,
Yinan Zheng
,
Xianyuan Zhan
PDF
Cite
Project
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Offline-to-online reinforcement learning (RL), by combining the benefits of offline pretraining and online finetuning, promises …
Jianxiong Li
,
Xiao Hu
,
Haoran Xu
,
Jingjing Liu
,
Xianyuan Zhan
,
Ya-Qin Zhang
PDF
Cite
Project
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Based on the IVR framework, we further propose two practical algorithms, Sparse Q-learning (SQL) and Exponential Q-learning (EQL), which adopt the same value regularization used in existing works, but in a complete in-sample manner.
Haoran Xu
,
Li Jiang
,
Jianxiong Li
,
Zhuoran Yang
,
Zhaoran Wang
,
Victor Wai Kin Chan
,
Xianyuan Zhan
PDF
Cite
Code
Project
When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning
DOGE marries dataset geometry with deep function approximators in offline RL, and enables exploitation in generalizable OOD areas rather than strictly constraining policy within data distribution.
Jianxiong Li
,
Xianyuan Zhan
,
Haoran Xu
,
Xiangyu Zhu
,
Jingjing Liu
,
Ya-Qin Zhang
PDF
Cite
Code
Project
«
»
Cite
×