AIR-DREAM Lab
AIR-DREAM Lab
Home
News
Researches
Publications
People
Light
Dark
Automatic
Robotics
Are Expressive Models Truly Necessary for Offline RL?
Among various branches of offline reinforcement learning (RL) methods, goal-conditioned supervised learning (GCSL) has gained …
Guan Wang
,
Haoyi Niu
,
Jianxiong Li
,
Li Jiang
,
Jianming HU
,
Xianyuan Zhan
PDF
Cite
Code
Project
Project
Instruction-Guided Visual Masking
Instruction following is crucial in contemporary LLM. However, when extended to multimodal setting, it often suffers from misalignment …
Jinliang Zheng
,
Jianxiong Li
,
Sijie Cheng
,
Yinan Zheng
,
Jiaming Li
,
Jihao Liu
,
Yu Liu
,
Jingjing Liu
,
Xianyuan Zhan
PDF
Cite
Code
Project
Project
Website
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
Multimodal pretraining has emerged as an effective strategy for the trinity of goals of representation learning in autonomous robots: …
Jianxiong Li
,
Jinliang Zheng
,
Yinan Zheng
,
Liyuan Mao
,
Xiao Hu
,
Sijie Cheng
,
Haoyi Niu
,
Jihao Liu
,
Yu Liu
,
Jingjing Liu
,
Others
PDF
Cite
Code
Project
Project
Website
Learning-based Methods for Robotics & Autonomous Driving
Learning robust and generalizable control / decision-making policies for robotics and autonomous driving using real-world data.
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
Solving real-world complex tasks using reinforcement learning (RL) without high-fidelity simulation environments or large amounts of …
Haoyi Niu
,
Tianying Ji
,
Bingqi Liu
,
Haocheng Zhao
,
Xiangyu Zhu
,
Jianying Zheng
,
Pengfei Huang
,
Guyue Zhou
,
Jianming HU
,
Xianyuan Zhan
PDF
Cite
Project
Project
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
H2O introduces a dynamics-aware policy evaluation scheme, which adaptively penalizes the Q function learning on simulated state-action pairs with large dynamics gaps, while also simultaneously allowing learning from a fixed real-world dataset.
Haoyi Niu
,
Shubham Sharma
,
Yiwen Qiu
,
Ming Li
,
Guyue Zhou
,
Jianming HU
,
Xianyuan Zhan
PDF
Cite
Code
Project
Project
Discriminator-Guided Model-Based Offline Imitation Learning
Offline imitation learning (IL) is a powerful method to solve decision-making problems from expert demonstrations without reward …
Wenjia Zhang
,
Haoran Xu
,
Haoyi Niu
,
Peng Cheng
,
Ming Li
,
Heming Zhang
,
Guyue Zhou
,
Xianyuan Zhan
PDF
Cite
Project
Project
Cite
×