AIR-DREAM Lab
Ya-Qin Zhang
Latest
Query-Policy Misalignment in Preference-Based Reinforcement Learning
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning
Mind the Gap: Offline Policy Optimization for Imperfect Rewards