AIR-DREAM Lab
Xiao Hu
Research Intern
Latest
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
Query-Policy Misalignment in Preference-Based Reinforcement Learning
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Mind the Gap: Offline Policy Optimization for Imperfect Rewards