AIR-DREAM Lab
Xiao Hu

PhD student at Tsinghua University

Latest publications

  • Data Center Cooling System Optimization Using Offline Reinforcement Learning
  • DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
  • Query-Policy Misalignment in Preference-Based Reinforcement Learning
  • PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
  • Mind the Gap: Offline Policy Optimization for Imperfect Rewards

© 2025 - AIR-DREAM Lab.
