Investigating the Efficacy and Impacts of RL-Based HVAC Control on Indoor and Local Urban Climates.
Junjie Yu, John S. Schreck, David John Gagne, Keith W. Oleson, et al.
Reinforcement learning (RL)-based HVAC control offers an innovative path for urban climate adaptation. However, its interaction with background climate remains complex. This study proposes an integrated framework combining RL with the Community Land Model Urban (CLMU) to evaluate efficacy, impacts, and transferability.
Our findings reveal that "rewards" (balancing energy & comfort) are highly sensitive to background climate. Cities in hot climates achieve higher rewards, while cities with high temperature variability produce more transferable RL policies. This suggests that city-to-city learning could be a viable strategy for deploying smart HVAC controls globally.
The study found a strong correlation between seasonal temperature variation and RL policy transferability.
Cities with larger seasonal temperature swings act as better "training grounds." RL agents trained in these environments learn robust strategies that adapt well to other locations, unlike agents trained in stable climates.
Figure: Transferability score vs Temperature Variability
Comprehensive overview of the RL framework, experimental setup, and cross-city analysis.