Tan Jing, Xiaorui Li, Chao Yao, Xiaojuan Ban, Yuetong Fang, Renjing Xu, Zhaolin Yuan. Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning[C]. Proceedings of the International Conference on Learning Representations. 2026.(CCF A类会议论文)