Representative paper:
Yuankun Jiang, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, “Variance Reduced Domain Randomization for Reinforcement Learning with Policy Gradient”, IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI), 2023.