A Stochastic Trust-Region Framework for Policy Optimization. (2022). Journal of Computational Mathematics, 40(6), 1004-1030. https://doi.org/10.4208/jcm.2104-m2021-0007