Li, S., Y. Wu, X. Cui, H. Dong, F. Fang, and S. Russell. “Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, July 2019, pp. 4213-20, doi:10.1609/aaai.v33i01.33014213.