There are a series of basic problems such as spectral efficiency,reliability and low latency need to be solved urgently for Unmanned Aerial Vehicle(UAV)cluster communication and network.It is a good solution at present to use Deep Reinforcement Learning(DRL)for optimization of UAV cluster communication network.A comprehensive investigation is conducted on optimized scheduling of resources in UAV cluster communication and network.The research results of using DRL method for optimized scheduling of resources in communication and network are summarized,and the future development of the technology is prospected.