We propose a video image mosaic method based on multi-module cooperation. This method stitches the video into a panorama with a large field of view, divided into three modules:the key frame selection module, the image mosaic module, and the optimization module. The key frame selection module obtains key frames by comprehensively evaluating the overlap rate and image quality. The image mosaic module stitches the key frames into a panoramic image to generate an initial mosaic result. The optimization module makes the mosaic result more natural and eliminates ghosts by using object detection advantages. Our method is tested on videos taken in real scenes, and the results have a more comprehensive and natural description.