MMDetection: from General Object Detection to Multi-modal Agent
1. This part will introduce object detection, instance segmentation, and panoptic segmentation. We will go through problem formulation, challenges, and representative methods in these fields.
2. In this part, we will introduce MMDetection, the OpenMMLab Detection Toolbox and Benchmark. It is one of the most popular object detection toolboxes and the foundation of other toolboxes such as MMDetection3D and MMTracking. We will go through the modular design and model zoo of MMDetection 3.0.
3. Lastly, we will introduce Lagent, a lightweight framework for building LLM-based agents. Lagent can use object detectors implemented in MMDetection. We will cover Lagent's simple design and show how it uses object detectors from MMDetection, as well as models from other libraries, to let a language model work as a multi-modal agent.