Tutorial
Recent Advances in Vision Foundation Models
Zhengyuan Yang · Linjie Li · Zhe Gan · Chunyuan Li · Jianwei Yang
Summit 437- 439
Abstract:
This tutorial covers the advanced topics in designing and training vision foundation models, including the state-of-the-art approaches and principles in (i) learning vision foundation models for multimodal understanding and generation, (ii) benchmarking and evaluating vision foundation models, and (iii) agents and other advanced systems based on vision foundation models.
Chat is not available.