Abstract: This article focuses on the applications and advances of Visual Language Modeling (VLM) in 3D scene understanding. The article details several mainstream visual language models and analyzes ...