Official repository for the paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs". The encoder-free 3D LMM directly utilizes a token embedding module to convert point cloud data ...
More and more large multimodal models (LMMs) are being released from time to time, but the finetuning of these models is not always straightforward. This codebase aims to provide a unified, minimal ...
Abstract: Existing Large Multimodal Models (LMMs) generally focus on only a few regions and languages. As LMMs continue to improve, it is increasingly important to ensure they understand cultural ...
Abstract: Domain adaptation leverages the knowledge in one domain—the source domain—to improve learning efficiency in another domain—the target domain. Existing heterogeneous domain adaptation ...