Qianfan-VL: Domain-Enhanced Vision-Language Models

Qianfan-VL is a series of general-purpose multimodal large language models enhanced for enterprise-level multimodal applications. The models offer deep optimization for high-frequency scenarios in industrial deployment while maintaining strong general capabilities.

🔗 Links: GitHub | HuggingFace | ModelScope | Documentation

MultimodalTextbox
Model