BAGEL AI

软件描述

We present BAGEL, an open-source multimodal foundation model with 7B active parameters (14B total) trained on large-scale interleaved multimodal data. BAGEL outperforms the current top-tier open-source VLMs like Qwen2.5-VL and InternVL-2.

官方网站

访问软件的官方网站了解更多信息

官方认证

bagel-ai.org

安全链接HTTPS

什么是 BAGEL AI?

We present BAGEL, an open-source multimodal foundation model with 7B active parameters (14B total) trained on large-scale interleaved multimodal data. BAGEL outperforms the current top-tier open-source VLMs like Qwen2.5-VL and InternVL-2.5 on standard multimodal understanding leaderboards, and delivers text-to-image quality that is competitive with strong specialist generators such as SD3. Moreover, BAGEL demonstrates superior qualitative results in classical image-editing scenarios than the leading open-source models. More importantly, it extends to free-form visual manipulation, multiview synthesis, and world navigation, capabilities that constitute "world-modeling" tasks beyond the scope of previous image-editing models. The figure below showcases BAGEL's qualitative performance.

主要功能

✓图像生成 ✓AI写作 ✓人工智能驱动

雷思软件

BAGEL AI

软件描述

官方网站

什么是 BAGEL AI?

主要功能

支持平台

标签

下载与相关链接