Alibaba Unveils Qwen3-Omni: A Native End-to-End Multimodal Foundation Model

Popular：

Virtualization DNS security formal verification reachability analysis compiler errors macro conflict web extension development framework Bitmap Graphics API inconsistencies All Tags

Alibaba Unveils Qwen3-Omni: A Native End-to-End Multimodal Foundation Model

2025-09-22

Alibaba has released Qwen3-Omni, a native end-to-end multilingual omni-modal foundation model. It processes text, images, audio, and video in real-time, delivering streaming responses in text and natural speech. Qwen3-Omni achieves state-of-the-art results across numerous benchmarks, boasts support for multiple languages, and features a novel MoE architecture and flexible control. The model, along with its toolkits, cookbooks, and demos, is open-sourced, providing developers with extensive resources.

(github.com)

Uber CEO Warns of Mass Driver Displacement Due to Self-Driving Cars

SWE-Bench Pro: A Challenging Benchmark for Evaluating LLMs on Software Engineering