QwenAI

@alibaba-qwen

The burst of DeepSeek V3 has drawn the attention of the whole AI community to large-scale MoE models. Concurrently, we have been building Qwen2.5-Max, a large MoE LLM pretrained on massive data and post-trained with curated SFT and RLHF recipes. It achieves competitive performance against top-tier models and outperforms DeepSeek V3 on benchmarks such as Arena Hard, LiveBench, LiveCodeBench, and GPQA-Diamond.