Alibaba's new language model Qwen 2.5 Max is set to revolutionize the industry and boost the company's cloud business, amid competitive pressures. See more here.
In a reasoning test using Arena-Hard, Qwen 2.5-Max achieved 89.4% accuracy, and the result was higher than DeepSeek R1 and when tested on other benchmarks of coding and scientific reasoning, Qwen 2.5 ...
Results that may be inaccessible to you are currently showing.