AI models have already been performing quite well on math benchmarks, but they're also making their presence felt in real ...
Alibaba Cloud on Thursday launched QwQ-32B, a compact reasoning model built on its latest large language model (LLM), Qwen2 ...
This remarkable outcome underscores the effectiveness of RL when applied to robust foundation models pre-trained on extensive ...
Alibaba Cloud’s latest model rivals much larger competitors with just 32 billion parameters in what it views as a critical ...
While DeepSeek-R1 operates with 671 billion parameters, QwQ-32B achieves comparable performance with a much smaller footprint ...
All may not be well between Microsoft and OpenAI. A new report suggests that Microsoft is building its own AI model to rival ...
Alibaba's new AI model with fewer parameters rivals DeepSeek's R1 and OpenAI's o1-mini.
Anthropic's newest flagship AI model, Claude 3.7 Sonnet, is a 'hybrid' model that can both 'reason' through and give ...
Researchers behind the MASK benchmark found that more knowledge doesn't mean more 'moral virtue.' See which model lies the ...
OpenAI is launching GPT-4.5 today, its newest and largest AI language model. GPT-4.5 will be available as a research preview ...
An AI expert who requested anonymity told Ars Technica, "GPT-4.5 is a lemon!" when comparing its reported performance to its ...