Microsoft found that small language models can exceed the performance of much larger ones when trained to specialize in a single area. Researchers fine-tuned the Mistral 7B model to create Orca-Math, ...
New benchmark study results show leading AI models, including ChatGPT, Claude, and Gemini, still lag humans in visual math reasoning.
Chinese AI lab DeepSeek has quietly updated Prover, its AI model that’s designed to solve math-related proofs and theorems. According to South China Morning Post, DeepSeek uploaded the latest version ...
Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...
If OpenAI's new model can solve grade-school math, it could pave the way for more powerful systems. This story is from The Algorithm, our weekly newsletter on AI. To get stories like this in your ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Ludi Akue discusses how the tech sector’s ...
The Register on MSN

AI models still suck at math

Just less than before, according to the ORCA test exclusive Current-day LLMs are prediction engines and, as such, they can only find the most likely solution to problems, which is not necessarily the ...
Researchers have developed a three-dimensional mathematical model of prostate cancer. The model depicts various processes, including tumour growth, genetic evolution and tumour cell competition.