DeepAgent is a reasoning agent with scalable toolsets, capable of tackling general tasks by searching for and using the appropriate tools from over 16,000 RapidAPIs in an end-to-end agentic reasoning ...
Pull requests help you collaborate on code with other people. As pull requests are created, they’ll appear here in a searchable and filterable list. To get started, you should create a pull request.
Abstract: This paper explores the role of LLMs in supporting undergraduate students' mathematical reasoning when solving electronics problems. We analyzed the model’s responsiveness, problem-solving ...
Recently, Artificial Intelligence (AI) has reached a historic milestone in one of the world’s toughest math contests, the International Mathematical Olympiad (IMO). Google DeepMind’s Gemini Deep Think ...
Singapore-based AI startup Sapient Intelligence has developed a new AI architecture that can match, and in some cases vastly outperform, large language models (LLMs) on complex reasoning tasks, all ...
Existing long-CoT reasoning models have achieved state-of-the-art performance in mathematical reasoning by generating reasoning trajectories with iterative self-verification and refinement. However, ...
RE-IMAGINE synthesizes new reasoning benchmarks by (1) symbolically mutating the solution processes from existing benchmarks, and (2) asking language models to imagine what would happen if the ...
Recent research indicates that LLMs, particularly smaller ones, frequently struggle with robust reasoning. They tend to perform well on familiar questions but falter when those same problems are ...
AI reasoning models were supposed to be the industry’s next leap, promising smarter systems able to tackle more complex problems. Now, a string of research is calling that into question. Researchers ...