The company says its cost-efficient new V4 model is competitive with top closed-source models from OpenAI and Google DeepMind ...
Tencent just open-sourced Hy3 preview, a model that punches above its weight on coding agents, reasoning, and search—built in ...
The model is relatively small, with only 295 billion parameters, bucking a recent trend of large models with trillions of ...
Nvidia's Nemotron-Cascade 2 is a 30B MoE model that activates only 3B parameters at inference time, yet achieved gold medal-level performance at the 2025 IMO, IOI, and ICPC World Finals. Nvidia has ...
DeepSeek released an updated version of its DeepSeek-V3 model on March 24. The new version, DeepSeek-V3-0324, has 685 billion parameters, a slight increase from the original V3 model’s 671 billion.
This isn't about rejecting large models; it's about having the engineering discipline to use smaller, specialized models ...
NEW YORK – Bloomberg today released a research paper detailing the development of BloombergGPT™, a new large-scale generative artificial intelligence (AI) model. This large language model (LLM) has ...
Privacy focused iPhone app LiberaGPT has been updated to now support the largest and most intelligent AI model ever to ...