Cohere's first developer coding model is a 30B mixture-of-experts running on a single H100 with 256K context length.
DiffusionGemma is Google DeepMind's experimental 26B open model using text diffusion for up to 4x faster generation on GPUs.
一些您可能无法访问的结果已被隐去。
显示无法访问的结果