LLM News - AI News | GogoAI News

JetBrains Open-Sources Mellum2: 12B-Parameter AI Coding Model

2026-06-02

JetBrains releases Mellum2, a 12B-parameter MoE model for code completion, offering high performance with low computatio…

2026-06-02

Alibaba releases Qwen3.7-Plus, a mid-tier model designed to balance performance and cost for global developers.

2026-06-02

Microsoft reveals its first self-developed reasoning model MAI-Thinking-1 and a new Copilot super app at Build 2026, sig…

2026-06-02

Alibaba's Qwen3.7-Plus launches as a powerful multimodal agent, enhancing complex reasoning and visual analysis for ente…

2026-06-02

New AWS integrations cut LLM load times by 50% using GPUDirect and FSx for Lustre.

2026-06-02

New MuskAPI service offers GPT-5.5 and Codex models at unprecedentedly low rates with high stability for developers.

2026-06-01

Domestic Chinese LLMs now rival Western tools in coding productivity, offering viable alternatives to GitHub Copilot for…

2026-06-01

Chinese firm MiniMax launches M3, an open-weight model with 1M token context and native multimodality.

2026-06-01

Users discover a loophole in Anthropic's Claude AI, allowing unlimited usage despite strict hourly caps.

2026-06-01

Users report Claude bypassing 5-hour rate limits, allowing continuous output without reducing weekly quotas.

2026-06-01

MiniMax launches M3, targeting enterprise agents with 2M token context and tool execution.

2026-06-01

Kog AI releases KIE, achieving 3000 tokens/s on AMD MI300X without quantization or speculative decoding.