FEATUREDAI HOT (Curated Pool)· aihot-apiZH15:00 · 06·04
→Nex-N2-Pro launches as a 397B MoE reasoning model based on Qwen3.5
neolab released Nex-N2-Pro, a 397B-parameter MoE reasoning model based on Qwen3.5-397B-A17B, with 262K context, VLM support, claimed GPT-5.5 and Claude Opus 4.7-level performance, 30–50% fewer thinking tokens, SOTA results on Terminal Bench 2.1, GDPVal, and SWE-Verified, plus free access for the first two weeks via SiliconFlow.
#Reasoning#Multimodal#Agent#neolab
why featured
HKR-H/K/R pass: the title has a strong benchmark hook and the post gives size, context, and token-reduction claims. Kept in 72-77 because it is a single X source and evaluation conditions are not disclosed.
editor take
Nex-N2-Pro has real specs, but “GPT-5.5-level” needs receipts; 397B MoE and 262K context don’t excuse missing eval details.
sharp
Nex-N2-Pro is a strong Qwen3.5 derivative on paper, but the “GPT-5.5 and Claude Opus 4.7-level” line needs to be treated as marketing until the scorecard shows up. The concrete hooks are real: Qwen3.5-397B-A17B, 397B total MoE, 262K context, VLM support, and a claimed 30–50% cut in thinking tokens. It also names Terminal Bench 2.1, GDPVal, and SWE-Verified as SOTA targets. The missing parts are the scores, tool budget, pass@1 settings, and latency.
I care less about the crown claim than the agent-coding economics. If A17B active parameters actually give Claude Code and Cursor users 30–50% lower reasoning-token burn without hurting SWE-Verified, that is a useful product wedge. SiliconFlow’s two-week free access will generate data fast, but free trials hide rate limits and tail latency better than paid traffic does.
HKR breakdown
hook ✓knowledge ✓resonance ✓