hot events · 2026-05-13

▸ 37 signals · updated 3m ago

live · 217 today · policy v2

LATENT SPACE · Anthropic pulls Fable and Mythos after US e… · 96
LATENT SPACE · Anthropic launches Claude Fable 5, its firs… · 88
HACKER NEWS FRONTPAGE · Did Anthropic ask for its own export contro… · 82
HACKER NEWS FRONTPAGE · Anthropic flies senior technical staff to D… · 82
AI HOT (CURATED POOL) · WSJ: OpenAI weighs steep price cuts and pla… · 82
HACKER NEWS FRONTPAGE · Bram Cohen: Claude is turning into an assho… · 78
R/LOCALLLAMA · Xiaomi serves MiMo V2.5 at 1000–3000 tps wi… · 78
IMPORT AI (JACK CLAR · AI learns to game society's rules, and Anth… · 78
MIT TECHNOLOGY REVIEW · Google DeepMind is worried about what happe… · 78
DWARKESH PATEL · The sample efficiency black hole: AI models… · 78
LATENT SPACE · Cognition launches FrontierCode: a coding b… · 78
HACKER NEWS FRONTPAGE · Gabriel Weinberg argues with data that “eve… · 78


2026-05-13 · Wed

22:25

32d ago

FEATURED · r/LocalLLaMA · rss · EN · 22:25 · 05·13

→2x RTX 3090 setup for local Qwen 3.6 27B inference

A Reddit user ran Qwen 3.6 27B on a dual RTX 3090 Ubuntu setup, reporting 48GB VRAM, a 262k context window, no NVLink, about 4000 tok/s prompt processing, and 113 tok/s generation.

#Code · #Tools · #Inference-opt · #Qwen

why featured

All HKR axes pass, and this is a first-person local-inference run with concrete numbers. Source is a single Reddit post with limited reproducibility detail, so it sits at the low featured threshold.

editor take

Dual RTX 3090s pushing Qwen 3.6 27B at 113 tok/s is exactly the kind of DIY result that keeps cloud-only AI narratives honest.

sharp

The dual-RTX 3090 result attacks the inference-threshold story, not the training story. The reported setup is 48GB VRAM, 262k context, no NVLink, about 4000 prompt-processing tok/s, and 113 generation tok/s on Qwen 3.6 27B. If reproducible, that is enough for local long-context code review and offline tool workflows. I’d discount it first because Reddit returned 403, so the body only gives the summary. Quantization, batch size, KV-cache settings, and prompt length are not disclosed. Still, the direction is hard to ignore: used 3090s keep eating small and mid-size inference demand. Cloud vendors are no longer selling “can it run”; they are selling concurrency, uptime, and not babysitting a hot Linux box.
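To put the reported throughput in practical terms, here is a rough back-of-the-envelope sketch. It assumes the two quoted rates hold constant across the whole context, which the post does not confirm and which is unlikely at long context lengths; it also treats "262k" as 262,000 tokens. The function and constant names are illustrative only.

```python
def seconds_for(tokens: int, tokens_per_second: float) -> float:
    """Time to process `tokens` at a constant rate (an idealized assumption)."""
    return tokens / tokens_per_second

# Figures as quoted in the item above; constancy across context is assumed.
PROMPT_TPS = 4000        # reported prompt-processing rate, tok/s
GEN_TPS = 113            # reported generation rate, tok/s
CONTEXT_TOKENS = 262_000 # "262k" read as 262,000 tokens (assumption)

full_prefill_s = seconds_for(CONTEXT_TOKENS, PROMPT_TPS)
reply_1k_s = seconds_for(1000, GEN_TPS)

print(f"prefill of a full context: ~{full_prefill_s:.1f} s")
print(f"1,000 generated tokens:    ~{reply_1k_s:.1f} s")
```

Under these assumptions, filling the whole window takes on the order of a minute before the first generated token, which is why the undisclosed prompt length matters when judging the numbers.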

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

22:04

32d ago

FEATURED · The Verge · AI · rss · EN · 22:04 · 05·13

→Microsoft Edge Copilot update uses AI to pull information from across your tabs

Microsoft Edge will let Copilot gather information from all open tabs so users can ask questions, compare products, and summarize articles; the snippet says users can choose which experiences to enable, but the post does not disclose a rollout date.

#Agent · #Tools · #Microsoft · #The Verge

why featured

HKR-H/K/R pass, but the post gives tab-wide reading, product comparison, and summaries without launch timing or deeper execution. This fits the lower featured band for a mid-weight product update.

editor take

Edge tab-reading Copilot sounds like a browser agent; in practice it jams privacy consent and context assembly into one button.

sharp

Microsoft is grabbing browser context before proving a serious browser agent. Edge Copilot will read all open tabs, answer questions, compare products, and summarize articles. Users can choose which experiences stay on, but rollout date, default permission state, and enterprise policy controls are not disclosed. That gap matters because tabs are the messiest high-value context layer in daily work. I read this as defense against Chrome sidebars and Arc/Perplexity-style browsers. Copilot Mode once included agentic moves like booking a reservation; folding it into this update is a retreat in ambition. First make cross-tab retrieval reliable, then talk about acting for the user. The privacy fight will be loud, but the product test is simpler: off by default, and not hallucinating across 20 open tabs.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

21:45

32d ago

FEATURED · TechCrunch AI · rss · EN · 21:45 · 05·13

→Notion just turned its workspace into a hub for AI agents

Notion launched a developer platform that lets teams connect AI agents, external data sources, and custom code directly inside its workspace; the RSS snippet does not disclose pricing, rollout timing, supported models, or limits for the new platform.

#Agent · #Tools · #Notion · #Product update

why featured

HKR-H/K/R all pass, but price, launch timing, and supported models are not disclosed, keeping it in the 72–77 mid-weight product-update band. TechCrunch authority supports featured, not same-day must-write.

editor take

Notion is making a claim on workplace context, not just agents; the snippet gives no pricing, models, rollout, or permission details.

sharp

Notion is trying to own the landing zone for agents, not win a model race. The platform connects AI agents, external data sources, and custom code inside the workspace; that puts the fight around permissions, docs, tasks, and databases, not chat UX. The public detail is thin: pricing, rollout timing, supported models, and rate limits are absent. Against Slack, Microsoft 365 Copilot, and Atlassian Rovo, Notion has a cleaner knowledge-base and lightweight-database habit. Its weakness is enterprise control and system-of-record depth. If permission inheritance, audit logs, and third-party agent sandboxing are loose, this becomes a prettier plugin directory.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

21:24

32d ago

● P1 · Hacker News Frontpage · rss · EN · 21:24 · 05·13

→Medicare introduces new payment model designed for AI

The title says Medicare’s new payment model is built for AI, but the RSS body only provides the article URL, Hacker News URL, 3 points, and 0 comments; the post does not disclose the model mechanism, coverage scope, or launch timeline.

#Medicare · #TechCrunch · #Hacker News · #Policy

why featured

Triggers hard-exclusion-6: only title, URL, 3 HN points, and 0 comments are available, with no data, example, or mechanism. HKR-H passes, but the sourcing is too thin for K and R to pass.

editor take

Medicare opening reimbursement for AI agents beats another hospital copilot demo; still, this is a TechCrunch-to-HN signal chain, not market proof.

sharp

TechCrunch and HN carried the same Medicare ACCESS story with the same frame; HN is amplification, not independent confirmation. The hard hook is specific: Medicare lacked a way to pay an AI agent for between-visit monitoring, check-in calls, housing referrals, or medication pickup reminders, and ACCESS creates that payment slot. I find this harder than most healthcare AI funding news because U.S. health software usually hits reimbursement walls before model walls. Abridge and Nabla can ride existing documentation workflows; care-coordination agents stay pilots when no payer funds the work. The catch is equally concrete: the body does not give rates, eligibility rules, or liability design. Founders can map workflows today, but they cannot underwrite revenue from this article alone.

HKR breakdown

hook ✓ · knowledge — · resonance —

→ open source

SCORE

H1·K0·R0

20:41

32d ago

FEATURED · r/LocalLLaMA · rss · EN · 20:41 · 05·13

→24+ tok/s from ~30B MoE models on an old GTX 1080

User mdda ran Qwen 3.6 35B-A3B on an i7-6700, GTX 1080, and 32GB RAM machine at about 24 tok/s with 128k context; the setup uses llama.cpp MoE offloading plus TurboQuant/RotorQuant KV cache quantization, with PCIe 3.0 x16 saturated and GPU utilization at about 40–50%.

#Inference-opt · #Qwen · #Gemma · #llama.cpp

why featured

Single Reddit source limits authority, but the GTX 1080 + Qwen 3.6 35B-A3B + 128k + 24 tok/s setup gives a concrete local-inference result. HKR-H/K/R all pass; this is a practical featured item, not a major model or product launch.

editor take

A GTX 1080 hitting 24 tok/s on a 30B-ish MoE is a reminder: local inference gains are coming from memory plumbing, not model magic.

sharp

The sharp part is not that an old GPU runs a big model; it is that 128k context now has a repeatable engineering path on 8GB VRAM. The summary names the setup: i7-6700, GTX 1080, 32GB RAM, Qwen 3.6 35B-A3B, about 24 tok/s. The mechanism is llama.cpp MoE offloading plus TurboQuant / RotorQuant KV-cache quantization, with PCIe 3.0 x16 saturated and GPU utilization around 40–50%. I would discount the 24 tok/s number until the missing run details show up. Reddit returned 403, so prompt length, batch size, quant level, and generation-phase curve are not visible. Still, the direction is clear: sparse MoE activation plus KV compression is moving local inference away from the “just buy more VRAM” story and toward cache and bus management.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

20:22

32d ago

FEATURED · Hacker News Frontpage · rss · EN · 20:22 · 05·13

→Meta won't let users block its AI account on Threads

Meta does not let users block its AI account on Threads; the RSS snippet only lists 37 Hacker News points and 10 comments, and the post does not disclose the account mechanism or scope.

#Meta · #Threads · #Hacker News · #Product update

why featured

HKR-H/K/R all pass because the title gives a concrete, debate-ready product constraint. The body lacks mechanism, scope, or Meta rationale, so this stays in the 60–71 small platform/product controversy band.

editor take

Meta AI being unblockable on Threads is not a UX quirk; Meta is treating its bot like infrastructure and demoting user control.

sharp

Two outlets picked this up, but HN is only relaying The Verge, so the fact chain is thin: Meta prevents Threads users from blocking the Meta AI account, and the body only says users can tag it for answers. That reads less like a technical constraint and more like a product decision. I don’t buy the “assistant account” framing. Normal accounts can be blocked; Meta AI cannot. That changes the category from publisher to platform fixture. After Meta pushed AI into search, Instagram, and WhatsApp, Threads is the blunt version. For AI practitioners, the model quality is secondary here: once distribution is hardwired, refusal becomes a setting Meta can remove.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

20:15

32d ago

FEATURED · Bloomberg Technology · rss · EN · 20:15 · 05·13

→Cisco Raises Sales Forecast and Announces Job Cuts to Focus on AI

Cisco rose as much as 19% in late trading after issuing a better-than-expected sales forecast and announcing plans to cut thousands of jobs to focus on the AI market.

#Cisco · #Product update · #Personnel

why featured

Cisco is AI-infrastructure adjacent, and the 19% after-hours move plus thousands of cuts clears HKR. The summary lacks concrete sales guidance, AI revenue scale, and restructuring mechanics, so this stays in the 60–71 generic industry-reporting band.

editor take

Cisco posted record quarterly revenue and still cut 5% for AI spending; that smells less like AI productivity and more like cost discipline wearing an AI badge.

sharp

Cisco raised its sales outlook and cut 5% of jobs, with TechCrunch putting the hit near 4,000 people. Bloomberg frames the upside forecast and stock move; TechCrunch frames the layoffs. Both still put AI restructuring in the headline, so the coverage is aligned around Cisco’s earnings message. I don’t buy the “spend more on AI” wrapper yet. Cisco reported record quarterly revenue and still removed 5% of staff, which makes AI look like a capital-allocation label before a product proof point. The disclosed material here gives no AI order number, GPU-networking revenue, retention signal, or margin bridge. Compared with Arista’s cleaner AI cluster networking demand story, Cisco’s version reads like a legacy org reshuffle dressed as an AI push.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

19:49

32d ago

FEATURED · TechCrunch AI · rss · EN · 19:49 · 05·13

→Musk’s xAI Is Running Nearly 50 Gas Turbines Unchecked at Its Mississippi Data Center

Musk’s xAI is running nearly 50 gas turbines at its Colossus 2 data center in Mississippi, and the company faces a lawsuit over using “mobile” gas turbines as power plants; the RSS snippet does not disclose permitting details or the lawsuit’s specific claims.

#xAI · #Elon Musk · #Incident · #Policy

why featured

All HKR axes pass: a sharp conflict hook, concrete turbine/lawsuit facts, and resonance around AI data-center power compliance. Not a model or product release, so it stays near the featured threshold.

editor take

xAI running nearly 50 gas turbines for Colossus 2 smells less like AI hustle and more like regulatory arbitrage around power.

sharp

xAI just exposed the physical bill behind Colossus 2: nearly 50 gas turbines, with a lawsuit centered on treating “mobile” turbines like power plants. The bottleneck for frontier AI is no longer just GPUs or model talent; it is grid access, permitting, emissions, and local tolerance. The article is only an RSS snippet, so permitting status, emissions numbers, and the lawsuit’s exact claims are not disclosed. Still, the pattern is loud. OpenAI, Anthropic, and Meta have spent the last year locking down power and data-center capacity through formal deals. xAI’s version looks more Musk-coded: deploy first, make the externalities somebody else’s meeting. If Colossus 2 needs temporary gas turbines to keep scaling, the speed story now carries a regulatory and environmental debt.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

19:29

32d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 19:29 · 05·13

→Best Practices for Computer and Browser Use with Claude

Anthropic published guidance for Claude computer and browser use, with Claude 4.6 API screenshots capped at a 1,568-pixel long edge and 1.15 million total pixels, while Opus 4.7 raises the limits to 2,576 pixels and 3.75 million total pixels.

#Agent · #Vision · #Tools · #Anthropic

why featured

Anthropic’s first-party Claude computer/browser guide has actionable screenshot limits, not just promo copy. HKR-H/K/R all pass, but this is a practice guide rather than a major model or capability launch, so it sits in the 72–77 band.

editor take

Anthropic raising Claude screenshots from 1.15M to 3.75M pixels is a practical agent upgrade; many GUI failures start as bad vision, not bad reasoning.

sharp

The useful part of Anthropic’s browser-use post is the pixel budget, not the “best practices” label. Opus 4.7 raises API screenshots to a 2,576-pixel long edge and 3.75M total pixels, up from Claude 4.6’s 1,568 pixels and 1.15M. That matters because GUI agents often fail before reasoning starts: tiny buttons, missed table columns, modal states, and dense SaaS screens. I like this because it treats computer use as an input-engineering problem, not a demo reel. OpenAI and Google talk about operating browsers too, but Anthropic is spelling out constraints developers can actually tune: screenshot size, page framing, and action granularity. The missing pieces are pricing and latency. If 3.75M-pixel screenshots are slow or expensive, teams will keep downsampling and then blame the model for vision errors.
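As a rough illustration of what the two pixel budgets mean for a client, the sketch below downsizes a screenshot so it satisfies both the long-edge cap and the total-pixel cap while keeping the aspect ratio. The limits are the figures quoted in the item; the dictionary keys and function name are hypothetical, and this makes no claim about how the API itself handles oversized images.

```python
import math

# Limits as quoted in the item above: (max long edge in px, max total pixels).
# The keys are illustrative labels, not API model identifiers.
LIMITS = {
    "claude-4.6": (1568, 1_150_000),
    "opus-4.7": (2576, 3_750_000),
}

def fit_screenshot(width: int, height: int, max_long_edge: int, max_pixels: int):
    """Return (w, h) scaled down, aspect ratio preserved, to satisfy both caps."""
    scale = min(
        1.0,                                        # never upscale
        max_long_edge / max(width, height),         # long-edge cap
        math.sqrt(max_pixels / (width * height)),   # total-pixel cap
    )
    # Floor so rounding can never push the result back over a cap.
    return max(1, math.floor(width * scale)), max(1, math.floor(height * scale))

for label, (edge, pixels) in LIMITS.items():
    print(label, fit_screenshot(3840, 2160, edge, pixels))
```

For a 4K capture the total-pixel cap binds under the older limits and the long-edge cap binds under the newer ones, so the larger budget keeps much more of the fine detail that dense SaaS screens depend on.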

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

18:09

32d ago

FEATURED · MIT Technology Review · rss · EN · 18:09 · 05·13

→AI chatbots are giving out people’s real phone numbers

MIT Technology Review documents three cases where Gemini surfaced real personal phone numbers in customer-service or contact-info answers. DeleteMe says generative-AI privacy queries rose 400% in seven months, with 55% referencing ChatGPT, 20% Gemini, 15% Claude, and 10% other tools.

#Safety · #Alignment · #MIT Technology Review · #Google

why featured

MIT Technology Review adds concrete cases and DeleteMe figures, so HKR-H/K/R all pass. The impact is privacy and product-liability risk, not a model or platform-level update, keeping it just above the featured threshold.

editor take

Don’t file this as a Gemini bug; phone numbers in answers expose how weak model vendors’ deletion paths still are.

sharp

Gemini putting real phone numbers into customer-service answers is not just hallucination; it is a missing PII exit path. MITTR has only 3 documented cases, so the public sample is small. DeleteMe says generative-AI privacy queries rose 400% in 7 months, with 55% naming ChatGPT, 20% Gemini, 15% Claude, and 10% other tools. That split mostly tracks product reach, not incident rates. I don’t buy “PII in training data” as a sufficient explanation. RAG, search grounding, business listings, and scraped contact pages can all inject numbers at answer time. In Google Search, people at least know the surface: index removal, robots.txt, result complaints. In chatbot answers, the phone number arrives as a synthesized statement, and the victim has no obvious layer to challenge.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

17:15

32d ago

● P1 · Bloomberg Technology · rss · EN · 17:15 · 05·13

→Microsoft Has Invested Over 100 Billion Dollars in OpenAI Partnership

Microsoft has spent more than $100 billion on its OpenAI partnership, but the RSS snippet does not disclose the spending breakdown, timeline, or agreement terms.

#Microsoft · #OpenAI · #Partnership

why featured

HKR-H/K/R all pass: Bloomberg adds a striking over-$100B figure tied to Microsoft-OpenAI economics and control. The post does not disclose spend composition, timeline, or agreement terms, so it stays at 84.

editor take

Both items are Bloomberg title-only through a 403 wall; $100B spent and $92B targeted return smells like Microsoft turning OpenAI into an investor-facing ledger.

sharp

Both items are Bloomberg-only in this feed, and the titles provide two hard numbers: Microsoft spent over $100 billion on the OpenAI partnership, while it targeted a $92 billion return on the early investment. The body is blocked by a 403 page, so the accounting basis and timeline are not disclosed. I read this less as another “strategic partnership” story and more as Microsoft’s AI capex narrative getting pulled back into the income statement. A $100 billion-plus commitment is no longer just preferred Azure supply. If the $92 billion return target came from internal modeling, investors should press on three mechanics: revenue recognition, GPU depreciation, and OpenAI profit-sharing. Compared with the widely cited $10 billion 2023 investment, this scale turns OpenAI from a product halo into a balance-sheet question.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

17:14

32d ago

● P1 · Bloomberg Technology · rss · EN · 17:14 · 05·13

→Anduril Raises $5 Billion Funding Round, Doubles Valuation to $61 Billion

Anduril doubled its valuation to $61 billion in a fresh $5 billion funding round led by Thrive Capital and Andreessen Horowitz; CEO Brian Schimpf said the company will invest aggressively in manufacturing capacity, research and development, and infrastructure.

#Robotics · #Anduril · #Thrive Capital · #Andreessen Horowitz

why featured

HKR-H/K/R all pass: the $61B valuation, $5B round, and use of proceeds are concrete. It is major defense-robotics funding, not a core model release, so it sits in the 78–84 band.

editor take

Anduril’s $61B tag says defense AI is being priced less like software and more like a Pentagon procurement rail.

sharp

FT and Bloomberg both frame Anduril as doubling its valuation to $61B or over $60B. The FT body is paywalled here, so the $5 billion round size and lead investors come only from Bloomberg’s snippet, and the terms are not disclosed. That alignment smells like one financing narrative being shopped, not two outlets independently surfacing separate facts. My read: Anduril is no longer being priced like a normal AI startup. A $61B valuation puts it closer to a pre-IPO SpaceX-style defense asset than an app-layer model company. The asset is not a benchmark chart; it is Lattice, autonomous systems, sensors, delivery credibility, and access to US defense procurement. Compared with labs fighting over SWE-bench or token pricing, Anduril is selling integration into budget lines. AI people should read this as procurement leverage getting venture multiples.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

17:10

32d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 17:10 · 05·13

→Claude paid plans will offer monthly coding usage credits

Claude paid plans can claim monthly coding usage credits from June 15, covering Claude Agent SDK, claude -p, Claude Code GitHub Actions, and third-party apps built on the Agent SDK.

#Agent · #Code · #Tools · #Claude

why featured

HKR-H/K/R all pass: the update names a date, quota mechanism, and covered Claude coding surfaces. Importance stays in the low featured band because this is a billing/access change, not a model release.

editor take

Anthropic is carving coding credits out of paid Claude plans; that lowers Claude Code friction while drawing a clearer fence around heavy dev usage.

sharp

Anthropic is changing distribution, not model capability. Starting June 15, paid Claude plans can claim monthly coding credits covering Claude Agent SDK, claude -p, Claude Code GitHub Actions, and third-party apps built on the Agent SDK. That bundle is pointed: Anthropic wants developers living in terminals, CI, and agent apps, not only in Claude chat. Pricing, credit size, rollover, and overage rules are not given, so I would not call this a price cut. It smells more like Claude Code being folded into subscription value, forcing OpenAI Codex, Cursor, and GitHub Copilot to compete on usable included quota. Developers will judge this by how many PRs, CI runs, and agent loops one paid plan actually buys.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

17:02

32d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 17:02 · 05·13

→Introducing Runway Agent

Runway launched Runway Agent, a video creation agent that turns one natural-language conversation into multi-scene videos with narration, dialogue, and music; new free-plan users receive 1,500 credits for their first video.

#Agent · #Multimodal · #Tools · #Runway

why featured

HKR-H/K/R pass: a notable AI-video vendor ships an agentic multi-scene workflow with a 1,500-credit free plan. Score stays in the 72–77 band because the post is still a vendor announcement without pricing, limits, or independent tests.

editor take

Runway Agent sells a conversational producer, but only discloses 1,500 free credits; no quality, duration, or pricing proof yet, so don’t buy the finished-video story.

sharp

Runway Agent’s suspect phrase is “ready-to-publish.” The post names one conversation, multi-shot video, voiceover, dialogue, music, timeline editing, and 1,500 free credits. It does not give max duration, resolution, per-video cost, retry rules, or a measurable bar for brand consistency. Runway is aiming less at auteur filmmaking and more at low-budget video throughput for marketing teams. That is the right wedge: social ads and product clips tolerate more weirdness than narrative film. Runway also has a cleaner commercial workflow than Sora’s demo-heavy posture: editor, references, aspect ratio, audio preferences, and enterprise-facing positioning. But “minutes” to high-resolution multi-shot video still has four ugly gates: visual continuity, subtitles/lip sync, music rights, and brand review. The agent label sells well; the delivery standard is still hidden inside the product.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

16:45

32d ago

● P1 · The Verge · AI · rss · EN · 16:45 · 05·13

→Meta AI launches Incognito Chat with end-to-end encryption

Mark Zuckerberg announced Meta AI Incognito Chat, saying it stores no conversation logs on servers and uses end-to-end encryption; the post does not disclose rollout scope, retention audit details, or the key-management mechanism.

#Safety · #Meta · #Mark Zuckerberg · #The Verge

why featured

Meta’s Incognito Chat clears HKR-H with the privacy-contrast hook, HKR-K with E2E encryption plus no server logs, and HKR-R on trust. Missing rollout, retention audit, and key-management details keep it at the mid-weight product-update threshold.

editor take

Three outlets cover Incognito Chat, but only titles are disclosed; Meta is selling “private AI” inside WhatsApp before regulators define the rules.

sharp

Three sources cover Incognito Chat with the same frame: WhatsApp, Meta AI, and end-to-end encryption. That alignment smells like a coordinated Meta product push, not independent discovery. The disclosed text gives no rollout markets, default setting, retention window, or whether encryption covers user-to-model processing rather than only chat transport. I don’t buy the “completely private” framing yet. AI chat is not a normal WhatsApp message: inference needs context handling, safety logging, and often tool calls. If Meta only encrypts the chat wrapper while server-side model processing still sees content, the privacy claim has a hole exactly where practitioners care. Apple’s Private Cloud Compute at least made the audit and hardware boundary part of the pitch; Meta’s title-level story gives us a nice door label, not the room layout.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

16:28

32d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 16:28 · 05·13

→Anthropic Launches Claude for Small Business Package

Anthropic launched Claude for Small Business with connectors and 15 ready-made automation workflows for QuickBooks, PayPal, HubSpot, and related business tools; users run tasks through Claude Cowork and manually approve key steps.

#Agent · #Tools · #Anthropic · #Claude

why featured

HKR-H/K/R all pass: the Anthropic SMB bundle has 15 workflows, named connectors, and a manual approval mechanism. It is a substantive Claude product update, but pricing, rollout scope, and usage data are not disclosed, so it stays below must-write.

editor take

Anthropic is pushing Claude into QuickBooks and PayPal, not another chat box. Fifteen workflows are concrete; missing pricing keeps the ROI claim soft.

sharp

Anthropic is taking the practical route: Claude for Small Business plugs into QuickBooks, PayPal, HubSpot, Canva, Docusign, Google Workspace, and Microsoft 365, then goes after messy back-office work. The 15 agentic workflows cover payroll planning, month-end close, invoice chasing, campaigns, and contract review. Manual approval before sending, posting, or paying is the right constraint, not a weakness. This smells like Claude Cowork becoming Zapier plus a finance clerk for SMBs. The strong part is data locality: books, settlements, pipeline, and documents live in the connected tools. The soft part is what Anthropic leaves out: pricing, permission granularity, audit logs, and liability. A bad QuickBooks reconciliation is not a bad email draft. If Anthropic sells saved hours without owning error boundaries, the risk lands on the owner.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

16:08

32d ago

FEATURED · r/LocalLLaMA · rss · EN · 16:08 · 05·13

→sensenova/SenseNova-U1-A3B-MoT · Hugging Face

SenseNova published SenseNova-U1-A3B-MoT on Hugging Face; the post lists A3B MoT, 8B MoT, and 0.4B LoRA weight links, and says the NEO-unify architecture unifies multimodal understanding, reasoning, and generation in one model family.

#Multimodal · #Vision · #Reasoning · #SenseNova

why featured

HKR-H/K/R all pass: an open multimodal model release with multiple weight sizes and a named NEO-unify mechanism. Source authority and missing benchmarks/license details keep it in the lower featured band.

editor take

Only the summary is visible: SenseNova put A3B/8B MoT and 0.4B LoRA links on HF; I don’t buy “unified multimodal” until weights and evals hold up.

sharp

SenseNova’s drop reads like an open-distribution probe, not a capability claim I’d trust yet. The summary gives three concrete hooks: SenseNova-U1-A3B-MoT, an 8B MoT link, and a 0.4B LoRA link. The fetched body is only a Reddit 403, so license, training mix, context length, inference cost, and benchmarks are not visible. “NEO-unify” unifying understanding, reasoning, and generation is cheap language in multimodal land. Qwen-VL, InternVL, and community Llama vision finetunes already made Hugging Face the default proving ground. The gap shows up in reproducible OCR, charts, video frames, tool use, and multi-turn visual reasoning. If SenseNova ships links without eval cards and commercial terms, LocalLLaMA will turn it into a reality check fast.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

15:38

32d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 15:38 · 05·13

→Open-source psql_bm25s speeds up PostgreSQL retrieval for multi-agent systems by 23x

The team open-sourced psql_bm25s, a native PostgreSQL access method for exact BM25 retrieval, and says it runs about 23x faster than pg_search on standard benchmarks.

#Agent · #RAG · #PostgreSQL · #psql_bm25s

why featured

HKR-H/K/R pass via the 23x retrieval-speed hook, named Postgres access method, and RAG latency pressure. Single-source release details lack independent reproduction and production constraints, so it stays in the lower featured band.

editor take

23x is loud, but I want the benchmark harness first; exact BM25 inside Postgres helps RAG engineers, not the agent narrative.

sharp

psql_bm25s has a clean pitch: keep exact BM25 inside PostgreSQL as a native access method, with one less search stack to run. The article gives one hard number, about 23x faster than pg_search, but it omits corpus size, indexing time, update cost, and concurrency settings. That makes 23x a lead, not a migration case. I buy the engineering direction more than the multi-agent framing. Production RAG teams already use Postgres for permissions, metadata, and transactions; folding retrieval back into the same database cuts operational surface area versus Elasticsearch or OpenSearch. The hard part starts after the benchmark: frequent writes, tenant isolation, vacuum behavior, locks, and index bloat. BM25 latency is only the first gate.
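For readers who want a reference point for what "exact BM25" computes, here is a minimal pure-Python sketch of textbook Okapi BM25 scoring. It is not psql_bm25s's implementation: the whitespace tokenizer, the k1=1.5 and b=0.75 defaults, and the smoothed IDF form are common choices assumed here for illustration, and the function name is hypothetical.

```python
import math
from collections import Counter

def bm25_scores(query: str, docs: list[str], k1: float = 1.5, b: float = 0.75) -> list[float]:
    """Score every document against the query with textbook Okapi BM25."""
    tokenized = [d.lower().split() for d in docs]   # naive whitespace tokenizer (assumption)
    n = len(tokenized)
    avgdl = sum(len(t) for t in tokenized) / n      # average document length

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))

    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            # Smoothed IDF that stays non-negative for very common terms.
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            denom = tf[term] + k1 * (1 - b + b * len(toks) / avgdl)
            score += idf * tf[term] * (k1 + 1) / denom
        scores.append(score)
    return scores

docs = [
    "postgres index access method",
    "agents call tools",
    "bm25 ranking in postgres",
]
print(bm25_scores("postgres bm25", docs))
```

A brute-force scan like this touches every document per query; an index-backed access method exists to avoid exactly that, which is why corpus size and update cost are the missing parts of the 23x claim.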

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

13:44

32d ago

FEATURED · Hacker News Frontpage · rss · EN · 13:44 · 05·13

→Show HN: Rotunda - A Browser Built for Agents with Simulated Typing

Pierce released Rotunda, a Firefox 150-based browser for agents that simulates mouse and keyboard timing with an RNN trained on one week of his own patterns, and exposes local control through a CLI or Playwright API for Claude, Codex, or other harnesses.

#Agent#Tools#Rotunda#Firefox

why featured

HKR-H/K/R all pass: simulated input timing is a concrete hook, RNN plus Playwright gives a testable mechanism, and agent-browser reliability is a live builder pain. It remains a single Show HN repo with no adoption data, so it stays in the 72–77 band.

editor take

Rotunda’s hook isn’t an agent browser; it’s human-like input spoofing trained on one week of personal patterns. Fraud teams will care before UX teams do.

sharp

Rotunda pushes browser agents straight into anti-bot territory. It is Firefox 150-based, exposes local control through a CLI and Playwright API for Claude or Codex, and uses an RNN to simulate mouse and keyboard timing. The training set is one week of the author’s own behavior. That is useful engineering, but it attacks the exact signal many bot-detection stacks still lean on: nonhuman input cadence. I don’t buy the clean “agent-first browser” framing. Browserbase, Steel, and similar projects mostly sell sessions, state, and tool plumbing. Rotunda smells closer to a local anti-detection lab. The repo text here does not disclose which sites it passes, false-block rates, or latency overhead. Without those numbers, this is a red-team-shaped prototype, not a production agent runtime.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

13:00

32d ago

FEATURED · Bloomberg Technology · rss · EN · 13:00 · 05·13

→Amazon Integrates Alexa Into Shopping Search Bar

Amazon is putting Alexa into the Amazon.com shopping search bar, according to the RSS snippet; the post does not disclose rollout scope, ranking mechanics, conversion metrics, pricing impact, or a launch timeline, and only states that AI algorithms are coming to one of Amazon’s most valuable retail surfaces.

#Agent#Tools#Amazon#Alexa

why featured

This is a standard Amazon product-entry update: HKR-H passes, but HKR-K lacks mechanism or data and HKR-R is weak, so it stays in the lower small-update band.

editor take

Amazon put Alexa into shopping search, but only headlines are disclosed; this smells like search-entry defense, not Alexa+ proving itself.

sharp

Bloomberg and TechCrunch both frame this as Amazon putting Alexa or Alexa+ inside the shopping search bar, so the angle is aligned; the body gives no pricing, rollout scope, date, or conversion metric. The signal is plain: Amazon is moving generative UI into its highest-frequency commerce surface, not trying to revive Alexa as a living-room voice brand. I don’t buy the “AI shopping assistant” wrapper yet. Amazon’s constraint is ads and ranking, not chat quality. Once Alexa+ sits in search, the hard questions are how answers place products, label sponsored slots, and handle bad reviews. Compared with Perplexity shopping or Google AI Mode, Amazon’s edge is not conversation; it owns the checkout path.

HKR breakdown

hook ✓ · knowledge — · resonance —

→ open source

SCORE

H1·K0·R0

12:29

32d ago

FEATURED · r/LocalLLaMA · rss · EN · 12:29 · 05·13

→AIDC-AI/Ovis2.6-80B-A3B on Hugging Face

AIDC-AI released Ovis2.6-80B-A3B, a multimodal MoE model with 80B total parameters and about 3B active parameters at inference, supporting a 64K-token context window and images up to 2880×2880 resolution.

#Multimodal#Vision#Reasoning#AIDC-AI

why featured

HKR-H/K/R pass: the open multimodal MoE has concrete specs and a real efficiency hook. Score stays near the featured floor because the post gives no benchmarks, license details, or hands-on results.

editor take

Ovis2.6-80B-A3B’s hook is not 80B; it’s 3B active params with 64K context and 2880px images. Open VLMs are chasing cheap document work.

sharp

Ovis2.6-80B-A3B brings open multimodal competition back to serving economics: 80B total params sounds large, but ~3B active params is the number LocalLLaMA users will care about. The 64K context window, 2880×2880 image input, OCR, charts, and long-document QA all point at one workload: cheap document understanding, not another chat demo. I have doubts about the “Think with Image” framing. If cropping, rotation, and region re-checking are truly callable inside the reasoning loop, that tracks the tool-using vision direction Gemini and Claude have been moving toward. But the snippet gives no benchmark, latency, VRAM, or throughput under high-resolution inputs. I’d judge this release by deployment cost first, slogans second.
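A rough sizing sketch for that deployment-cost point, assuming the usual MoE trade-off: per-token compute tracks active parameters, while weight memory tracks total parameters. The byte widths (2 for bf16, 0.5 for 4-bit) and the function name are my assumptions, not figures from the post, and the estimate ignores KV cache, activations, and any vision-encoder overhead.

```python
def weight_memory_gb(total_params: float, bytes_per_param: float) -> float:
    """Approximate weight footprint in decimal gigabytes."""
    return total_params * bytes_per_param / 1e9


TOTAL_PARAMS = 80e9   # from the release: 80B total parameters
ACTIVE_PARAMS = 3e9   # from the release: about 3B active per token

active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
bf16_gb = weight_memory_gb(TOTAL_PARAMS, 2.0)   # assumed 2 bytes per parameter
int4_gb = weight_memory_gb(TOTAL_PARAMS, 0.5)   # assumed 4-bit quantization

print(f"active fraction: {active_fraction:.2%}")
print(f"bf16 weights: {bf16_gb:.0f} GB, 4-bit weights: {int4_gb:.0f} GB")
```

Under those assumptions, and unless experts are offloaded, all 80B parameters still have to be resident somewhere, so the ~3B figure buys throughput more than it buys a small memory footprint.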

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

12:00

32d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 12:00 · 05·13

→Configuring Development Environments for Agents

Cursor released tools for cloud agent development environments, adding multi-repository support, Dockerfile-based configuration, audit logs, and environment-level network and secret controls; the post says cache hits improve build speed by 70%.

#Agent#Code#Tools#Cursor

why featured

HKR-K and HKR-R pass: Cursor adds concrete cloud-agent environment controls, including Dockerfile setup, audit logs, permissions, and 70% faster cached builds. HKR-H is weaker, so this sits at the lower featured band.

editor take

Cursor is moving the agent fight into environment control: multi-repo, Dockerfiles, audit logs. That matters more than another coding demo.

sharp

Cursor is fixing the unglamorous part of agents: the environment, not the model. Multi-repo workspaces, Dockerfile config, build secrets, audit logs, and environment-level network and secret controls are the gates before enterprises let cloud agents touch real engineering workflows. The 70% faster rebuild claim on cache hits is a concrete nod to cold-start pain. I trust this kind of release more than another “agent fixes a bug” demo. Devin, Copilot Workspace, and OpenAI Codex-style products have hit the same wall: writing code is cheap; cloning the right repos, installing private deps, running tests, and reaching internal services is where autonomy dies. Cursor still leaves gaps around isolation, permission inheritance, and rollback. Without those details, “fleets of agents” still smells like handing CI access to a room full of interns.

HKR breakdown

hook — · knowledge ✓ · resonance ✓

→ open source

SCORE

H0·K1·R1

11:00

32d ago

● P1 · OpenAI Blog · rss · EN · 11:00 · 05·13

→OpenAI builds secure sandbox for Codex on Windows

OpenAI built a secure sandbox for Codex on Windows. The RSS snippet discloses controlled file access and network restrictions, but the post does not disclose implementation details, performance data, or rollout conditions.

#Agent#Code#Safety#OpenAI

why featured

OpenAI details a Windows sandbox for Codex with file-access and network controls. It is not a major model release, but HKR-H/K/R all pass because the safety boundary matters for coding-agent adoption.

editor take

OpenAI’s Windows Codex sandbox is the unglamorous blocker: coding agents don’t become daily tools until OS permissions stop being a trust fall.

sharp

Two sources track the same OpenAI engineering post, and their angles are aligned; aihot reads like a relay, so this is still an official-source chain. OpenAI says Windows Codex had two bad modes: approve nearly every command, or enable Full Access. That explains why agentic coding on Windows has felt half-finished. I buy the engineering diagnosis more than the product gloss. OpenAI walks through AppContainer, Windows Sandbox, and MIC, then rejects each for concrete workflow reasons: agents need shells, Git, Python, package managers, build tools, and the user’s real checkout. Compared with macOS Seatbelt or Linux seccomp/bubblewrap, Windows lacks the clean default isolation primitive Codex needs. If OpenAI wants Codex living inside the IDE all day, this sandbox work matters as much as another benchmark bump.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

10:40

33d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 10:40 · 05·13

→Miaoda App and Enterprise Edition launch with 90% self-generated code

Baidu launched the Miaoda app and Miaoda Enterprise Edition, saying 90% of the Miaoda app’s code was generated by Miaoda itself; Miaoda-generated apps have served over 10 million users and reached a total value of RMB 5 billion.

#Code#Agent#Baidu#Miaoda

why featured

HKR-H/K/R all pass via the 90% dogfooding hook, concrete adoption/value figures, and coding-tool resonance. Company-source metrics lack independent context, so this stays in the lower featured band.

editor take

Baidu says Miaoda wrote 90% of its own app code, but RMB 5B “app value” lacks a definition; this smells like low-code positioning, not proven dev migration.

sharp

Baidu is selling Miaoda as a self-bootstrapping coding agent, but the suspect number is RMB 5B in “total app value.” The snippet gives three figures: 90% of the Miaoda app’s code was generated by Miaoda, Miaoda-built apps served over 10 million users, and total value reached RMB 5 billion. It does not define whether 90% means lines, commits, modules, or accepted diffs. Pricing, retention, paid seats, and enterprise SLA are also absent. I buy that coding assistants can eat long-tail internal software. Cursor and GitHub Copilot already trained teams to let AI sit inside the dev loop. Baidu’s claim is different: generated-app value, not customer willingness to pay. Without billing or production usage data, the RMB 5B reads like a valuation spreadsheet, not market proof.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

10:06

33d ago

FEATURED · QbitAI (量子位) · WeChat · rss · ZH · 10:06 · 05·13

→ByteDance Proposes Generative Refinement Networks as a Third Route for Visual Generation

ByteDance’s commercial technology team proposed GRN, a visual generation architecture using HBQ, global refinement, and complexity-aware sampling to address quantization loss, error accumulation, and fixed-step inference; on a 130M model, adaptive sampling reduced inference from 50 steps to an average of 24, while gFID changed from 3.56 to 3.79.

#Multimodal#Vision#Inference-opt#ByteDance

why featured

HKR-H/K/R all pass: ByteDance’s GRN has a concrete hook plus 130M, 24-step inference and gFID 3.79. It is a strong research release, not a flagship model launch, so it stays in the 78–84 band.

editor take

GRN’s strongest claim is not the “third path” pitch; it cuts 50 steps to 24 on a 130M model while gFID only moves 3.56→3.79.

sharp

ByteDance’s GRN reads better as an inference-budget paper than a “diffusion killer.” The concrete win is complexity-aware sampling: a 130M model drops fixed 50-step inference to 20–40 steps, averaging 24, while gFID only worsens from 3.56 to 3.79. That is a compute allocation story, not a visual-quality coup. The other numbers are still clean: HBQ hits 0.56 rFID on ImageNet 256 reconstruction, and GRN-G 2B reports 1.81 FID on class-to-image. But the T2V claim is still 480p, 2–10 seconds, and demo-grade 2B territory. That is not in the same operational league as Sora or Veo-style systems. Coming from ByteDance’s commercial tech team, this smells less like academic architecture theater and more like a path to cheaper generation at scale.
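The compute-allocation trade-off is easier to weigh as relative changes. A small arithmetic sketch using only the numbers quoted from the article; the helper name is illustrative, and step count is treated as a proxy for inference cost, which is an assumption.

```python
def relative_change(before: float, after: float) -> float:
    """Signed relative change from `before` to `after` (0.10 means +10%)."""
    return (after - before) / before


# Figures quoted for the 130M model with complexity-aware sampling.
step_change = relative_change(50, 24)      # fixed 50 steps vs. average 24
gfid_change = relative_change(3.56, 3.79)  # gFID, where lower is better

print(f"sampling steps: {step_change:+.1%}")
print(f"gFID: {gfid_change:+.1%}")
```

Roughly half the sampling steps for a mid-single-digit percentage hit to gFID is the whole pitch; whether wall-clock cost falls by the same share depends on per-step cost, which the article does not give.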

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

10:06

33d ago

FEATURED · QbitAI (量子位) · WeChat · rss · ZH · 10:06 · 05·13

→An 8-Year-Old Turns Ideas into Apps as Baidu Launches Miaoda 3.0

Baidu launched Miaoda 3.0 at its 2026 Create conference, adding iOS and Android app generation, Android packaging, online hot updates, and an enterprise edition with three-level permissions, environment isolation, and SLA commitments.

#Agent#Code#Tools#Baidu

why featured

HKR-H/K/R pass: Baidu’s Miaoda 3.0 adds mobile app generation, Android packaging, hot updates, and enterprise controls. This is a solid product update, not a flagship model release or must-write event.

editor take

Miaoda 3.0 pushes AI app builders into mobile packaging and SLA territory, but the 8-year-old demo smells like PR; shipping constraints matter more.

sharp

Baidu is trying to move Miaoda 3.0 out of the demo bucket, and the hard parts are Android packaging, online hot updates, three-level permissions, environment isolation, and SLA commitments. That is the right surface area. AI coding tools often stop at generated code; production work starts at release, rollback, access control, and uptime. The evidence in this piece is thin. It cites a 90,000-eldercare platform, 440,000 daily visitors, and 1.2 million visits, but gives no retention, incident rate, SLA tier, iOS distribution path, or enterprise pricing. Compared with Cursor or Claude Code, Miaoda is betting on app distribution for non-developers, not a stronger developer IDE. The product wins only if generated apps keep running after the conference demo.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

09:42

33d ago

FEATURED · Synced (机器之心) · WeChat · rss · ZH · 09:42 · 05·13

→Lin Junyang Reportedly Starts New AI Lab Seeking $2 Billion Valuation

The Information says Lin Junyang is raising several hundred million dollars for a new AI Lab at a potential $2 billion post-financing valuation, while the lab’s research direction and final valuation remain undisclosed.

#Agent#Robotics#Multimodal#Lin Junyang

why featured

HKR-H/K/R all pass, but the article only gives The Information’s funding rumor and valuation; research focus, team, and product plan are not disclosed. This fits the 72–77 featured band.

editor take

A $2B valuation is pricing Lin Junyang’s personal credit, not a lab. No direction, terms, or core team details yet—too early for mythology.

sharp

A $2B valuation turns Lin Junyang into China’s star-lab test case, but the numbers are still riding on biography. The article gives only “several hundred million dollars” in fundraising, talks with Gaorong and Sequoia, and a few hires from ByteDance, Tencent, and overseas backgrounds. Research direction and final valuation are not disclosed. His Qwen track record is real: 33 years old, Alibaba P10, core driver of the open-source Qwen family. I’m wary of the pricing. DeepSeek already squeezed the valuation logic for closed model startups by making low-cost open models look credible. Without disclosed compute terms, cloud distribution, or a concrete agent/robotics agenda, $2B smells more like founder premium than business value.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

08:31

33d ago

● P1 · r/LocalLLaMA · rss · EN · 08:31 · 05·13

→The Trillion-Parameter Dilemma: MiMo-V2.5-Pro Open-Sourced at 1.02T Parameters

Xiaomi open-sourced MiMo-V2.5-Pro with 1.02T parameters, 42B active parameters, a 1M context window, and an MIT license; the author ran 125 Claude Code sessions through the API, spending $70.12 for 387,380,436 tokens with a 96.3% cache hit rate.

#Agent#Code#Inference-opt#Xiaomi

why featured

HKR-H/K/R all pass: a Xiaomi 1.02T open model plus a concrete Claude Code API cost experiment. Reddit sourcing keeps it at the low end of the 85+ band, but the domestic flagship-model signal clears p1.

editor take

A 1.02T open model is only “free” until you compare it with $70 for 387M API tokens and 96.3% cache hits.

sharp

MiMo-V2.5-Pro makes the open-weight economics look brutal: 1.02T total parameters, 42B active parameters, 1M context, MIT license—and the cited API run processed 387,380,436 tokens across 125 Claude Code sessions for $70.12, with a 96.3% cache hit rate. The issue is not whether you can download the weights. It is whether your local inference stack beats hosted cache economics. Xiaomi gets developer attention, and MIT licensing gives companies room to modify the model. But self-hosting a 1T MoE means paying for memory, routing, concurrency, KV cache, monitoring, and idle capacity. Unless you need compliance isolation, sustained high throughput, or weight-level customization, “open source saves money” gets crushed by this API bill.
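The API bill reduces to a blended per-token rate that any self-hosting plan has to beat. A minimal sketch using only the figures cited from the post; the function name is illustrative, and the result blends cached and uncached tokens, so it is not a price-list rate.

```python
def blended_cost_per_million(total_usd: float, total_tokens: int) -> float:
    """Blended USD cost per one million tokens, cached and uncached combined."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return total_usd / total_tokens * 1_000_000


# Figures reported by the post's author.
TOTAL_USD = 70.12
TOTAL_TOKENS = 387_380_436
SESSIONS = 125

rate = blended_cost_per_million(TOTAL_USD, TOTAL_TOKENS)
cost_per_session = TOTAL_USD / SESSIONS
tokens_per_session = TOTAL_TOKENS / SESSIONS

print(f"blended rate: ${rate:.3f} per 1M tokens")
print(f"per session: ${cost_per_session:.2f} for {tokens_per_session:,.0f} tokens")
```

That works out to roughly 18 cents per million tokens and about 56 cents per session, with the 96.3% cache hit rate doing most of the work; a workload with fewer cache hits would not see the same bill.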

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

08:26

33d ago

FEATURED · Financial Times · Technology · rss · EN · 08:26 · 05·13

→SoftBank profits surge on $25bn gain for OpenAI stake

SoftBank reported $11.6bn in fourth-quarter net income after a $25bn OpenAI stake gain; the post does not disclose valuation details.

#SoftBank#OpenAI

why featured

FT authority plus HKR-H/K/R support featured: the $25bn OpenAI stake gain ties AI valuations to SoftBank earnings, with $11.6bn net profit disclosed. It is not a model or product update, and the valuation basis is not disclosed.

editor take

SoftBank turned an OpenAI mark-up into $11.6bn quarterly profit; that is not operating strength, it is AI private-market leverage hitting the P&L.

sharp

SoftBank’s quarter has thin earnings quality: $11.6bn in net income is being carried by a $25bn gain on its OpenAI stake. The title gives the core numbers; the FT body is paywalled, so valuation method, ownership percentage, and realized-versus-paper treatment are not disclosed. I’m wary of this AI accounting loop. A private OpenAI mark gets converted into public-company profit, long before the model business proves clean cash yield at that scale. SoftBank has lived through this movie with Vision Fund marks, WeWork pain, and Arm upside; OpenAI is the larger, hotter instrument. Without cash exit data or a disclosed secondary price, the $25bn gain reads like a valuation thermometer, not evidence of operating power.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

06:37

33d ago

● P1 · New York Times Chinese · rss · ZH · 06:37 · 05·13

→China Sought Access to Anthropic’s Latest Technology but Was Rejected

Chinese think-tank representatives asked Anthropic in Singapore last month to give Beijing access to Mythos, and Anthropic refused; the company has limited the vulnerability-finding model to the U.S. government and more than 40 organizations.

#Code#Safety#Tools#Anthropic

why featured

HKR-H/K/R all pass: the NYT report gives the Singapore request, Mythos’s bug-finding use, and its US-government-plus-40 access scope. This is a same-day security and US-China AI access story.

editor take

Mythos is being treated like cyber arms control; Anthropic refusing Beijing says more than any safety memo.

sharp

Mythos has crossed into quasi-arms-control territory. Anthropic is not selling a coding model; it is drawing a U.S.-aligned access perimeter. After the April launch, Mythos went only to the U.S. government and more than 40 organizations. Chinese think-tank representatives asked in Singapore last month for Beijing access, and Anthropic refused. That user list is too small to read as normal enterprise gating. The NYT cites U.S. estimates that OpenAI ChatGPT 5.5 and Anthropic Mythos pushed the U.S.-China model gap from about six months to nine-to-twelve months. I don’t fully buy that gap as clean measurement; national-security briefings always carry deterrence theater. But vulnerability discovery changes the product category. DeepSeek adapting to Huawei chips helps the compute story. It does not solve access to a restricted cyber-capability model.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

06:37

33d ago

FEATURED · New York Times Chinese · rss · ZH · 06:37 · 05·13

→Jensen Huang Gets Last-Minute Invitation to Join Trump’s China Trip

Trump called Jensen Huang on Tuesday morning to invite him to join the China trip; the White House’s Monday list of 16 CEOs did not include him, while Nvidia is still seeking approval to sell AI chips to China.

#Inference-opt#Nvidia#Jensen Huang#Donald Trump

why featured

HKR-H/K/R all pass: NYT reports a last-minute Jensen Huang invite tied to Nvidia’s China AI-chip license push. No disclosed policy change or license outcome, so this stays near the featured threshold.

editor take

Jensen’s last-minute Air Force One invite says Nvidia’s China license fight has moved from Commerce paperwork to summit bargaining.

sharp

Jensen Huang getting added to Air Force One at the last minute is not protocol noise. Nvidia has pushed its China chip license fight onto the summit table. The White House published 16 CEOs on Monday without him. Trump called Tuesday morning, then Huang boarded during the Alaska stop that evening. The prize is not one shipment. It is Nvidia’s legal right to stay inside China’s AI stack. The article says Trump approved sales of prior-generation Nvidia chips last summer and even planned to take a cut from those sales, but Beijing has not approved purchases. Both sides are treating downgraded export-compliant chips as leverage: Washington fears compute leakage, Beijing fears dependence on a U.S.-controlled supply chain. Huang on the plane says the compliant-SKU workaround is no longer enough.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

06:19

33d ago

● P1 · AI HOT (Curated Pool) · aihot-api · ZH · 06:19 · 05·13

→SenseTime releases SenseNova-U1 technical report and open-source model

SenseTime released the SenseNova-U1 technical report, covering six-stage training, RL post-training, and distillation; the open-source SenseNova-U1-A3B-MoT uses an MoE architecture and activates only 3 billion parameters.

#Multimodal#Vision#Fine-tuning#SenseTime

why featured

HKR-H/K/R all pass: A3B-MoT’s 3B active parameters and six-stage training recipe give concrete signal. The score stays near the featured floor because this is a vendor post with no benchmarks, license terms, or reproduction details disclosed.

editor take

Only the titles are available: SenseTime released a SenseNova-U1 report and open weights, but no size, license, or evals. I’d treat this as China multimodal positioning, not proof yet.

sharp

Two sources align: SenseTime released the SenseNova-U1 technical report and opened model weights based on an MoE architecture. The body is empty, so model size, license, training mix, and benchmarks are not disclosed. I’d discount the launch for now. Native multimodal plus MoE is the right architectural lane, but open-weight credibility in 2026 is no longer earned by publishing weights alone. It needs reproducible numbers on MMMU, Video-MME, MathVista, OCRBench, and direct pressure against Qwen2.5-VL, InternVL, and DeepSeek-adjacent tooling. The headline leans hard on “construction guide,” which smells like a developer-mindshare play. Without eval tables or usage terms, SenseNova-U1 is a positioning move, not yet a model practitioners can safely plan around.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

05:37

33d ago

FEATURED · New York Times Chinese · rss · ZH · 05:37 · 05·13

→China Seeks AI Technology Self-Reliance, Weakening Washington’s Leverage Over Beijing

DeepSeek optimized its latest model for inference on Huawei chips for the first time, while two semiconductor sources said training still relies on Nvidia chips; Huawei says it plans to release a training chip this year, but matching current Nvidia performance will take another year.

#Inference-opt#DeepSeek#Huawei#Nvidia

why featured

HKR-H/K/R all pass: NYT ties DeepSeek-Huawei chip optimization and Huawei's training-chip timeline to US export-control leverage. It is not a model launch and lacks benchmark results, so it stays in the 78–84 band.

editor take

DeepSeek can run inference on Huawei chips, but training still sits on Nvidia; China is peeling off the serving layer first, not declaring chip independence.

sharp

DeepSeek’s hard move is carving out inference, not escaping Nvidia. The article gives two concrete hooks: its latest model is optimized for Huawei chips for inference, while two semiconductor sources say training still relies on Nvidia. Serving is the daily cost and deployment surface; training remains the frontier choke point. Beijing gets usable autonomy, not a closed loop. Huawei’s own timeline is restrained: a training chip is planned this year, and matching current Nvidia performance needs another year. The H200 approval also looks hollow if Nvidia has booked no China revenue from it. Washington can still pressure training, but China is moving the model-chip-app stack into a domestic inference lane. It is messy, but it chips away at Nvidia’s default position in China.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

04:06

33d ago

FEATURED · AI Era (新智元) · WeChat · rss · ZH · 04:06 · 05·13

→Tsinghua-affiliated team open-sources MiniCPM-V 4.6, a 1.3B model tunable on one RTX 4090

ModelBest, Tsinghua University, and OpenBMB open-sourced MiniCPM-V 4.6, a 1.3B multimodal model that supports full fine-tuning on one RTX 4090 and offers 4x/16x visual token compression for accuracy or speed trade-offs.

#Multimodal#Vision#Fine-tuning#ModelBest

why featured

HKR-H/K/R all pass: the story gives a concrete open-source multimodal release with size, hardware condition, and token-compression details. It lowers local fine-tuning cost, but it is not a frontier-lab flagship release, so 78–84 fits.

editor take

MiniCPM-V 4.6’s punch is full fine-tuning on one RTX 4090; if 4x/16x compression holds, edge VLM costs take a real hit.

sharp

MiniCPM-V 4.6 attacks the edge VLM bottleneck at the hardware level, not just the leaderboard layer. The useful claims are concrete: 1.3B parameters, full fine-tuning on one RTX 4090, 4x/16x visual-token compression, 2.2x faster TTFT on 3136² images, and 5.4M tokens consumed on Artificial Analysis versus 101M for Qwen3.5-0.8B non-reasoning. I buy the direction: early visual-token compression is exactly where small multimodal models need to compete. I don’t buy the victory lap yet. The article does not give the full benchmark table, accuracy-loss curve, or the RTX 4090 fine-tuning settings for batch size and resolution. Practitioners should test OCR-heavy documents and high-res VQA first; that is where this model either becomes useful or turns into another nice demo.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

00:39

33d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 00:39 · 05·13

→Google launches its first AI-first laptop Googlebook with Gemini integration

Google launched Googlebook, its first laptop designed around Gemini Intelligence, with three disclosed mechanisms: Magic Pointer as an AI interaction entry point, natural-language widget creation, and Android-based cross-device app and file access.

#Agent#Tools#Google#Gemini

why featured

HKR-H/K/R all pass: a Google Gemini-first laptop with 3 named interaction mechanisms. Specs, pricing, launch timing, and demos are not disclosed, so it stays in the 78–84 band.

editor take

Googlebook puts Gemini into the pointer and widgets; smart surface choice, but no price, silicon, or offline story makes it feel like an entry-point claim.

sharp

Googlebook’s bet is clear: the AI entry point moves from chat into the pointer, widgets, and Android file flow. The three named mechanisms—Magic Pointer, Create Your Widget, and cross-device Android access—are better surfaces than another Gemini app. They hit selection, desktop state, and phone-app handoff, where users already act. I buy the surface choice, not the “AI-first laptop” claim yet. The snippet gives no price, silicon, NPU spec, battery life, offline Gemini behavior, or enterprise controls. Windows Copilot+ PCs already ran the local-AI hardware story, and Chromebooks have long struggled with high-end productivity perception. If Googlebook is mostly a Gemini UI layer, it becomes a Chromebook with smarter pop-ups.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

00:00

33d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 00:00 · 05·13

→Claude Code adds /goal feature to keep tasks running until completion

Claude Code introduced a /goal feature that keeps Claude working until a task is completed; the post does not disclose the trigger mechanism, supported versions, pricing, or failure conditions.

#Agent#Code#Tools#Anthropic

why featured

HKR-H/K/R pass because /goal targets a real Claude Code reliability pain. It is a single-feature Anthropic update with sparse mechanics, so it lands at the lower featured band, not same-day major news.

editor take

Claude Code’s /goal has a clean name, but without failure rules or mechanics, it smells like “keep going” sold as task completion.

sharp

Claude Code’s /goal sells persistence before it proves completion. The title says Claude keeps working until the task is done, but the snippet gives no trigger mechanism, supported versions, pricing, timeout, rollback path, or failure condition. Coding agents do not fail because they stop too early. They fail because they keep editing after losing the thread. Cursor, Windsurf, and Codex-style CLIs all hit the same wall: longer loops amplify bad assumptions. If Anthropic ships /goal as a command without auditable stop criteria, it is great demo language and shaky production behavior.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1

00:00

33d ago

● P1 · OpenAI Blog · rss · EN · 00:00 · 05·13

→OpenAI responds to TanStack npm supply chain attack affecting staff devices

OpenAI described its response to the TanStack “Mini Shai-Hulud” npm supply-chain attack, including protections for systems and signing certificates, and said macOS users must update OpenAI apps by June 12, 2026.

#Safety#OpenAI#TanStack#Incident

why featured

HKR-H/K/R pass: an official OpenAI security response names the TanStack npm attack and a June 12, 2026 macOS update deadline. Scope and technical detail are not disclosed, so it stays near the featured floor.

editor take

OpenAI disclosed that employee devices were hit by the TanStack npm supply chain attack, exposing code-signing certificates and forcing a mandatory macOS app update by June 12.

sharp

This is OpenAI’s own disclosure, and the other source just relays it, so the facts are consistent. The attack path is straightforward: the TanStack open-source library was compromised, the malicious package landed on two employee devices, and the attacker reached internal repos containing code-signing certificates. OpenAI says no customer data was touched and no misuse of the certificates has been found, but it is revoking the old certs on June 12 anyway. Mac users who don’t update will have their apps blocked by the OS. I’d take the “no customer data impact” claim with a grain of salt. OpenAI hired a third-party forensics firm and confirmed credential exfiltration, but says only that “limited credential material” was taken, without specifying what. Code-signing certificates are high-value credentials on their own. If the attacker grabbed them and didn’t use them, either they ran out of time or their goal wasn’t signing malware. What’s missing is the forensics report and a list of affected repos; OpenAI’s word alone isn’t enough to close the book on this.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

→ open source

SCORE

H1·K1·R1
