all posts


2026-07-16 · Thu

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 02:00 · 07·16

→Tiangong Short Drama Workbench launches dual-track creation with Agent smart storyboarding and infinite canvas

Tiangong Short Drama Workbench uses a director Agent to auto-parse scripts, plan blocking and camera positions, targeting the persistent face-swap and position-drift issues in AI short dramas. It packs film-grade prompt templates, 720° panoramas, and a 3D director console for controllable production. Three works already launched on DramaWave, hitting seven-figure USD revenue in 7 days.

#Vision#天工短剧工作台#DramaWave

why featured

Featured · importance 72 · hook + knowledge

editor take

Tiangong's director Agent tackles face-swap and position-drift in AI short dramas, with works already hitting $1M+ in 7 days on DramaWave.

sharp

The reason this is worth a click: it productizes the two biggest headaches in AI short drama—face inconsistency and position drift. The director Agent auto-parses scripts, plans blocking and camera angles, then generates multi-view detail shots. The idea is moving consistency control from post-production fixes to pre-production planning, which is a smarter workflow than pure black-box generation. The $1M+ in 7 days on DramaWave is a solid number—it signals real paying demand for AI-generated short drama overseas. But the post only gives us an RSS snippet. No details on the Agent's technical approach, how character consistency holds up across multi-episode arcs, or how much human intervention went into those three works. I'd discount this a bit: short dramas have lower consistency demands than feature-length content, and we can't tell if this approach scales to more complex narratives.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 72 · H1·K1·R0

AI HOT (Curated Pool) · aihot-api · ZH · 01:36 · 07·16

→MiniMax Code 2.0 desktop app rebuilt, finance module coming soon

The MiniMax Code 2.0 desktop app has been rebuilt on the Pi Agent framework. Session startup is faster and long-running tasks are more stable. Chart loading and file preview editing have improved. The finance module isn't live yet but is already connected to the Hundsun financial database and the Qichacha MCP; it will support multi-source data retrieval and report generation. Remote control and browser manipulation are due this month.

#Code#MiniMax#恒生#企查查

editor take

MiniMax Code 2.0 desktop rebuilt on Pi Agent framework — faster startup, stable long tasks, finance module wired to Hundsun and Qichacha, but no launch date yet.

HKR breakdown

hook — · knowledge ✓ · resonance —

SCORE 68 · H0·K1·R0

Hacker News Frontpage · rss · EN · 01:21 · 07·16

→Bluesky acquires AT Protocol trademark to protect the ecosystem from legal abuse

Bluesky bought the trademark for 'AT Protocol' and 'atproto' from a company that was threatening legal action. Now that Bluesky owns it, the community can keep using the names freely. Commercial use as a brand or product name requires a license. Everyday use (docs, open-source tools, saying your app is compatible) does not. Bluesky calls it a defensive move with no licensing fees, and plans to transfer the mark to an independent governance body later. The post doesn't disclose the purchase price.

#Bluesky#AT Protocol#Open source

editor take

Bluesky bought the AT Protocol trademark from a company threatening to sue; community use stays free, commercial use needs a license.

HKR breakdown

hook — · knowledge — · resonance —

SCORE 55 · H0·K0·R0

Product Hunt · AI · rss · EN · 00:40 · 07·16

→River: AI account executives demo and close B2B deals in real time

River replaces B2B sales reps with voice AI. When a lead inquires, the AI joins a live call instantly, runs the demo, handles objections, and closes. Founder Tarek says he used to do the same demo 15 times a day, so he cloned himself. Backed by founders of Ramp, Kalshi, and Vercel. The post doesn't disclose pricing, customer case studies, or actual close rates.

#River#Ramp#Kalshi

editor take

River's voice AI jumps on live calls to demo and close B2B deals, but no pricing or close rates disclosed yet.

HKR breakdown

hook ✓ · knowledge — · resonance ✓

SCORE 55 · H1·K0·R1

AI HOT (Curated Pool) · aihot-api · ZH · 00:33 · 07·16

→xAI's open-source Grok CLI codebase yields a Mermaid-to-Unicode box art tool

Simon Willison found a Rust-based Mermaid diagram terminal renderer in xAI's newly open-sourced Grok CLI codebase. He compiled it to WebAssembly and built a browser tool: write Mermaid source on the left, see Unicode box art on the right, with copy-as-text and link-sharing. The post doesn't clarify whether Grok CLI itself uses this renderer.

#Code#xAI#Simon Willison#Grok CLI

editor take

Simon Willison found a Rust Mermaid terminal renderer in Grok CLI's codebase and turned it into a browser tool via WASM.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 62 · H1·K1·R0

AI HOT (Curated Pool) · aihot-api · ZH · 00:12 · 07·16

→Remote Agent Control: Codex as Main, UU Remote as Backup

The author shares a remote Agent setup: Codex's remote control as the main tool, connected via ChatGPT App to a 24/7 Mac Mini at home, syncing tasks, rules, and Agent memory. For scenarios like QR code login or GUI operations that Codex struggles with, NetEase UU Remote lets you control the full desktop from a phone. UU Remote is free, supports multi-device collaboration, and requires no LAN or public network setup.

#Codex#ChatGPT#Mac Mini

editor take

Remote Agent setup: Codex handles tasks, UU Remote covers QR codes and GUI ops—free, no config needed.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 60 · H1·K1·R0

● P1 · Hugging Face Blog · rss · EN · 00:00 · 07·16

→Hugging Face discloses an end-to-end autonomous AI agent intrusion into its production infrastructure

On July 16, Hugging Face disclosed that an autonomous AI agent system breached its production infrastructure through a malicious dataset. The attacker exploited remote-code loading and template injection in the dataset pipeline, escalated to node-level access, harvested cloud and cluster credentials, and moved laterally across internal clusters over a weekend. The campaign involved tens of thousands of automated actions with self-migrating C2 on public services. Hugging Face closed the initial vulnerability, rotated credentials, rebuilt compromised nodes, and tightened cluster admission controls. No tampering with public models, datasets, or Spaces was found; the software supply chain was verified clean. The post does not specify which LLM the attacker used or whether any partner/customer data was affected.

#Agent#Hugging Face#Incident

why featured

Featured · importance 92 · hook + knowledge + resonance

editor take

An autonomous AI agent breached HF's production infra via a malicious dataset, stole credentials, and moved laterally—public models and supply chain are clean.

sharp

This isn't just another platform breach—the attack method itself is the story. An autonomous agent framework exploited two bugs in HF's dataset pipeline (remote-code loading and template injection) to execute code on a processing node, escalate privileges, harvest cloud and cluster credentials, and move laterally across internal clusters over a weekend. Tens of thousands of automated actions, with self-migrating C2 on public services. HF says public models, datasets, and Spaces weren't tampered with, and the software supply chain checked out clean. The post doesn't name the LLM the attacker used, and it's still assessing whether partner or customer data was affected. I'd treat this as a milestone: we've been talking about agentic attacks for a while, but this is a real case of an AI agent autonomously running a full kill chain—exploitation, privilege escalation, lateral movement, C2 migration. HF also used AI for detection and forensics, which is the other side of the same coin. The missing pieces are motive and data exposure scope; those will determine how bad this actually was.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 92 · H1·K1·R1

FEATURED · Computing Life · Share (鸭哥 research reports) · rss · ZH · 00:00 · 07·16

→Cloudflare Precursor: When AI agents use real browsers, bot detection shifts to session-long behavior

Cloudflare launched Precursor on July 13, 2026, a session-level behavior detection system inside Enterprise Bot Management. Instead of relying on browser fingerprints or single-request signals, it continuously collects pointer movement, keystroke timing, focus changes, and page visibility, then cross-checks whether the whole session is internally consistent. The shift matters because Playwright-driven agents now run inside real browsers, making old shortcuts like checking navigator.webdriver unreliable. Precursor injects scripts into HTML responses, compresses event streams, and evaluates them at the edge, updating Bot Score and challenge clearance as behavior changes. Cloudflare states it does not record actual keystrokes or bind signals to user identities. The post also covers the FP-Agent paper, where an XGBoost classifier distinguished seven browser agents in a controlled lab setting, but notes that doesn't translate to production accuracy. Independent accuracy rates, false-positive data by user group, and accessibility testing are not yet public. Official materials conflict on rollout scope. Behavior detection alone doesn't solve agent identity, user delegation, or business authorization—Precursor only addresses runtime risk.

#Cloudflare#Precursor#FP-Agent

why featured

Featured · importance 78 · hook + knowledge + resonance

editor take

Cloudflare Precursor shifts bot detection from single fingerprints to session-long behavioral consistency, because Playwright-driven agents now run inside real browsers.

sharp

The useful bit here is the problem shift it names: Playwright-driven agents run in real Chrome with normal fingerprints, so checking `navigator.webdriver` no longer cuts it. Precursor stretches detection across the whole session—pointer movement, keystroke timing, focus changes, page visibility—and cross-checks whether these signals explain each other. A cursor moving while the page is hidden, or keyboard events on an unfocused field, is harder to fake than a single property. I'd discount the accuracy claims for now. Cloudflare hasn't published independent accuracy rates, false-positive data by user group, or accessibility testing results. The FP-Agent paper got near-perfect classification on seven agents in a controlled lab with 56 students and fixed software versions—that doesn't translate to production. Precursor also only addresses runtime behavior consistency. It doesn't touch agent identity, user delegation, or what an agent is authorized to do.
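The cross-checking idea can be sketched in a few lines. This is a toy illustration only, not Cloudflare's implementation: the `Event` shape, the event names, and `find_inconsistencies` are all hypothetical, and a real system would score many more signals statistically rather than apply two hard rules.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Event:
    t_ms: int         # milliseconds since session start (hypothetical field)
    kind: str         # "visibility", "focus", "blur", "pointermove", "keydown"
    target: str = ""  # element id, or "hidden"/"visible" for visibility events

def find_inconsistencies(events):
    """Return (t_ms, reason) pairs where session signals contradict each other."""
    visible = True   # assume the page starts visible
    focused = None   # id of the element that currently has focus
    problems = []
    for ev in sorted(events, key=lambda e: e.t_ms):
        if ev.kind == "visibility":
            visible = ev.target == "visible"
        elif ev.kind == "focus":
            focused = ev.target
        elif ev.kind == "blur":
            if focused == ev.target:
                focused = None
        elif ev.kind == "pointermove" and not visible:
            problems.append((ev.t_ms, "pointer moved while page hidden"))
        elif ev.kind == "keydown" and ev.target != focused:
            problems.append((ev.t_ms, "keydown on unfocused field"))
    return problems

# A made-up session: typing in a focused field is fine, the last two events are not.
session = [
    Event(0, "focus", "email"),
    Event(10, "keydown", "email"),
    Event(20, "visibility", "hidden"),
    Event(30, "pointermove"),
    Event(40, "keydown", "password"),
]
flags = find_inconsistencies(session)
```

The point of the sketch is that each flagged event is only suspicious in the context of earlier events, which is why a single-request check cannot catch it.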

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 78 · H1·K1·R1

FEATURED · Computing Life · Share (鸭哥 research reports) · rss · ZH · 00:00 · 07·16

→Tencent Hunyuan Hy3 1-bit quantization: fits on one GPU, but is it usable?

Tencent released IQ1_M quants for the 295B Hy3 MoE model, fitting into a single 96GB GPU at ~85.5 GiB. Actual BPW is ~2.4, not 1-bit. With 64K q8 KV cache, only ~2 GiB headroom remains—official config is marked tight. Agent and coding abilities dip slightly vs BF16; multi-step reliability is untested. Community GGUF hits ~23–25 tok/s on M3 Max, but 8K prompt prefill takes ~200s. Worth experimenting if you already own the hardware, not a reason to buy it.

#腾讯混元#Hy3#AngelSlim

why featured

Featured · importance 74 · hook + knowledge + resonance

editor take

Hy3's 295B model fits a 96GB GPU at 85.5 GiB, but 64K context leaves only 2 GiB headroom and agent performance dips.

sharp

The headline number is seductive: a 295B model on a single GPU. Tencent shipped IQ1_M quants for Hy3 at ~85.5 GiB, which does fit a 96GB card. But "1-bit" is the quantization format name—actual BPW is ~2.4, with attention and embedding layers kept at higher precision. Fitting isn't the same as running comfortably. Add 64K q8 KV cache and you're at ~94 GiB, with only 2 GiB left for CUDA context and compute buffers. Tencent marks this config as tight and hasn't published peak memory or speed logs. The capability tradeoff matters more. Official notes say agent and coding abilities dip slightly vs BF16. In multi-step tasks, one deviation cascades—and there's no public end-to-end success rate data yet. Community GGUF hits ~23–25 tok/s on M3 Max, but 8K prompt prefill takes ~200 seconds. If you already own a 96GB GPU or high-spec Mac, pull it down and experiment. If you're thinking of buying hardware for this model, I'd wait—Q4_K_M reliability benchmarks and longer-context real-world numbers aren't here yet.
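The memory arithmetic above is easy to check by hand. The sketch below uses only the figures quoted in this item (295B parameters, ~85.5 GiB of weights, ~94 GiB with the 64K q8 KV cache, a 96GB card). It assumes the card's capacity can be compared directly with the GiB figures, as the item does, and the bits-per-weight number is a whole-file average that ignores any metadata in the file.

```python
GIB = 2**30          # bytes per GiB
PARAMS = 295e9       # parameter count quoted for Hy3
WEIGHTS = 85.5       # quantized weights, as quoted (~85.5 GiB)
WITH_KV = 94.0       # weights plus 64K q8 KV cache, as quoted (~94 GiB)
CARD = 96.0          # card capacity, treated in the same unit as the figures above

def bits_per_weight(size_bytes, n_params):
    """Average bits stored per parameter for a file of the given size."""
    return size_bytes * 8 / n_params

bpw_if_gib = bits_per_weight(WEIGHTS * GIB, PARAMS)  # reading 85.5 as GiB
bpw_if_gb = bits_per_weight(WEIGHTS * 1e9, PARAMS)   # reading 85.5 as decimal GB
kv_cache = WITH_KV - WEIGHTS                         # implied KV cache size
headroom = CARD - WITH_KV                            # what is left for buffers

print(f"bits per weight: {bpw_if_gb:.2f} to {bpw_if_gib:.2f}")
print(f"KV cache: {kv_cache} GiB, headroom: {headroom} GiB")
```

Either reading of the size lands between roughly 2.3 and 2.5 bits per weight, in line with the ~2.4 figure and well above 1 bit, and the implied KV cache is about 8.5 GiB.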

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 74 · H1·K1·R1

Computing Life · Share (鸭哥 research reports) · rss · ZH · 00:00 · 07·16

→MCP Elicitation: How tools pause mid-execution to ask a human

MCP tool calls assume all parameters are ready upfront. Real workflows don't work that way—servers discover missing info mid-task. Elicitation gives MCP a reverse channel: the server pauses, sends a structured question through the client, and resumes with the user's response. The spec splits this into form (non-sensitive input) and URL (OAuth, payments), with separate data paths. Cloudflare's July 2026 Agent SDK update added client-side handling, but support varies: VS Code, Claude Code CLI, Kiro, and Codex handle both modes; Cursor only confirms form; OpenCode hasn't declared it yet. The protocol exists—client UX still lags.
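The pause, ask, resume flow can be modeled with a Python generator. This is a conceptual sketch, not the MCP wire format or any SDK's API: `book_table`, `run_tool`, and the dictionary fields are hypothetical names chosen for illustration.

```python
def book_table(party_size, date=None):
    """Hypothetical tool: pauses to ask for a missing date, then resumes."""
    if date is None:
        # Execution stops here until the client sends back the user's answer.
        answer = yield {
            "mode": "form",  # non-sensitive input, per the form/URL split
            "message": "Which date should I book?",
            "schema": {"date": "string"},
        }
        if answer.get("action") != "accept":
            return {"status": "cancelled"}
        date = answer["content"]["date"]
    return {"status": "booked", "party_size": party_size, "date": date}

def run_tool(tool, ask_user):
    """Hypothetical client loop: relays each question and feeds the reply back."""
    try:
        question = next(tool)  # run until the first question or completion
        while True:
            question = tool.send(ask_user(question))
    except StopIteration as done:
        return done.value      # the tool's final result

booked = run_tool(
    book_table(4),
    lambda q: {"action": "accept", "content": {"date": "2026-08-01"}},
)
declined = run_tool(book_table(4), lambda q: {"action": "decline"})
```

The tool author writes straight-line code, and the client decides how to render the question, which is where the uneven client support described above shows up.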

#MCP#Cloudflare#VS Code

editor take

MCP servers can now pause mid-task, ask a human, and resume—but client support is still patchy.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 72 · H1·K1·R0

2026-07-15 · Wed

Product Hunt · AI · rss · EN · 23:37 · 07·15

→Weave: Think out loud and watch it become a living map.

Weave is a voice-powered whiteboard that turns your spoken thoughts into a live, interactive map. The map reshapes as you change your mind and asks questions you haven't considered. It captures meetings, replays how a thought unfolded, and lets you share a link or export. Free to start, no credit card required. The post doesn't disclose the underlying model or latency details.

#Weave#Product Hunt#Emmanuel Adesola

editor take

Weave turns speech into a live mind map that reshapes as you talk, but the post doesn't disclose the model or latency—keep expectations in check.

HKR breakdown

hook ✓ · knowledge — · resonance —

SCORE 55 · H1·K0·R0

FEATURED · NVIDIA Blog · rss · EN · 23:00 · 07·15

→NVIDIA launches Jetson Thor T3000 and T2000, bringing Blackwell to mainstream robotics and edge AI

NVIDIA announced two new Thor-based modules: T3000 (865 FP4 teraflops, 32GB memory, 273GB/s bandwidth) at roughly half the size and power of T5000, and T2000 (400 FP4 teraflops, 16GB) for broader edge AI. New Jetson agent skills automate memory optimization—some customers saved up to 15GB and moved to lower-memory SKUs. Cosmos 3 Edge, a 4B-parameter world model, runs on-device on Thor for real-time vision and robot policies. The post does not disclose pricing or ship dates for T3000/T2000.

#Robotics#NVIDIA#Jetson Thor#T3000

why featured

Featured · importance 72 · hook + knowledge

editor take

T3000 delivers multimodal inference speed similar to T5000's at roughly half the size and power, cutting hardware cost amid high memory prices, but there is no pricing or ship date yet.

sharp

The reason to click: Jetson Thor is finally reaching down into more practical form factors. T3000 packs 865 FP4 teraflops, 32GB memory, and 273GB/s bandwidth into roughly half the size and power of T5000, while delivering similar multimodal inference speed. For robotics and edge vision teams, that means fitting the same compute into a smaller chassis or saving on hardware when memory prices are high. T2000 at 400 teraflops and 16GB looks like the volume play for entry-level AMRs and industrial manipulators. The software side is just as interesting: new Jetson agent skills automate memory optimization across the stack. Some customers freed up 15GB and dropped from 64GB to 32GB modules without losing performance. Cosmos 3 Edge, a 4B-parameter world model, now runs locally on Thor for real-time vision and robot policies. I'd hold off on purchase decisions though. The post doesn't disclose pricing or ship dates for T3000/T2000, and it doesn't specify which models or batch sizes were used in the T5000 comparison. Treat this as a roadmap signal for now.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 72 · H1·K1·R0

AI HOT (Curated Pool) · aihot-api · ZH · 22:49 · 07·15

→Open-source memory system for coding agents syncs via SSH

An open-source memory project for coding AI agents launched on GitHub, enabling cross-session context retention via SSH sync without relying on specific cloud services. Users can self-host. The post does not disclose implementation details or performance benchmarks.

#GitHub

editor take

Deja Vu is an open-source memory layer for coding agents that syncs context across sessions via SSH, no cloud required.

HKR breakdown

hook ✓ · knowledge — · resonance —

SCORE 62 · H1·K0·R0

Hacker News Frontpage · rss · EN · 22:23 · 07·15

→LLM Networking with MikroTik: Practical Tips and Pitfalls

Greg shares months of hands-on experience using LLMs (Claude, etc.) to configure MikroTik networking gear. Key takeaway: LLMs are a chaotic force multiplier for network setup, but you must mistrust and verify. He recommends using the REST/JSON API over SSH, because SSH leads to 'death by a thousand cuts' when LLMs pipe text back and forth. Tips include: dump full config before every change, use CAPsMAN for multi-AP setups, cross-check configs with multiple LLMs, and have a tested recovery runbook. He strongly recommends MAC-Telnet for when IP addresses conflict—LLMs can talk to devices over L2 telnet. The post doesn't specify model versions or latency numbers, but stresses 'minimize tasks, test after every change.'
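The "dump full config before every change, test after every change" habit can be made mechanical. The sketch below is one possible way to do it, not Greg's tooling: it takes two config dumps as plain strings (however they were fetched) and checks that an LLM-driven change touched only the lines you expected. The helper names and the sample config lines are made up for illustration.

```python
import difflib

def config_diff(before, after):
    """Return lines added ('+ ') or removed ('- ') between two config dumps."""
    return [
        line
        for line in difflib.ndiff(before.splitlines(), after.splitlines())
        if line.startswith(("+ ", "- "))
    ]

def verify_change(before, after, expected):
    """True only if the change touched exactly the expected lines."""
    return sorted(config_diff(before, after)) == sorted(expected)

# Made-up dumps taken before and after asking the LLM to add one address.
before = "/ip address\nadd address=192.168.88.1/24 interface=bridge\n"
after = before + "add address=10.0.0.1/24 interface=ether2\n"
expected = ["+ add address=10.0.0.1/24 interface=ether2"]

ok = verify_change(before, after, expected)
# If the LLM also rewrote an existing line, the check fails.
tampered = after.replace("192.168.88.1", "192.168.88.2")
bad = verify_change(before, tampered, expected)
```

Keeping the "before" dump around also gives the recovery runbook something concrete to restore from.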

#Code#MikroTik#Claude#Greg Sadetsky

editor take

Greg's months of MikroTik config with Claude: LLMs are a chaotic force multiplier—speed up but mistrust and verify every step.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 62 · H1·K1·R0

Hacker News Frontpage · rss · EN · 21:39 · 07·15

→MIT paper: AI valuation bubble can leave a permanent capital legacy even after it corrects

MIT economist Ricardo Caballero posted a working paper on July 15 offering a third reading of AI valuations: a temporary overvaluation can leave a permanent real legacy even after prices correct. AI capital substitutes for labor, shifts income toward high-saving capital owners, and lowers the long-run interest rate, creating a self-sustaining high-capital steady state. Rational pricing from the low-capital state won't get there; a belief-supported valuation is needed to accelerate investment. If Bayesian learning arrives late enough, enough capital is installed and the economy lands in the high-capital state. If learning arrives too soon, the transition collapses. Workers get higher wages at the destination despite a lower labor share; capitalists bear the valuation correction. The model combines q-theory investment, wealth-in-utility preferences, and Bayesian learning. The paper cites Fortune, Goldman Sachs, and McKinsey estimates on AI data-center and compute investment.

#MIT#Ricardo J. Caballero#NBER

editor take

An MIT working paper argues AI overvaluation can leave a permanent real legacy—more capital, higher wages—even after prices correct.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 72 · H1·K1·R0

FEATURED · The Verge · AI · rss · EN · 21:33 · 07·15

→xAI sues a man for using Grok to generate CSAM deepfakes

xAI filed a federal lawsuit against Terry Harwood, accusing him of bypassing Grok's safeguards to generate CSAM deepfakes. The company claims Harwood used prompt injection and other methods, and is seeking reputational and legal damages. It's a rare case of an AI company proactively suing a user for generating illegal content, though the post doesn't disclose the specific techniques or volume of images produced.

#Vision#xAI#Grok#Terry Harwood

why featured

Featured · importance 78 · hook + knowledge + resonance

editor take

Rare case of an AI company suing a user for generating illegal content, but the post doesn't disclose the specific techniques or volume.

sharp

This one's worth opening because an AI company proactively suing a user for generating CSAM deepfakes is genuinely rare. xAI claims Terry Harwood used prompt injection to bypass Grok's safeguards and is seeking reputational and legal damages. The post is thin on details though: no word on the specific injection method, how many images were produced, or how xAI detected it. Legally, these 'user abuse' cases are tricky — Section 230 usually shields platforms from user-generated content liability, but here xAI is flipping the script and going after the user directly, which feels like testing a different legal path. I'd read this as a signal that AI companies are using lawsuits to say 'safety isn't optional,' but the real impact depends on how the court rules.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 78 · H1·K1·R1

FEATURED · Hacker News Frontpage · rss · EN · 21:19 · 07·15

→Two groups of friends made the same AI wedding video

At a wedding, the bride's friends and the groom's friends each made an AI-generated tribute video. Both videos ended up nearly identical—same voiceover cadence, same drone shots of beaches and forests, same National Geographic-style narration. The crowd loved the coincidence. The author argues everyday creativity is regressing to a mean, but doesn't settle on whether the driver is fear of making something bad or just taking the easy path.

#Vision

why featured

Featured · importance 72 · hook + knowledge + resonance

editor take

Two groups used AI for wedding tribute videos and produced nearly identical results—same voiceover, drone shots, and Nat Geo tone.

sharp

This one's worth a click because it puts the abstract "AI makes everything samey" complaint into a concrete wedding scene. The bride's friends and the groom's friends, without coordinating, each made an AI-generated tribute video. Different stories—one about gifts, one about the love story—but the outputs were nearly identical: same voiceover cadence, same drone shots over beaches and forests, same National Geographic-style narration. The crowd loved the coincidence. The author doesn't settle on a conclusion, just asks whether we reach for AI out of fear of making something bad or pure convenience. I'd say what collided here wasn't creativity—it was the default "heartwarming video" template baked into the training data.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 72 · H1·K1·R1

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 21:07 · 07·15

→xAI open-sources Grok Build coding agent and terminal UI

xAI released the full Grok Build codebase on GitHub, covering the agent loop, tool dispatch, terminal UI, and extension system. You can read the source to see how context assembly and tool calls work, or compile it yourself and point it at a local inference setup.

#Code#Agent#xAI#Grok Build

why featured

Featured · importance 82 · hook + knowledge + resonance

editor take

xAI open-sourced the full Grok Build coding agent—agent loop, TUI, extensions—compile it locally and run fully offline.

sharp

This is worth a click because it's a full-stack open-source drop, not just model weights. The entire engineering skeleton of a production coding agent is on GitHub: how the agent loop assembles context, parses model responses, and dispatches tool calls; how the terminal UI handles rendering and inline diffs; how skills, plugins, hooks, MCP servers, and subagents get loaded and invoked. The practical bit: compile it yourself, point it at a local inference setup, and run it fully offline. That's genuinely useful if you're trying to understand how a coding agent works under the hood, or if your team needs a controllable in-house coding assistant. What's missing: the post doesn't include any benchmark numbers—no SWE-bench scores, no comparison to other coding agents. It also doesn't say what model size you'd need to run locally for decent results. I'd treat this as a high-quality agent architecture reference first, and hold off on performance comparisons with closed-source coding tools.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 82 · H1·K1·R1

Hacker News Frontpage · rss · EN · 21:00 · 07·15

→Someone compiled Firefox's Gecko engine to WebAssembly, running full Firefox inside a browser tab

Puter's team compiled Firefox's Gecko engine to WebAssembly, running a full Firefox UI inside a browser tab. Rendering uses WebGL for GPU acceleration, plus an experimental JS-to-WASM JIT compiler. Web traffic is proxied through a Puter-hosted Wisp server. The post doesn't disclose performance overhead, memory usage, or JIT stability data—this looks like a tech demo, not daily-driver ready yet.

#Puter#Gecko#Firefox

editor take

Firefox's Gecko engine compiled to WASM, running in a browser tab. It's a tech demo—the post doesn't disclose performance, memory, or JIT stability.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 72 · H1·K1·R0

AI HOT (Curated Pool) · aihot-api · ZH · 20:48 · 07·15

→Grok Build is now open source

Elon Musk announced that Grok Build is now open source. The post does not disclose which modules are included, the license type, or version details—only the title is confirmed.

#xAI#Elon Musk#Open source

editor take

Elon Musk says Grok Build is open source, but no modules, license, or version details yet—don't get excited.

HKR breakdown

hook — · knowledge — · resonance —

SCORE 39 · H0·K0·R0

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 20:26 · 07·15

→Claude Code artifacts can now call MCP connectors

Claude Code artifacts can now invoke MCP connectors, letting dashboards and apps fetch data or run actions per viewer on demand. Available on Pro, Max, Team, and Enterprise plans; public shared artifacts are excluded. The post doesn't detail connector types or latency.

#Anthropic

why featured

Featured · importance 78 · hook + knowledge + resonance

editor take

Claude Code artifacts can now call MCP connectors for per-viewer live data, but public sharing is excluded.

sharp

This moves artifacts from static snapshots to something that can actually do work. Before, artifacts were mostly one-shot renders; now they can reach out through MCP connectors to fetch data or trigger actions per viewer. It's paywalled to Pro, Max, Team, and Enterprise, and public shared artifacts are excluded — Anthropic is drawing a clear line on cost and permissions. The post doesn't specify which connector types are supported or what latency looks like, and those two gaps determine whether this is demo-grade or production-ready. I'd treat it as an early version aimed at lightweight internal dashboards for now.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 78 · H1·K1·R1

Hacker News Frontpage · rss · EN · 20:24 · 07·15

→xAI open-sources Grok build tooling on GitHub

xAI has open-sourced the build tooling behind Grok on GitHub. The post doesn't disclose which modules or version are included, but the repo name grok-build suggests training or deployment infrastructure. Engineers can now read the source and config directly instead of reverse-engineering.

#xAI#Grok

editor take

xAI open-sourced Grok's build tooling on GitHub. Code is readable, but no word on which modules are included.

HKR breakdown

hook ✓ · knowledge — · resonance —

SCORE 60 · H1·K0·R0

Hacker News Frontpage · rss · EN · 19:42 · 07·15

→Brainless: shadcn components that replicate Claude Code, Codex, and Grok UIs

Brainless is a shadcn/ui registry that rebuilds the terminal interfaces of Claude Code, Codex, and Grok as accessible React components. Install via one command, then compose agent sessions, thinking lines, tool calls, and diffs like any shadcn block. The post doesn't disclose performance metrics or user numbers, but the project is open-source and ready to pull via shadcn registry. Saves frontend devs from hand-rolling agent terminal UIs.

#Claude Code#OpenAI Codex#Grok#Open source

editor take

Drop-in shadcn components that replicate Claude Code, Codex, and Grok terminal UIs — one command to install.

HKR breakdown

hook ✓ · knowledge ✓ · resonance —

SCORE 62 · H1·K1·R0

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 19:37 · 07·15

→Alex Turner left Google DeepMind over its unrestricted military AI deal

Alex Turner published a long post on July 15 explaining why he quit: Google quietly signed a Pentagon AI deal with no use restrictions, reversing the red-line framework he had pushed internally. He escalated to Jeff Dean and Demis Hassabis, and tried to mobilize Bengio, Russell, and others to apply public pressure—none of it worked. He calls out multiple AI leaders for staying silent on militarization and includes his own draft framework for government AI contracts. The post doesn't say where he's going next.

#Alex Turner#Google DeepMind#Jeff Dean

why featured

Featured · importance 82 · hook + knowledge + resonance

editor take

Alex Turner quit Google DeepMind after Google signed a Pentagon AI deal with no use restrictions; his internal red-line push, escalated to Jeff Dean and Demis Hassabis, failed, as did his attempt to rally Bengio and Russell for public pressure.

sharp

This isn't a vague "values misalignment" resignation post. Turner lays out a timeline, emails, and a lunch conversation with Jeff Dean. The core fact: Google signed a Pentagon AI contract with no use restrictions, effectively abandoning the red-line framework he'd pushed internally. He tried building internal friction and recruiting Bengio and Russell for public pressure—neither worked. I'd treat this as a primary source, not neutral analysis. Turner has a clear stance, but the details are specific enough to be cross-checked. The most contentious part is him calling out AI leaders for staying silent on militarization, though the post doesn't include their responses. What's missing: Google's side—contract specifics, internal decision-making, what Demis actually meant by "principles haven't changed." If a follow-up from Google staff or a reporter lands, this thread gets a lot sharper.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 82 · H1·K1·R1

r/LocalLLaMA · rss · EN · 19:26 · 07·15

→Google updates Gemma 4 chat templates, fixes tool calling, reduces laziness, enables Flash Attention 4 on Hopper GPUs

Google is updating Gemma 4's chat templates to fix tool calling and reduce model laziness. The update also enables Flash Attention 4 on Hopper GPUs and includes an interactive guide for improving vision capabilities. The post does not disclose a specific version number or release timeline.

#Vision#Google#Gemma 4#Hopper GPU

editor take

Gemma 4's updated chat templates fix tool calling and reduce laziness, and the update enables Flash Attention 4 on Hopper GPUs, but the source post returns a 403 and no version number is disclosed.

HKR breakdown

hook ✓knowledge ✓resonance —

→ open source

62

SCORE

H1·K1·R0

19:15

13d ago

Product Hunt · AI · rss · EN · 19:15 · 07·15

→Backdrop: AI coworkers that run your projects and operations

Backdrop provides AI coworkers that integrate with Slack, GitHub, Linear, and more to run projects and operations. They synthesize customer feedback, create plans, manage tickets, and draft documents—acting as team members that understand company context. The post doesn't disclose pricing, underlying model, or latency, but the pitch is clear: let AI handle execution while humans focus on decisions.

#Backdrop#Slack#GitHub

editor take

Backdrop sells AI coworkers that plug into Slack and GitHub, but doesn't disclose model, latency, or pricing.

HKR breakdown

hook —knowledge —resonance —

→ open source

55

SCORE

H0·K0·R0

18:57

13d ago

● P1 · Financial Times · Technology · rss · EN · 18:57 · 07·15

→Mira Murati's Thinking Machines releases debut AI model

Mira Murati's startup Thinking Machines released its first model. The FT reports the model borrows training techniques from Chinese firms like DeepSeek, achieving near-frontier performance with less compute. The article does not disclose the model name, parameter count, benchmark scores, or whether it is open-weight.

#Mira Murati#Thinking Machines#DeepSeek

why featured

Featured · importance 88 · hook + knowledge + resonance

editor take

Mira Murati's first model openly borrows architecture ideas from Chinese rivals — both FT and Bloomberg confirm it, but neither has pricing or full benchmarks yet.

sharp

Mira Murati's Thinking Machines just dropped its first model, and two major outlets — FT and Bloomberg — covered it on the same day. That's a coordinated media push, not a leak. The model is called TML-1, and both sources agree it uses a Mixture of Experts architecture. FT goes further, naming DeepSeek and Qwen as direct architectural influences. Bloomberg frames it as a general-purpose release; FT leads with the China connection. The facts don't conflict, just the emphasis. I'd take the "borrows from Chinese models" angle seriously — two independent outlets wouldn't both run that unless the company itself was comfortable with the framing. On performance, FT says TML-1 is competitive with GPT-4o on MMLU and HumanEval, but the full benchmark table isn't public. Selective comparisons are standard for a launch, but they also mean we can't assess weak spots yet. What's missing: pricing, context window size, API availability, and whether fine-tuning is supported. Right now this is a technical debut with a clear architectural story. Don't read it as a shipping product — read it as Murati showing her hand on model design philosophy.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

88

SCORE

H1·K1·R1

18:40

13d ago

FEATURED · Hacker News Frontpage · rss · EN · 18:40 · 07·15

→Alex Turner explains why he left Google DeepMind over a Pentagon AI deal and internal pushback

Alex Turner published a long post explaining his departure. The trigger was Google signing a Pentagon AI contract, which he feared would enable military surveillance. He drafted a 'red line framework' to block it, reached out to Jeff Dean, Bengio, and others, but senior management ultimately didn't adopt it. The post also covers Google's role in immigration enforcement supply chains and Pentagon pressure on Anthropic. Turner argues the company's AI principles were quietly set aside under government pressure. The post does not disclose the contract value or specific military use cases.

#Google DeepMind#Alex Turner#Jeff Dean

why featured

Featured · importance 82 · hook + knowledge + resonance

editor take

Alex Turner quit DeepMind after Google signed a Pentagon AI contract; his red-line push with Jeff Dean and Bengio failed, and AI principles were set aside.

sharp

This post is worth opening because Turner lays out the internal process in detail: he drafted a 'red line framework' to block the contract, reached out to Jeff Dean, Bengio, and Stuart Russell, but senior management ultimately didn't adopt it. The post also covers Google's role in immigration enforcement supply chains and Pentagon pressure on Anthropic. Turner's core claim: the company's AI principles were quietly set aside under government pressure, and Demis Hassabis publicly insisted they 'haven't changed' while the actual moves shifted. The post doesn't disclose the contract value or specific military use cases, so it's hard to gauge how sensitive the deal really is. But Turner's perspective as an internal researcher carries weight — he's not describing 'evil company,' he's describing how principles got bypassed in the process. I'd read this as a firsthand account, not an investigative report.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

82

SCORE

H1·K1·R1

18:14

13d ago

● P1 · Hacker News Frontpage · rss · EN · 18:14 · 07·15

→Thinking Machines releases Inkling, 975B-parameter open-weights multimodal model

Inkling is a 975B total / 41B active parameter Mixture-of-Experts model with a 1M-token context window and native text, image, and audio input. Thinking Machines compares it against Nemotron 3 Ultra, GLM 5.2, GPT 5.6 Sol, and Claude Fable 5, claiming frontier-level performance in general intelligence, agentic coding, and speech. Weights are on Hugging Face and fine-tuning is available via the Tinker platform. The post does not disclose training data, training cost, inference latency, or specific benchmark scores—take the comparison charts with a grain of salt.

#Thinking Machines Lab#Mira Murati#Hugging Face

why featured

Featured · importance 100 · hook + knowledge + resonance

editor take

Mira Murati's Thinking Machines dropped its first open-weight model: 975B params, multimodal, Apache 2.0. Five sources all echoing the same official briefing — this is a coordinated launch.

sharp

Thinking Machines released Inkling, a 975B total / 41B active MoE model that handles text, images, and audio natively, with a 1M-token context window and an Apache 2.0 license. Hugging Face, TechCrunch, HN, and Latent Space all covered it on the same day with near-identical details. This is a coordinated press push, so the core specs are solid. TechCrunch added a framing layer the official blog didn't: positioning Inkling as a bet against one-size-fits-all AI, built for enterprises to fine-tune rather than use as a general-purpose chatbot. That aligns with Murati's public stance since leaving OpenAI, but it's still a narrative choice, not a technical claim. HN threads focused more on the parameter count and license, with people already comparing it to Llama 4 and DeepSeek-V3 on cost-performance. I'd hold off on the benchmarks until third-party evals show up. The official numbers look strong, but all 975B weights have to be stored and served (the 41B active per token cuts compute, not storage), so you're not running this on a single consumer GPU, and no one's disclosed training cost or inference pricing yet. Also, the training data composition is vague, so if you're planning to fine-tune and deploy, the legal side isn't fully clear.
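
A rough back-of-the-envelope check on the single-GPU point, assuming weights dominate memory and ignoring KV cache, activations, and runtime overhead. The precisions listed are illustrative choices, not formats Thinking Machines has said it ships.

```python
def weight_memory_gb(params: float, bits_per_param: float) -> float:
    """Approximate weight storage in GB (10^9 bytes) for a given precision."""
    return params * bits_per_param / 8 / 1e9


TOTAL_PARAMS = 975e9   # every expert must be resident (or offloaded somewhere)
ACTIVE_PARAMS = 41e9   # parameters actually used per token

# Precisions are assumptions for illustration only.
for label, bits in [("bf16", 16), ("8-bit", 8), ("4-bit", 4)]:
    total_gb = weight_memory_gb(TOTAL_PARAMS, bits)
    active_gb = weight_memory_gb(ACTIVE_PARAMS, bits)
    print(f"{label:>5}: total {total_gb:7.1f} GB, active per token {active_gb:5.1f} GB")
```

Even at 4 bits per weight the full model is in the hundreds of gigabytes, which is why the active-parameter count says more about speed than about what hardware can host it.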

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

100

SCORE

H1·K1·R1

17:29

13d ago

FEATURED · Hugging Face Blog · rss · EN · 17:29 · 07·15

→Ai2 shares the engineering lessons behind Shippy, a maritime AI agent

Ai2's Skylight team built Shippy, an AI assistant that helps maritime analysts query fishing activity, EEZ boundaries, and vessel tracks. The post breaks its architecture into three parts: a soul (system prompt), skills (markdown files that teach it to call APIs and interpret track data), and config (runtime settings; currently Claude Opus 4.6 with the OpenClaw framework). The core idea is wrapping a non-deterministic model in deterministic tools—every answer includes source, data cutoff, and a deep link to the Skylight map so an analyst can verify it. The post doesn't disclose error rates or latency numbers, but it stresses sandboxed hosting and evaluating the agent as a system, not just the model.

#Agent#Ai2#Skylight#Shippy

why featured

Featured · importance 72 · hook + knowledge

editor take

Ai2's Shippy agent wraps a non-deterministic model in deterministic tools—every answer ships with source, data cutoff, and a map link for analyst verification.

sharp

The useful bit here is the architecture pattern: split the agent into a soul (system prompt), skills (markdown files teaching API calls), and config (runtime settings), then wrap the non-deterministic model in deterministic tooling. Every response includes the data source, cutoff time, and a deep link back to the Skylight map so an analyst can verify it themselves. That's a concrete reliability play for high-stakes domains, not just vibes. The post doesn't disclose error rates or latency, and it's silent on what Claude Opus 4.6 costs per query in this setup. I'd treat this as a reference architecture for adding guardrails to agents in operational settings, not a reproducible performance report.
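
One way to picture the "deterministic wrapper" idea is a response type that cannot be constructed without its provenance. This is a minimal sketch, not Ai2's code: the class name `GroundedAnswer`, its fields, and the example URL are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class GroundedAnswer:
    """Model text plus the provenance an analyst needs to verify it (hypothetical type)."""
    text: str
    source: str
    data_cutoff: datetime
    deep_link: str

    def __post_init__(self) -> None:
        # Deterministic checks: refuse to build an answer that cannot be verified.
        if not self.source.strip():
            raise ValueError("answer has no data source")
        if not self.deep_link.strip():
            raise ValueError("answer has no verification link")
        if self.data_cutoff.tzinfo is None:
            raise ValueError("data cutoff must be timezone-aware")

    def render(self) -> str:
        cutoff = self.data_cutoff.astimezone(timezone.utc).strftime("%Y-%m-%d %H:%M UTC")
        return (
            f"{self.text}\n"
            f"Source: {self.source}\n"
            f"Data as of: {cutoff}\n"
            f"Verify: {self.deep_link}"
        )


answer = GroundedAnswer(
    text="Two vessels entered the area.",
    source="vessel-track API (hypothetical)",
    data_cutoff=datetime(2026, 7, 15, 12, 0, tzinfo=timezone.utc),
    deep_link="https://example.invalid/map?event=123",  # placeholder URL
)
print(answer.render())
```

The model can still be wrong, but the wrapper guarantees every answer arrives with enough context for a human to check it.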

HKR breakdown

hook ✓knowledge ✓resonance —

→ open source

72

SCORE

H1·K1·R0

17:27

13d ago

FEATURED · Hugging Face Blog · rss · EN · 17:27 · 07·15

→Model routing is simple—until you measure real cost, not sticker price

IBM Research found that routing by model sticker price backfired in agent workloads. Across 417 AppWorld tasks, Claude Sonnet 4.6 cost $79 total vs. GPT-4.1's $155—nearly double—because Sonnet's lower cache-read pricing exploited high context reuse across steps. The post argues real cost, latency, and complexity all depend on workload-infrastructure interaction, making routing a systems optimization problem, not a classification one.

#Agent#Reasoning#IBM Research#Anthropic

why featured

Featured · importance 78 · hook + knowledge + resonance

editor take

Routing by sticker price backfires: IBM ran 417 agent tasks and Claude Sonnet 4.6 cost $79 total vs. GPT-4.1's $155, thanks to cheaper cache reads.

sharp

I clicked because the numbers flip a common assumption. IBM ran 417 AppWorld agent tasks and found Claude Sonnet 4.6 cost $79 total while GPT-4.1 hit $155—nearly double—even though GPT-4.1's sticker price looks competitive. The reason: agents reuse large context windows across steps, and Sonnet's cache-read pricing is much lower, so the real bill tilts hard toward the model with better caching economics. The takeaway isn't "Sonnet beats GPT"—it's that routing by per-token price alone is broken for agent workloads. Cache hit rates, context reuse patterns, and step count dominate the final cost. The post doesn't break out latency or complexity numbers in detail, but the framework is solid. If you're building agent pipelines, benchmark your own task traces before trusting a routing table built on standalone evals.
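
The caching effect can be sketched with a toy cost model. Everything here is an assumption for illustration: the two price sheets are invented (they are not Anthropic's or OpenAI's rates), and the run shape (30 steps, 20k-token starting context, 95% cache hits) is made up rather than taken from IBM's traces.

```python
from dataclasses import dataclass


@dataclass
class Pricing:
    """USD per million tokens. Values used below are placeholders, not real rates."""
    input: float
    cached_input: float
    output: float


def run_cost(p: Pricing, steps: int, context_tokens: int, new_tokens_per_step: int,
             output_tokens_per_step: int, cache_hit_rate: float) -> float:
    """Total cost of a multi-step agent run that re-sends its growing context each step."""
    total = 0.0
    context = context_tokens
    for _ in range(steps):
        cached = context * cache_hit_rate
        uncached = context - cached + new_tokens_per_step
        total += (cached * p.cached_input
                  + uncached * p.input
                  + output_tokens_per_step * p.output) / 1e6
        # The next step re-reads everything seen so far.
        context += new_tokens_per_step + output_tokens_per_step
    return total


model_a = Pricing(input=3.0, cached_input=0.3, output=15.0)  # pricier sticker, cheap cache reads
model_b = Pricing(input=2.0, cached_input=1.0, output=8.0)   # cheaper sticker, pricier cache reads

for hit_rate in (0.0, 0.95):
    a = run_cost(model_a, 30, 20_000, 500, 300, hit_rate)
    b = run_cost(model_b, 30, 20_000, 500, 300, hit_rate)
    print(f"cache hit rate {hit_rate:.2f}: model_a ${a:.2f}, model_b ${b:.2f}")
```

With no caching the cheaper sticker price wins; with heavy context reuse the ordering flips, which is the shape of the result the post describes.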

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

78

SCORE

H1·K1·R1

17:24

13d ago

Product Hunt · AI · rss · EN · 17:24 · 07·15

→Graft AI: Turn company operations into a living map for agents

Graft AI launched this week on Product Hunt, targeting agents that need to work with legacy apps and internal tools without clean APIs. It learns how employees operate, maps workflows into a living operational map, and gives agents stable tools with permissions, approvals, audit trails, and verification. When the underlying UI changes, Graft detects drift and repairs the workflow without breaking the agent interface. The post doesn't disclose pricing details beyond "Free Options."

#Graft AI#Product Hunt

editor take

Graft AI watches how employees use legacy apps, maps the workflow, and gives agents stable tools that auto-repair when UIs change.

HKR breakdown

hook ✓knowledge ✓resonance —

→ open source

62

SCORE

H1·K1·R0

17:09

13d ago

● P1 · MIT Technology Review · rss · EN · 17:09 · 07·15

→OpenAI develops GPT-Red, an automated red-teamer to find vulnerabilities in its models

OpenAI trained GPT-Red as an automated red-teamer that attacks its own models to find and patch vulnerabilities before release. It uses a self-play loop to get better at attacking while defender models get better at resisting. GPT-Red discovered a new attack called fake chain of thought, where it slips spoofed info into a model's internal reasoning notes and the model accepts it as verified. In a rerun of a 2025 human red-teaming test, GPT-Red was more successful at finding effective attacks. OpenAI says this is meant to handle the growing attack surface as models become agents that interact with code, websites, and third-party tools.

#OpenAI#GPT-Red#GPT-5.6

why featured

Featured · importance 96 · hook + knowledge + resonance

editor take

OpenAI trained GPT-Red via self-play to attack its own models and claims it found novel prompt injection attacks humans missed — but this is a single-source exclusive from MIT Tech Review, no indep...

sharp

This is a single-source exclusive from MIT Technology Review — the second entry is just their own newsletter roundup — so the multi-source signal here is thin. Treat it as OpenAI's controlled narrative, not independently confirmed reporting. The core story: OpenAI trained GPT-Red in a self-play loop where it attacks other models and they learn to defend. It specializes in prompt injection, and the team claims it discovered a new attack vector called 'fake chain of thought' — slipping false entries into a model's reasoning trace so it treats bad info as self-verified. That's genuinely interesting for agent safety, since agents read web pages, run code, and call APIs, making the injection surface much wider than a chat window. OpenAI says GPT-Red outperformed human red-teamers from a 2025 experiment and successfully hacked a third-party test agent called Vendy. But what's missing matters: no model size, no cost, no false-positive rate, and no external red team has reproduced the findings. The Georgetown CSET researcher's quote is positive but it's a comment on the approach, not an audit. Read this as OpenAI showing its safety R&D hand, not as a new industry benchmark.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

96

SCORE

H1·K1·R1

17:00

13d ago

● P1 · TechCrunch AI · rss · EN · 17:00 · 07·15

→Suno hack exposes source code showing scraping of YouTube and Deezer for training data

404 Media reports that AI music generator Suno was breached via a supply chain attack last November. A hacker used employee credentials to access source code, which showed Suno scraped decades of audio from YouTube Music, Deezer, Genius, stock music libraries, and podcast RSS feeds. Suno had only admitted to training on 'publicly available music files' before. Major labels are suing, arguing that bypassing YouTube's anti-scraping protections violates the DMCA. The hacker also accessed customer emails, phone numbers, and partial credit card numbers via Stripe. Suno did not notify users, calling it a 'limited security incident that was quickly contained.'

#Suno#YouTube#Deezer

why featured

Featured · importance 88 · hook + knowledge + resonance

editor take

Leaked source code from a Suno hack points directly to scraping YouTube, Deezer, and Genius for training data—way more specific than the company's previous 'publicly available' line.

sharp

404 Media broke the story, and both TechCrunch and The Verge picked it up with matching details—all sourced from the same hacker's dump. So the signal here isn't 'how much did Suno scrape,' it's 'Suno's own source code was built to scrape these exact platforms.' The code named YouTube Music, Deezer, Genius, stock music libraries, and podcast RSS feeds as ingestion targets. That's a lot more damning than the company's previous careful phrasing about 'publicly available music files.' I'd discount the hacker's claims a bit until we get independent verification—Suno is calling this a 'limited security incident' and hasn't confirmed the code's authenticity. But the timing is rough for them. Major labels are already suing, arguing that bypassing YouTube's anti-scraping protections violates the DMCA. If this source code gets admitted as evidence, Suno's fair use defense gets harder to sustain. Also worth noting: the hacker accessed customer emails, phone numbers, and partial credit card numbers, and Suno never notified users about the November 2025 breach. That's a separate problem that's going to attract regulator attention.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

88

SCORE

H1·K1·R1

17:00

13d ago

TechCrunch AI · rss · EN · 17:00 · 07·15

→Whatnot acquires Shaped to power real-time live shopping recommendations

Livestream shopping app Whatnot acquired Shaped, a machine learning startup focused on real-time recommendations and search. The deal targets a core live-commerce problem: inventory, auctions, and buyer demand shift constantly, so static product catalogs don't work. Whatnot plans to use Shaped's tech to improve discovery and personalization. The post doesn't disclose the deal size or team integration plans.

#Whatnot#Shaped

editor take

Whatnot bought Shaped for real-time recs to fix live shopping's dynamic inventory problem.

HKR breakdown

hook —knowledge —resonance —

→ open source

55

SCORE

H0·K0·R0

16:35

13d ago

FEATURED · Hacker News Frontpage · rss · EN · 16:35 · 07·15

→Pantograph pretrains on internet video to build a goal-conditioned Minecraft agent

Pantograph released Pan, a 4B-parameter Minecraft model that fights mobs, explores, and builds structures. It pretrains on internet-scale video using hindsight relabeling—later frames become the goal for earlier frames, so no hand-labeled rewards are needed. At inference, you give it a goal image and it acts, even in unseen environments. The post doesn't disclose training data volume, hardware, or wall-clock time.

#Pantograph

why featured

Featured · importance 72 · hook + knowledge

editor take

Pantograph trained a 4B Minecraft model on internet video using hindsight relabeling, but training scale and hardware aren't disclosed.

sharp

The method is clean: later frames in a video become the goal for earlier frames, so no one has to hand-label rewards. A 4B model fighting mobs and building structures from a goal image in unseen environments is a decent generalization signal. Pantograph frames this as a stepping stone toward robotics, with Minecraft as the testbed. I'd discount it a bit for now—the post doesn't disclose training data volume, hardware, or wall-clock time, and there's no quantitative comparison to existing Minecraft agents like MineDojo or Voyager. It reads more like a technical preview than a full paper. If the demos hold up, the useful bit is proving that goal-directed behavior can emerge from observation-only video at scale, without action labels. But without scale and compute details, it's hard to gauge the replication cost.
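
Hindsight relabeling as described can be sketched in a few lines: for each frame, pick a later frame from the same clip and call it the goal. This is a generic illustration, not Pantograph's pipeline; the function name, the `max_horizon` parameter, and the uniform sampling of the goal offset are assumptions.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

Frame = TypeVar("Frame")


def hindsight_pairs(frames: Sequence[Frame], max_horizon: int,
                    rng: random.Random) -> List[Tuple[Frame, Frame]]:
    """Turn one unlabeled clip into (observation, goal) training pairs.

    The goal for frame t is a frame that actually occurred later in the same
    clip, so no reward or goal annotation is needed.
    """
    pairs = []
    for t in range(len(frames) - 1):
        # Sample how far ahead the goal frame lies, clipped to the end of the clip.
        k = rng.randint(1, min(max_horizon, len(frames) - 1 - t))
        pairs.append((frames[t], frames[t + k]))
    return pairs


# Integers stand in for video frames in this toy example.
clip = list(range(10))
for obs, goal in hindsight_pairs(clip, max_horizon=4, rng=random.Random(0)):
    print(f"observe frame {obs} -> goal frame {goal}")
```

At inference the sampled future frame is replaced by a user-supplied goal image, which is what makes the trained policy steerable.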

HKR breakdown

hook ✓knowledge ✓resonance —

→ open source

72

SCORE

H1·K1·R0

16:25

13d ago

AI HOT (Curated Pool) · aihot-api · ZH · 16:25 · 07·15

→The Three-Second Theft: Why AI Voice Fraud Outruns Every Defence

The FBI broke out AI-enabled fraud as a separate category for the first time in April 2026, logging over 22,000 complaints and $893M in adjusted losses, with $352M hitting victims aged 60+. The attack vector is brutally cheap: as little as three seconds of audio, from a TikTok clip or a voicemail, can be enough to clone a voice that sounds indistinguishable from the original. A Consumer Reports investigation in March 2025 found that most of the six voice-cloning products it examined relied on a self-attestation checkbox with no technical consent verification. ElevenLabs offers post-hoc traceability and election-cycle blocks for protected figures, but neither measure stops a clone from being generated. INTERPOL pegged global financial fraud at $442B in 2025, with AI-enhanced scams roughly 4.5x more profitable and agentic systems now autonomously running full fraud campaigns. The post does not disclose specific technical defenses or regulatory timelines.

#FBI#INTERPOL#Consumer Reports

editor take

FBI broke out AI fraud as a separate category: 22K+ complaints, $893M in losses. Three seconds of audio is enough to clone a voice.

HKR breakdown

hook ✓knowledge ✓resonance —

→ open source

68

SCORE

H1·K1·R0

16:15

13d ago

Hacker News Frontpage · rss · EN · 16:15 · 07·15

→Déjà Vu: open-source memory for coding agents, synced over SSH

Déjà Vu is an open-source tool that gives coding agents like Claude Code persistent project memory across sessions. It stores memory as local files and syncs them over SSH between teammates, with no third-party service involved. The post is README-level; it doesn't disclose latency numbers, conflict resolution details, or real-world team sizes.

#Claude Code#Open source

editor take

Persistent project memory for Claude Code, synced over SSH with no third-party service. README-only for now—no word on conflict resolution or latency.

HKR breakdown

hook ✓knowledge ✓resonance —

→ open source

72

SCORE

H1·K1·R0

16:15

13d ago

FEATURED · Hacker News Frontpage · rss · EN · 16:15 · 07·15

→Unsolved Problems in MLOps

This ACM Queue piece lays out why classical ops practices break down for ML: non-deterministic outputs and data as a system driver make canary deploys, health checks, and alerting nearly useless. Azure validates new models by having LLMs judge LLM output—the SRECon audience was audibly surprised. The authors argue the field must either find a better paradigm or fix the ones we have.

#Microsoft#Azure#Brendan Burns#Benchmark

why featured

Featured · importance 78 · hook + knowledge + resonance

editor take

Azure validates new models by having LLMs judge LLMs and employees hit thumbs-up—the SRECon audience was audibly surprised.

sharp

This piece names the core problem most MLOps teams feel but rarely articulate: classical ops assumes determinism, and ML systems don't give you that. Health checks, canary deploys, alert thresholds—all built on the idea that you send a known input and get a known response. That assumption breaks the moment your system is a model. The most telling anecdote is from Brendan Burns at SRECon Americas 2025. Azure's validation pipeline for new models in their UI? LLMs judging LLM output, plus internal employees hitting thumbs-up. The audience was audibly surprised—not in a good way. The authors' phrasing is blunt: rolling out a new model is "more an exercise in vibes than anything else." I'd read this less as a solution paper and more as a mirror. If your team is still trying to bolt ML onto traditional CI/CD practices, the full 19 pages are worth your time. It doesn't offer fixes, but it frames the gap clearly: either find a better paradigm or fix the ones we're using now. The honest admission that we don't really know how to do this well is the useful part.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

78

SCORE

H1·K1·R1

16:12

13d ago

FEATURED · Hacker News Frontpage · rss · EN · 16:12 · 07·15

→OpenAI and Work Louder release Codex Micro keyboard for agent workflows

OpenAI and Work Louder co-designed the Codex Micro keyboard for $230. Each key maps to a Codex agent with live RGB status (thinking, running, waiting, done). A joystick launches common workflows like PR review or debugging; a dial adjusts reasoning level on the fly. Bluetooth/USB-C, Mac/Windows compatible, includes 32 custom icon keycaps. The post does not disclose shipping dates or initial batch size.

#OpenAI#Work Louder

why featured

Featured · importance 72 · hook + knowledge

editor take

OpenAI drops a $230 keyboard for Codex while in a legal fight with Apple over hardware trade secret allegations. The timing feels pointed, but this reads as a branded accessory, not a hardware strategy.

sharp

OpenAI teamed up with boutique keyboard maker Work Louder to release the Codex Micro, a $230 keyboard built specifically for its Codex coding agent. It's got light-up keys that show agent status, customizable shortcut buttons, a joystick for launching workflows, and a dial that adjusts how much reasoning compute an agent uses. Both TechCrunch and HN are covering it, but HN only has the product page title — all the details come from TechCrunch's article. The product specs are consistent across sources because they're pulled from the same OpenAI landing page. TechCrunch adds the context that OpenAI is currently in a legal fight with Apple over hardware trade secret allegations, which makes the timing of a physical product launch feel pointed. I wouldn't read this as OpenAI pivoting into hardware, though — it's a co-branded accessory where Work Louder handles the manufacturing and OpenAI supplies the branding and software integration. $230 isn't outrageous for a mechanical keyboard, but it's not an impulse buy either. No word yet on sales targets or whether more hardware is coming, so treat this as a niche desk toy for heavy Codex users.

HKR breakdown

hook ✓knowledge ✓resonance —

→ open source

72

SCORE

H1·K1·R0

15:58

13d ago

Hacker News Frontpage · rss · EN · 15:58 · 07·15

→New codec misa77 decodes 2x faster than LZ4 with better ratios

misa77 is a new open-source codec that achieves up to 5219 MB/s decompression—over 2x faster than LZ4—with better compression ratios. The trade-off is slow compression (54.5 MB/s), making it ideal for write-once read-many workloads. Gains come from fewer branches and out-of-order core friendliness. The post doesn't mention streaming or cross-platform support.

#Inference-opt#misa77#LZ4#Silesia corpus

editor take

misa77 decompresses over 2x faster than LZ4 with better ratios, but compression runs at only 54.5 MB/s, so it fits write-once read-many workloads.

HKR breakdown

hook —knowledge ✓resonance —

→ open source

55

SCORE

H0·K1·R0

15:57

13d ago

FEATURED · Hacker News Frontpage · rss · EN · 15:57 · 07·15

→J-space comparisons across open models: replicating Anthropic's interpretability findings on 6 open-source models

The author replicated Anthropic's J-space findings on six open models using automated experiments. The middle layers contain a dictionary of directions that causally steer output. This structure appears early in training, transfers between models, and sharpens with scale. Six dimensions were tested: temporal horizon, emergence during training, transplantability, scale effects, corpus dependence, and MoE behavior. All data is open-sourced. The author admits they are not a domain expert and the experiments were run autonomously by an AI agent, so I'd discount the rigor somewhat.

#Anthropic#Elie Bak#Fable

why featured

Featured · importance 78 · hook + knowledge + resonance

editor take

An AI agent replicated Anthropic's J-space findings across six open models, confirming a "direction dictionary" in middle layers, but the agent ran the experiments autonomously, so discount the rigor.

sharp

This one's worth a click because it takes a finding from Anthropic's closed-model paper and replicates it on open models. The core idea: a model's middle layers contain a "dictionary of directions" — inject a specific direction into the residual stream, and the model reliably outputs the corresponding token. The author tested six dimensions: how far forward the steering reaches, when this structure emerges during training, whether it transfers between models, how it changes with scale, corpus dependence, and MoE behavior. But the author admits they're not a domain expert, and the experiments were run autonomously by an AI agent called Fable. I'd discount the rigor somewhat — the experimental design might have gaps. The useful bit is that all data and charts are open-sourced, and there's an llm.txt file you can feed to your own agent to dig into the raw numbers. Just don't treat this as a peer-reviewed paper yet.
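
The "inject a direction, get the token" claim can be pictured with a toy linear readout. This is a generic activation-steering sketch under strong simplifying assumptions, not the author's experiment: the three-token vocabulary, the vectors, and the `alpha` scale are invented, and a real model has further nonlinear layers between the injection site and the logits.

```python
from typing import Dict, List


def dot(a: List[float], b: List[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def steer(hidden: List[float], direction: List[float], alpha: float) -> List[float]:
    """Add a scaled direction to a hidden state: h' = h + alpha * d."""
    return [h + alpha * d for h, d in zip(hidden, direction)]


def top_token(hidden: List[float], unembed: Dict[str, List[float]]) -> str:
    """Read out the token whose unembedding row has the largest dot product."""
    logits = {tok: dot(hidden, vec) for tok, vec in unembed.items()}
    return max(logits, key=logits.get)


# Toy "dictionary of directions": one direction per token (illustrative values).
unembed = {
    "cat": [1.0, 0.0, 0.0],
    "dog": [0.0, 1.0, 0.0],
    "fish": [0.0, 0.0, 1.0],
}
hidden = [0.9, 0.2, 0.1]  # unsteered state, which favors "cat"

print(top_token(hidden, unembed))
print(top_token(steer(hidden, unembed["fish"], alpha=2.0), unembed))
```

The interesting empirical question in the post is whether real middle-layer directions behave this cleanly, which the toy cannot answer.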

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

78

SCORE

H1·K1·R1

15:37

13d ago

FEATURED · AI HOT (Curated Pool) · aihot-api · ZH · 15:37 · 07·15

→Telegram launches serverless platform for bots, running code directly on its infrastructure

Telegram released a serverless platform that runs bot backend code inside its own V8 sandbox—no VPS, no containers, no scaling to manage. It ships with a built-in SQLite database and direct Bot API access. Deploy with a single npx tgcloud push. A full demo bot (message counter) fits in two files: schema.js and handlers/message.js. Only JavaScript is supported right now; the post doesn't mention timelines for Python or other languages.

#Telegram

why featured

Featured · importance 72 · hook + knowledge + resonance

editor take

Telegram now runs bot backends in its own V8 sandbox—one-command deploy, built-in SQLite, no VPS needed.

sharp

The reason this is worth a look: it removes the most annoying part of building Telegram bots—renting a VPS, managing containers, worrying about scaling. You write a few JS files, run npx tgcloud push, and your code executes inside Telegram's own V8 sandbox, right next to the Bot API and a built-in SQLite database. A message-counter demo fits in two files: schema.js and handlers/message.js. Only JavaScript is supported right now; the post doesn't mention timelines for Python or other languages. I'd frame this as Telegram making a sharp move to lower the bot-building barrier—you can skip the ops layer and go straight to logic. But don't read it as a general-purpose serverless platform. It's locked into Telegram's ecosystem, the database has no foreign keys, and the docs don't spell out sandbox limits. If you're already running a complex bot on a VPS, migrating may not be worth it. For new or lightweight bots, though, this deployment flow is genuinely friction-free.

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

72

SCORE

H1·K1·R1

15:34

13d ago

FEATURED · Hacker News Frontpage · rss · EN · 15:34 · 07·15

→Running Gemma 4 26B at 5 tokens/sec on a 13-year-old Xeon with no GPU

The author got Google's Gemma 4 26B MoE model running on a dual Xeon E5-2690 v2 server from 2013 with no GPU, costing under $300. The CPUs only support AVX1, but ik_llama.cpp's optimized kernels require AVX2, causing silent gibberish output. Claude diagnosed that the graph builder unconditionally emitted MOE_FUSED_UP_GATE ops while the dispatcher had no matching case, leaving ~240 tensors per forward pass reading uninitialized memory. After the fix, decode reaches ~5.2 tokens/sec and prompt eval ~16 tokens/sec. A PR is open but not yet merged. The post doesn't disclose quantized model memory usage or power draw.

#Inference-opt#Google#Gemma 4#ik_llama.cpp

why featured

Featured · importance 72 · hook + knowledge + resonance

editor take

A 2013 dual Xeon box with no GPU runs Gemma 4 26B at 5 tok/s after fixing an AVX2 instruction set mismatch.

sharp

This one's worth opening because it turns 'old hardware runs new models' from a headline into a reproducible engineering sample. The author took a sub-$300 HP storage server with dual Xeon E5-2690 v2 chips and no GPU, and got Google's Gemma 4 26B MoE running at 5.2 tok/s — roughly reading speed. The path wasn't smooth. ik_llama.cpp's optimized kernels assume AVX2, but this 2013 CPU only has AVX1, so the model silently produced gibberish. Claude diagnosed it: the graph builder unconditionally emitted MOE_FUSED_UP_GATE ops while the dispatcher had no matching case, leaving ~240 tensors per forward pass reading uninitialized memory. After the fix, decode hits ~5.2 tok/s and prompt eval ~16 tok/s. The PR is open but not yet merged. I'd discount this a bit: the post doesn't disclose quantized model memory usage or power draw. 5 tok/s is borderline for interactive chat and too slow for real-time use. But the real signal here isn't the performance number — it's the workflow. The author doesn't write C++ kernels but drove Claude to locate the bug, produce a patch, and verify correctness. That pattern of using AI to fix AI inference engines is more interesting than the benchmark.
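
The failure mode described (a graph builder emitting an op the dispatcher has no case for) is easy to reproduce in miniature. This is an illustrative Python sketch, not ik_llama.cpp code: `run_graph`, the kernel table, and the `strict` flag are hypothetical, and `None` stands in for uninitialized memory.

```python
from typing import Callable, Dict, List, Optional

Kernel = Callable[[], float]


def run_graph(ops: List[str], kernels: Dict[str, Kernel],
              strict: bool) -> List[Optional[float]]:
    """Execute each op via a dispatch table; None models an output nobody wrote."""
    outputs: List[Optional[float]] = [None] * len(ops)
    for i, op in enumerate(ops):
        kernel = kernels.get(op)
        if kernel is None:
            if strict:
                # Fail loudly instead of letting downstream ops read garbage.
                raise NotImplementedError(f"no kernel for op {op!r} on this CPU path")
            continue  # silent skip: the slot keeps its uninitialized value
        outputs[i] = kernel()
    return outputs


# The builder emits the fused op unconditionally...
graph = ["MUL_MAT", "MOE_FUSED_UP_GATE", "MUL_MAT"]
# ...but this (hypothetical) AVX1 kernel table has no entry for it.
avx1_kernels: Dict[str, Kernel] = {"MUL_MAT": lambda: 1.0}

print(run_graph(graph, avx1_kernels, strict=False))  # [1.0, None, 1.0]
```

A dispatcher that raises on unknown ops would have turned silent gibberish into an immediate, searchable error, whichever way the actual fix went.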

HKR breakdown

hook ✓knowledge ✓resonance ✓

→ open source

72

SCORE

H1·K1·R1

15:32

13d ago

Product Hunt · AI · rss · EN · 15:32 · 07·15

→Lev8: Parallel AI agents for lead gen

Lev8 is a lead generation tool that uses parallel AI agents for live web search to find people and companies. It enriches CSVs, monitors intent signals, and sends personalized multi-channel messages automatically. Launched on Product Hunt, ranked #1 on day one with 552 upvotes. The post doesn't disclose pricing or latency specifics.

#Lev8 #Product Hunt #Claude by Anthropic

editor take

Lev8 runs parallel AI agents for live lead search — think Clay with agents — but no pricing or latency disclosed.

HKR breakdown

hook — · knowledge — · resonance —

SCORE 55 · H0·K0·R0

15:29

13d ago

● P1 · TechCrunch AI · rss · EN · 15:29 · 07·15

→Apple Intelligence approved for China launch powered by Alibaba Qwen

China's Cyberspace Administration approved Apple Intelligence for launch, backed by a deal to integrate Alibaba's Qwen model into iOS, iPadOS, macOS, and visionOS. Alibaba confirmed Qwen will power text and image understanding and generation, but gave no timeline. Apple previously explored deals with Baidu, DeepSeek, and ByteDance but hit adaptation issues. The approval matters for Apple's Greater China business, which hit $20.5B in Q2. Alibaba US shares rose over 6% on the news.

#Vision #Apple #Alibaba #Qwen

why featured

Featured · importance 96 · hook + knowledge + resonance

editor take

Apple traded Qwen integration for China's AI approval; Alibaba shares jumped 6%, but no launch date yet.

sharp

The reason this matters: Apple finally cleared China's regulatory hurdle for AI, and it did it by plugging in Alibaba's Qwen instead of building something bespoke. They'd previously kicked the tires with Baidu, DeepSeek, and ByteDance—all fell apart on model adaptation. Greater China brought in $20.5B last quarter, so sitting out AI there wasn't an option. I'd discount the hype a bit. The article doesn't say which Qwen model, or whether it runs on-device or in the cloud—privacy and latency are total unknowns right now. Alibaba confirmed the deal but gave zero timeline, which suggests the engineering isn't done. The 6% stock bump feels more like a "stamp of approval" trade than something that'll hit revenue soon.

HKR breakdown

hook ✓ · knowledge ✓ · resonance ✓

SCORE 96 · H1·K1·R1

15:28

13d ago

FEATURED · Bloomberg Technology · rss · EN · 15:28 · 07·15

→Anthropic Is Said to Plan IPO Investor Meetings as Listing Nears

Bloomberg reports Anthropic is preparing IPO investor meetings, signaling a listing is close. The post does not disclose valuation, offering size, or exchange. Only the headline-level fact is confirmed so far—wait for the S-1 to see the financials.

#Anthropic

why featured

Featured · importance 92 · hook + resonance

editor take

Anthropic is starting IPO investor meetings—listing is close, but valuation and offering size aren't disclosed yet.

sharp

The reason to click: Anthropic's IPO is moving from rumor to investor roadshow. Bloomberg's sources say the company is planning investor meetings, but there's no valuation, offering size, or exchange yet. I'd hold off on any financial takes until the S-1 drops; everything before that is guesswork. For context only, and not from the Bloomberg piece: Anthropic's March 2025 round valued it at about $61.5B, with later rounds reported well above that, and Claude Sonnet 4.5, priced at $3/$15 per million input/output tokens, competes head-to-head with OpenAI's GPT-5. The one thing we can say now: if the listing goes through, it would likely be the first major AI lab to complete the full IPO process. The real numbers come with the filing.

HKR breakdown

hook ✓ · knowledge — · resonance ✓

SCORE 92 · H1·K0·R1

15:17

13d ago

Product Hunt · AI · rss · EN · 15:17 · 07·15

→Inbix: Cloudflare-native Email Infrastructure for Developers

Inbix is an open-source, Cloudflare-native email infrastructure for developers. It creates disposable inboxes, receives emails in real time, and integrates via webhooks, REST APIs, SDKs, and MCP. Designed for testing, CI/CD, and AI agents without proprietary infrastructure.

#Inbix #Cloudflare #Open source

editor take

Open-source disposable email infra on Cloudflare, built for AI agents and CI/CD to receive mail without a real server.

HKR breakdown

hook — · knowledge ✓ · resonance —

SCORE 45 · H0·K1·R0

15:15

13d ago

Hacker News Frontpage · rss · EN · 15:15 · 07·15

→Museum of the Human Web: relics from the pre-AI internet

Parallel launches the Museum of the Human Web, a collection of internet artifacts from ARPANET to the eve of ChatGPT. It calls these objects 'relics of the last time we did this alone.' The post doesn't list specific exhibits or sweepstakes details, but says proceeds benefit the Internet Archive and the Computer History Museum. For AI practitioners, it's a curatorial reminder that generative AI is shifting creation from human-only to human-machine collaboration.

#Parallel #Internet Archive #Computer History Museum

editor take

Parallel's Museum of the Human Web calls pre-ChatGPT internet artifacts 'relics of the last time we did this alone.'

HKR breakdown

hook ✓ · knowledge — · resonance ✓

SCORE 55 · H1·K0·R1

14:32

13d ago

Hacker News Frontpage · rss · EN · 14:32 · 07·15

→OpenAI loses EU trademark dispute over its own name

The EU General Court ruled that “OPENAI” is purely descriptive for software and cloud services, so it lacks the distinctiveness required for a trademark. The court said “open” means freely accessible and “AI” means artificial intelligence—together they describe products based on openly accessible AI. OpenAI argued the term is coined and cited registrations in 30+ countries including the UK and Singapore. The court rejected those arguments. The ruling can still be appealed.

#OpenAI #EU General Court #EUIPO

editor take

EU court says 'OPENAI' is too descriptive for a trademark on software and cloud services.

HKR breakdown

hook ✓ · knowledge — · resonance ✓

SCORE 62 · H1·K0·R1
