ax@ax-radar:~/feed $ tail -f signal.log
41 srcsignal 1208%cycle 04:32

hot events · 2026-05-15

39 signals · updated 3m ago
live · 217 today·policy v2
LATENT SPACEAnthropic pulls Fable and Mythos after US e…96·LATENT SPACEAnthropic launches Claude Fable 5, its firs…88·HACKER NEWS FRONTPAGDid Anthropic ask for its own export contro…82·HACKER NEWS FRONTPAGAnthropic flies senior technical staff to D…82·AI HOT (CURATED POOLWSJ: OpenAI weighs steep price cuts and pla…82·HACKER NEWS FRONTPAGBram Cohen: Claude is turning into an assho…78·R/LOCALLLAMAXiaomi serves MiMo V2.5 at 1000–3000 tps wi…78·IMPORT AI (JACK CLARAI learns to game society's rules, and Anth…78·MIT TECHNOLOGY REVIEGoogle DeepMind is worried about what happe…78·DWARKESH PATELThe sample efficiency black hole: AI models…78·LATENT SPACECognition launches FrontierCode: a coding b…78·HACKER NEWS FRONTPAGGabriel Weinberg argues with data that “eve…78·LATENT SPACEAnthropic pulls Fable and Mythos after US e…96·LATENT SPACEAnthropic launches Claude Fable 5, its firs…88·HACKER NEWS FRONTPAGDid Anthropic ask for its own export contro…82·HACKER NEWS FRONTPAGAnthropic flies senior technical staff to D…82·AI HOT (CURATED POOLWSJ: OpenAI weighs steep price cuts and pla…82·HACKER NEWS FRONTPAGBram Cohen: Claude is turning into an assho…78·R/LOCALLLAMAXiaomi serves MiMo V2.5 at 1000–3000 tps wi…78·IMPORT AI (JACK CLARAI learns to game society's rules, and Anth…78·MIT TECHNOLOGY REVIEGoogle DeepMind is worried about what happe…78·DWARKESH PATELThe sample efficiency black hole: AI models…78·LATENT SPACECognition launches FrontierCode: a coding b…78·HACKER NEWS FRONTPAGGabriel Weinberg argues with data that “eve…78·LATENT SPACEAnthropic pulls Fable and Mythos after US e…96·LATENT SPACEAnthropic launches Claude Fable 5, its firs…88·HACKER NEWS FRONTPAGDid Anthropic ask for its own export contro…82·HACKER NEWS FRONTPAGAnthropic flies senior technical staff to D…82·AI HOT (CURATED POOLWSJ: OpenAI weighs steep price cuts and pla…82·HACKER NEWS FRONTPAGBram Cohen: Claude is turning into an assho…78·R/LOCALLLAMAXiaomi serves MiMo V2.5 at 1000–3000 tps wi…78·IMPORT AI (JACK CLARAI learns to game society's rules, and Anth…78·MIT TECHNOLOGY REVIEGoogle DeepMind is worried about what happe…78·DWARKESH PATELThe sample efficiency black hole: AI models…78·LATENT SPACECognition launches FrontierCode: a coding b…78·HACKER NEWS FRONTPAGGabriel Weinberg argues with data that “eve…78·
RSS live
2026-05-15 · Fri
22:38
30d ago
● P1Hacker News Frontpage· rssEN22:38 · 05·15
Orthrus-Qwen3 achieves 7.8× faster inference tokens per forward pass
Orthrus-Qwen3 claims up to 7.8× tokens per forward on Qwen3 with an identical output distribution; the post does not disclose the mechanism, benchmark conditions, or reproduction steps beyond the GitHub and Hacker News links.
#Inference-opt#Qwen#Orthrus-Qwen3#Open source
why featured
HKR-H/K/R pass on the 7.8× identical-distribution claim, but the body lacks mechanism, benchmark setup, and repro steps. Defaulting below featured keeps it in the 60–71 band.
editor take
An open-source project claims 7.8× faster inference on Qwen3-8B with identical output distribution, but both sources are community posts — no independent reproduction yet.
sharp
This hit both Hacker News front page and r/LocalLLaMA today, which tells you the community is hungry for inference speedups. Orthrus freezes Qwen3-8B's backbone and uses dual-view diffusion decoding to generate multiple tokens per forward pass instead of one-at-a-time autoregression. The 7.8× claim comes from that batching effect, and the output distribution is theoretically identical to the original model. I'd discount this on two fronts. One, we only have a GitHub repo and community chatter — no paper or technical report yet, so the method's edge cases are unknown. Does it hold up on long sequences? What's the memory cost? Two, both sources use nearly identical headlines pulled straight from the README, with no independent benchmarking. If the numbers check out, the real win is no retraining and no quality loss, which matters a lot for local inference. I'm waiting for someone to reproduce it before taking the 7.8× at face value.
HKR breakdown
hook knowledge resonance
open source
88
SCORE
H1·K1·R1
17:56
30d ago
● P1AI HOT (Curated Pool)· aihot-apiZH17:56 · 05·15
Yann LeCun interview: LLM limits, AI's future, and a new startup path
Yann LeCun discussed LLM limitations on the Unsupervised Learning podcast, covering his 2027 forecast, AMI’s bet on world models, his reasons for leaving Meta, and major disagreements with Geoffrey Hinton and Yoshua Bengio over Turing Award-era views.
#Reasoning#Robotics#Safety#Yann LeCun
why featured
HKR-H/K/R all pass: LeCun combines LLM limits, 2027 forecasts, world models, and Meta departure in one interview, matching the 85–94 band for major AGI-timeline commentary.
editor take
LeCun’s world-model bet is coherent, but “PhDs should stop doing LLMs” sounds too clean; LLMs aren’t dead, the obvious LLM work is crowded.
sharp
LeCun’s sharpest move is not another anti-LLM rant; it is tying that critique to AMI’s world-model bet and telling PhD students to stop working on LLMs. The snippet gives hooks: a 2027 forecast, leaving Meta, disputes with Hinton and Bengio, and comparing OpenAI and Anthropic to Sun Microsystems. It gives no architecture, funding, benchmark, or reproducible result. I don’t buy the clean “stop doing LLMs” line. The 2025–2026 gains practitioners felt came from the LLM perimeter: tool use, code execution, long context, agent evals, synthetic data loops. LeCun is right that physical world modeling and robotics need something beyond next-token training. But until AMI shows a repeatable experiment, this is a route declaration, not a death certificate for LLM research.
HKR breakdown
hook knowledge resonance
open source
86
SCORE
H1·K1·R1
16:04
30d ago
● P1Dwarkesh Patel· rssEN16:04 · 05·15
Eric Jang Rebuilds AlphaGo from Scratch with Modern Tools
Eric Jang explains how to build AlphaGo from scratch with modern AI tools, comparing MCTS training targets with credit assignment in LLM reinforcement learning over 100k+ token trajectories.
#Reasoning#Agent#Code#Eric Jang
why featured
HKR-H/K/R all pass: the hook is a modern rebuild of AlphaGo, with concrete MCTS and 100k+ token credit-assignment details. This is a strong technical interview, not a model or product launch, so 78 fits.
editor take
Eric Jang rebuilt AlphaGo from scratch with modern tools. The real insight isn't the rebuild — it's his side-by-side comparison of why MCTS-style RL works for Go but breaks for LLMs, and what that ...
sharp
Eric Jang walked through his from-scratch AlphaGo rebuild on Dwarkesh's podcast. Both sources are Dwarkesh's own content (article plus YouTube), so there's no independent angle here — but the material is Jang's firsthand technical explanation, not a secondhand summary. His core comparison is sharp: AlphaGo uses Monte Carlo Tree Search for self-play, where every move gets a clear "this is better than that" training signal. LLM RL training, by contrast, has to deal with trajectories of 100k+ tokens, and the model has to guess which specific action earned the reward. That's the credit assignment problem, and Jang argues human learning looks more like the former. Current LLM RL is stuck with the latter's inefficiency. He also touched on using LLMs for automated AI research — implementing experiments and tuning hyperparameters works decently, but picking the right research question and escaping dead ends still doesn't. That connects directly to the intelligence explosion debate. I'd treat the automation section as personal experience rather than a systematic evaluation, since he only ran this on one project.
HKR breakdown
hook knowledge resonance
open source
88
SCORE
H1·K1·R1
15:50
30d ago
● P1Bloomberg Technology· rssEN15:50 · 05·15
Apple-OpenAI Partnership Relationship Deteriorates Amid Disputes
Bloomberg says Apple and OpenAI’s two-year partnership has become strained, with OpenAI failing to see expected benefits and preparing possible legal action; the RSS snippet does not disclose the disputed terms or filing timetable.
#Apple#OpenAI#Anurag Rana#Partnership
why featured
Bloomberg reports the Apple-OpenAI alliance is fraying, with possible legal action, so HKR-H/K/R all pass. Missing contract terms and financial detail keep it in the 78-84 band.
editor take
Three outlets are tracking Apple-OpenAI friction; the iPhone AI gatekeeping fight has moved from keynote slides to lawyers, and OpenAI is done playing channel partner.
sharp
Three outlets are tracking the Apple-OpenAI split, with aligned headlines but thin disclosed facts. The available body is only a Bloomberg scrape fragment, so legal claims, contract terms, and damages are not disclosed; FT frames legal action, while TechCrunch frames Apple burning another partner. I read this less as a lawsuit story and more as OpenAI discovering the cost of renting the iPhone AI surface. Apple Intelligence put ChatGPT inside Siri as a distribution win, but the moment Apple can negotiate with Google, Anthropic, or its own models, OpenAI becomes a replaceable backend. For model companies, default placement on-device is harsher than a benchmark loss.
HKR breakdown
hook knowledge resonance
open source
96
SCORE
H1·K1·R1
11:48
30d ago
● P1r/LocalLLaMA· rssEN11:48 · 05·15
Modified RTX 2080 Ti Cards Run Qwen 27B Model at 38 Tokens per Second
A Reddit user ran Qwen3.6 27B on two modified RTX 2080 Ti cards with 22GB VRAM each, using IQ4_XS quantization, f16 KV cache, and tensor split, raising throughput from 14 to 38 token/s under a 150W per-card power limit.
#Inference-opt#Code#Qwen#NVIDIA
why featured
HKR-H/K/R all pass, but this is a single Reddit hardware experiment without full reproducibility, power, or stability data; the first-person numbers lift it, not enough for featured.
editor take
Three LocalLLaMA posts, but the body is a 403. Treat this as Qwen MTP tinkering heat, not verified RTX 3090 performance data.
sharp
All three sources are Reddit LocalLLaMA posts, and their titles cluster around Qwen 27B/122B MTP configs. The article body is only a 403, so no throughput, llama.cpp flags, VRAM use, quant level, or context length is disclosed. That is not media consensus; it is one community stress-testing a runnable setup. My read: useful for practitioners, weak as evidence of model performance. A single RTX 3090 has 24GB VRAM, so Qwen 27B MTP hinges on quantization, KV cache, batch size, and context length. The title only says “Single 3090.” LocalLLaMA often finds usable paths before official docs do, but it also turns one successful boot into a performance claim too easily.
HKR breakdown
hook knowledge resonance
open source
87
SCORE
H1·K1·R1
02:39
31d ago
● P1Bloomberg Technology· rssEN02:39 · 05·15
OpenAI CFO Says Company Faces Computing Shortage, May Seek More Funding
OpenAI CFO Sarah Friar said the company may raise more capital after completing what she described as the largest private fundraising round ever, as OpenAI seeks computing power to meet rising AI demand; the RSS snippet does not disclose the round size, target amount, or timeline.
#Inference-opt#OpenAI#Sarah Friar#Bloomberg
why featured
HKR-H/K/R pass: a named OpenAI CFO links more fundraising to the compute crunch. The score stays in the lower featured band because this is not a closed round and amount, investors, and timing are not disclosed.
editor take
OpenAI's CFO says compute is tight and more fundraising may be needed — this isn't a tech bottleneck, it's costs outpacing revenue.
sharp
This comes from OpenAI CFO Sarah Friar speaking at a Bloomberg event — both Bloomberg channels ran it with near-identical headlines, so the core message is from one official appearance, not a leak. Friar said the company is facing a compute crunch and may need to raise more capital to keep growing. I'd read this as OpenAI managing expectations: they're telling the market upfront that costs aren't slowing down, so don't get comfortable with profitability timelines. No specific numbers yet — we don't know the size of the gap, when they'd raise, or at what valuation. If investment banks start floating valuation ranges in the next few weeks, that's the real signal to watch.
HKR breakdown
hook knowledge resonance
open source
86
SCORE
H1·K1·R1
02:06
31d ago
● P1Synced (机器之心) · WeChat· rssZH02:06 · 05·15
Amazon employees inflate AI tool usage to meet internal quotas
Amazon required more than 80% of developers to use AI tools each week and created an internal token-consumption leaderboard. Employees reportedly used the internal MeshClaw agent to inflate usage, while Amazon has limited visibility of the statistics to each employee and their direct manager.
#Agent#Tools#Safety#Amazon
why featured
HKR-H/K/R all pass: Amazon’s AI-use KPI became token-gaming, with >80% target, leaderboard, MeshClaw, and visibility changes. Impact is workplace-significant, not major-release level, so featured not p1.
editor take
Amazon staff are gaming internal AI usage metrics by running pointless tasks — same old KPI disease, now with an AI wrapper.
sharp
FT and Jiqizhixin both picked this up, but the FT article is behind a paywall — we only have the headline and Jiqizhixin's summary. Both sources agree on the core story: Amazon set internal AI usage targets, and employees responded by running pointless tasks through the tool to inflate their token consumption numbers. The interesting part isn't Amazon specifically — it's the pattern. When companies measure AI adoption by "how many times did you use it" or "how many tokens did you burn," people will find the laziest way to hit the number. Same dynamic as call centers optimizing for call duration or dev teams optimizing for lines of code. The metric drifts from the actual work. What's missing: which specific tool, what the targets were, how many teams were involved. If internal emails or employee interviews surface later, we'll know whether this was a localized workaround or a systemic design flaw in how the targets were set.
HKR breakdown
hook knowledge resonance
open source
86
SCORE
H1·K1·R1
00:23
31d ago
● P1Financial Times · Technology· rssEN00:23 · 05·15
Anthropic raises $30 billion at $900 billion valuation
Anthropic agreed terms for a $30bn funding round at a $900bn valuation, led by Dragoneer, Greenoaks, Sequoia Capital, and Altimeter Capital; the RSS snippet does not disclose deal structure, timing, or investor allocation.
#Anthropic#Dragoneer#Sequoia Capital#Funding
why featured
HKR-H/K/R all pass: FT reports Anthropic agreeing terms for a $30B raise at a $900B valuation. The deal is not closed and disclosed mechanics are thin, so it stays just below the 95+ band.
editor take
Anthropic raising $30B at a $900B pre-money valuation reads less like strength than securitizing future compute burn.
sharp
Two sources converge on a $30B raise and a $900B pre-money valuation; the available body only shows Bloomberg’s headline, while aihot looks like a secondary relay of the same chain. That matters: this is pricing Anthropic as a permanent compute-financing vehicle, not a normal software company. I’m wary of the victory lap here. A $30B round is infrastructure-project scale, far beyond ordinary growth equity. Claude has real developer pull, but the disclosed text gives no revenue, margin, cloud commitment, or investor mix. Compared with OpenAI’s giant compute obligations, this market is no longer valuing model labs on ARR multiples. It is selling access to the next training cluster.
HKR breakdown
hook knowledge resonance
open source
100
SCORE
H1·K1·R1
00:02
31d ago
● P1AI Era (新智元) · WeChat· rssZH00:02 · 05·15
Google DeepMind Releases Gemini-Powered AI-Enabled Pointer Technology
Google DeepMind released a Gemini-powered AI-enabled pointer and opened two demos in Google AI Studio: image editing and place finding on maps, while the post says Chrome pointer selection and a Googlebook Magic Pointer are planned product paths.
#Agent#Multimodal#Tools#Google DeepMind
why featured
HKR-H/K/R all pass: the prompt-free pointer is clickable, the two AI Studio demos add concrete facts, and UI replacement resonates. Scope is still demo-level, with no metrics or API details, so 78 not 85+.
editor take
Three outlets amplified DeepMind’s AI pointer, but the body gives no usable product details; this smells like Google staking an OS-level Gemini entry point.
sharp
Three sources covered DeepMind’s AI pointer, and all orbit the same Gemini-plus-cursor story, suggesting an official-blog source chain. HN keeps it restrained; the Chinese headlines push Hassabis and the “50-year mouse” angle, so the split is tone, not facts. My read: Google is trying to move Gemini out of the chat box and onto the cursor layer. The captured body exposes mostly navigation and the title, with no demo conditions, permission model, latency, API surface, or privacy boundary beyond the publication date. That gap matters. If this cannot read selections, screen state, and act across apps, it is a polished interaction demo. If it can, it becomes an entry-point fight across Android, ChromeOS, Chrome, and Workspace.
HKR breakdown
hook knowledge resonance
open source
90
SCORE
H1·K1·R1
00:00
31d ago
● P1OpenAI Blog· rssEN00:00 · 05·15
OpenAI launches personal finance experience feature in ChatGPT
OpenAI previewed a personal finance experience in ChatGPT for Pro users in the U.S.; it lets users securely connect financial accounts and receive guidance grounded in their financial context, goals, and priorities, but the post does not disclose launch timing, partner institutions, or pricing.
#Tools#OpenAI#ChatGPT#Product update
why featured
HKR-H/K/R all pass: OpenAI is moving ChatGPT into high-sensitivity personal finance. The post lacks launch timing, partners, and pricing, so this stays a mid-weight product update at 77.
editor take
OpenAI just put ChatGPT inside bank-account context; 12,000 institutions is the hook, persistent cash-flow memory is the power grab.
sharp
Three sources followed the same launch, with aligned facts. TechCrunch foregrounded bank-account linking; OpenAI supplied the core numbers: U.S. Pro preview, Plaid, 12,000 institutions, and 200 million monthly finance-related ChatGPT users. That alignment reads like coordinated official rollout, not independent discovery. My take: OpenAI is going after Mint, Credit Karma, and Rocket Money, but with GPT-5.5 plus Financial memories it turns budgeting into a persistent advisory surface. The danger is also obvious. OpenAI says this is not professional financial advice, while ChatGPT reads transactions, subscriptions, portfolio performance, investment risks, and personal goals. A hallucinated meal plan is annoying; a hallucinated allocation call is regulatory shrapnel.
HKR breakdown
hook knowledge resonance
open source
100
SCORE
H1·K1·R1
00:00
31d ago
● P1OpenAI Blog· rssEN00:00 · 05·15
Databricks integrates GPT-5.5 into enterprise agent workflows
Databricks uses GPT-5.5 for enterprise agent workflows after the model reached a new state of the art on the OfficeQA Pro benchmark; the post does not disclose the score, deployment scope, or rollout timeline.
#Agent#Benchmarking#Databricks#OpenAI
why featured
hard-exclusion-pure-marketing applies: the known facts read like a partner/customer use-case for OpenAI. HKR-H and HKR-R pass, but HKR-K lacks scores, scope, and timing, so importance is capped at 39.
editor take
OpenAI's own case study: Databricks integrated GPT-5.5 into enterprise agent workflows, hitting 50%+ on OfficeQA Pro for the first time. No pricing or latency disclosed.
sharp
This is an OpenAI-published customer case study, and both sources covering it are working from the same material — so the alignment isn't independent confirmation, it's a single narrative. The numbers: GPT-5.5 hit over 50% accuracy on Databricks' OfficeQA Pro benchmark, with a 46% error reduction vs GPT-5.4. The benchmark tests parsing, retrieval, and reasoning across scanned PDFs, legacy files, and long-context documents. Databricks' research engineer called it a "step-function lift" in parsing old documents, with fewer unnecessary search detours during multi-step tasks. I'd take the 50% number with some context. It's SOTA, but 50% isn't high in absolute terms — this benchmark is genuinely hard, and enterprise document workflows still have a long way to go before they're hands-off reliable. The bigger gap: no pricing, latency, or concurrency numbers for running GPT-5.5 through Databricks' AI Unity Gateway. Those are the numbers that actually matter for production budgeting.
HKR breakdown
hook knowledge resonance
open source
92
SCORE
H1·K0·R1

more

feeds

admin