FEATUREDHacker News Frontpage· rssEN15:37 · 05·17
→Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep
MinishLab open-sourced Semble, a code-search tool for agents that combines Model2Vec embeddings, BM25, RRF fusion, and reranking; on a 63-repo benchmark, it used 98% fewer tokens than grep+read, reached 0.854 NDCG@10, and ran CPU queries in about 1.5 ms.
#Agent#Code#Embedding#MinishLab
why featured
HKR-H/K/R all pass: the 98% token claim is clickworthy, the 63-repo benchmark adds substance, and coding-agent context cost is a real practitioner nerve. Impact is still toolchain-level, so it stays below must-write.
editor take
Semble pulls agent code search back from context stuffing to IR; 98% token savings is sharp, but grep+read is a soft target.
sharp
Semble matters because it attacks the boring cost center in coding agents: context waste. On a 63-repo benchmark, it claims 98% fewer tokens than grep+read, 0.854 NDCG@10, and roughly 1.5 ms CPU queries. The stack is not magic: Model2Vec embeddings, BM25, RRF fusion, then reranking.
I don’t buy grep+read as the serious opponent. Cursor, Claude Code, and Sourcegraph Cody have moved past naked grep into repo maps, AST-ish indexes, and symbol search. Still, the direction is right. Coding agents fail less from “not enough intelligence” than from retrieving 40 bad chunks and spending the next two calls laundering that noise.
HKR breakdown
hook ✓knowledge ✓resonance ✓