sharp
Univer says its spreadsheet engine ranked No. 1 on SpreadsheetBench in December 2025 with 68.86%, but this reads like an infrastructure bet, not a breakout product signal. The company is betting that spreadsheets should be decomposed into an embeddable, callable, verifiable compute layer for agents. I buy that direction. I do not buy all of the surrounding narrative yet.
Here is the part I think is solid. Spreadsheets are still the most underrated execution environment inside companies. Finance, ops, supply chain, pricing, sales analysis — a lot of it still ends in cells, formulas, pivots, imports, exports, and manual cleanup. Anyone who has worked on enterprise automation has seen the same pattern: systems of record store the data, but exploratory computation escapes back into Excel. Univer’s move to package that layer as an SDK, then split UI from pure compute logic, is a serious engineering choice. It is much stronger than bolting a chat sidebar onto a sheet. If an agent can access formula dependencies, named ranges, filters, merged cells, and structural metadata directly, that is a much better substrate than asking a general model to “read” a messy workbook as flat text.
This is also where the broader market has been heading, just less aggressively. Microsoft has been pushing Copilot deeper into Excel workflows since 2024. Google has been doing variants of analysis and generation inside Sheets. Airtable, Coda, and Notion went after the upper layer: collaboration, database abstractions, and AI assistance on top. Most of those products still assume a human stays in the driver seat. Univer is pitching a different split: the agent uses the tool, the human reviews the result. That maps well to what worked in coding agents: generate actions, run them in an executable environment, verify, iterate, then return the artifact.
Still, I think the “spreadsheets are the next coding” pitch is ahead of the evidence. Coding has compilers, tests, linting, CI, and cleaner reward signals. Spreadsheet environments do have formulas and dependency graphs, so verification is possible, but business spreadsheets are much dirtier than code. Hidden columns, weird formatting, cross-sheet references, manual overrides, locale issues, copied formulas with silent breakage — those details kill reliability. You cannot just copy the coding-agent playbook into spreadsheet automation and expect the same curve.
I also have doubts about the benchmark framing. The article gives the 68.86% SpreadsheetBench score and says Univer beat ChatGPT Agent and Excel Copilot, but it does not disclose the task mix, competitor versions, tool constraints, or how much human correction was allowed. Without those conditions, the score tells me only that Univer performed well on that benchmark. It does not prove superior performance in actual enterprise spreadsheet work. We have seen this pattern all year: if the task set is structured, the environment is fixed, and the system can retry with tools, specialized agent stacks often beat general-purpose assistants. Once you move into live company files, the picture changes fast. The piece says files above 10MB are handled more accurately, but it gives no error rate, latency, cost, or failure-case data. That is a major omission.
The “formulas are Turing-complete” line is technically true and strategically slippery. Turing-complete does not mean a good operational substrate for business automation. Excel can express a lot, but enterprise pain is usually not “can this be computed.” It is version control, permissions, auditability, replay, exception handling, and accountability. If an agent cleans and analyzes a workbook automatically, who signs off on the result, who can reproduce the prior state, and who can explain which hidden sheet fed the conclusion? Those are procurement questions, not research questions. The article mentions collaborative engines and multi-agent operation on the same sheet, which is promising, but it says nothing about audit trails, permission models, rollback semantics, or governance. For customers like Novartis or Samsung, getting a POC is one thing. Surviving deployment standards is another.
On commercialization, the SDK route makes more sense to me than launching another standalone SaaS front end. Enterprises already have OA, ERP, BI, CRM, and internal portals fighting for screen space and workflow control. A spreadsheet engine that embeds into existing systems has a cleaner path than asking users to adopt one more app. The founding team also has real credibility here. A former Feishu spreadsheet lead is not just good at charts and pivots; that background usually means deep exposure to the hard parts: Open XML compatibility, formula engines, rendering, collaboration, and feature isolation. Luckysheet’s 16,000+ GitHub stars suggest this is not a deck-first startup.
That said, component businesses are brutal. “100+ plugins” sounds comprehensive, but it also signals long-term maintenance burden. Spreadsheet infrastructure customers expect Excel-grade compatibility, low-latency interaction, cross-device consistency, and no surprises with old files. Getting to 80 is fast. Getting to 95 is where teams get trapped. There are already entrenched players in this layer, and Microsoft can always pull more capability into Excel plus Copilot. If Univer wants “AI-native” to be the wedge, it has to show two concrete advantages: how much agent task success improves versus using Excel or Python directly, and how much unit economics improve per workflow. The article does not give either.
I am also not ready to accept the “2026 aha moment” line. Spreadsheets absolutely are a huge market, probably with user counts in the billion range monthly. The question is pace. Coding agents expanded quickly because software teams already tolerate automation and have explicit acceptance mechanisms. Spreadsheet users are more fragmented, skill levels vary more, and their work sits inside approval chains, compliance rules, and ownership disputes. I expect growth, but not a clean one- or two-year convergence around a single pattern. More likely, this gets adopted task by task: reconciliations, sales ops analysis, procurement comparisons, web-to-table extraction, contract clause structuring. Whoever gets those workflows stable and reproducible earns the right to talk about a new enterprise compute layer.
So my read is favorable, with brakes on the hype. Headless spreadsheets for agents is a serious idea. Direct access to formula dependencies and the compute layer has real engineering value. But the funding amount is undisclosed, the benchmark conditions are undisclosed, and the article gives no retention, expansion, or deployment metrics. For now, Univer looks like a credible candidate for an agent-oriented spreadsheet runtime, not a proven new platform. To move me from interested to convinced, I want three things next: production task success rates, audit and rollback design for complex workbooks, and a clear cost curve against Excel- or Python-based workflows.