FEATURED · r/LocalLLaMA · rss · EN · 13:14 · 05·25
→NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction
NuMind released NuExtract3, a 4B open-weight VLM based on Qwen3.5-4B and licensed under Apache-2.0. It supports image and text to Markdown conversion, OCR, and JSON-template extraction. Self-hosting starts at 4GB VRAM, and weights ship in Safetensors, GGUF, and MLX formats.
#Multimodal #Vision #Tools #Numind
why featured
HKR (hook, knowledge, resonance) all pass: NuExtract3 packages OCR, Markdown, and structured extraction into a 4B open-weight VLM that can be self-hosted from 4GB VRAM. The limited reach of the source and the lab keeps it in the low featured band.
editor take
Only the summary was available to judge from: NuExtract3 fits OCR, Markdown, and JSON extraction into 4GB VRAM, which is a better job for a local model than being another chatty 4B.
sharp
NuExtract3’s useful claim is not “a small 4B model”; it is a self-hosted Apache-2.0 document component. The summary gives three hard hooks: Qwen3.5-4B as the base, Safetensors/GGUF/MLX weights, and a 4GB VRAM floor. The task scope is also tight: image/text to Markdown, OCR, and JSON-template extraction.
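To make "JSON-template extraction" concrete, here is a minimal sketch of how a pipeline might check a model's raw output against a template before it enters a structured system. The template shape (field names mapped to "string" or "number"), the field names, and the helper names `conforms` and `parse_and_check` are all hypothetical; the summary does not describe NuExtract3's actual template format.

```python
import json

# Hypothetical template: keys name the fields to extract, values name the
# expected type. This shape is an assumption for illustration only, not
# NuExtract3's documented format.
TEMPLATE = {
    "invoice_number": "string",
    "total": "number",
    "line_items": [{"description": "string", "amount": "number"}],
}


def conforms(value, template):
    """Return True if value has the template's shape.

    Leaf fields may be None, standing in for "not found in the document".
    """
    if isinstance(template, dict):
        return (
            isinstance(value, dict)
            and set(value) == set(template)
            and all(conforms(value[key], template[key]) for key in template)
        )
    if isinstance(template, list):
        # A list template holds one element describing every item.
        return isinstance(value, list) and all(
            conforms(item, template[0]) for item in value
        )
    if value is None:
        return True
    if template == "string":
        return isinstance(value, str)
    if template == "number":
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(value, (int, float)) and not isinstance(value, bool)
    return False


def parse_and_check(raw_output, template=TEMPLATE):
    """Parse model output as JSON; return the data, or None if it is
    malformed or does not match the template."""
    try:
        data = json.loads(raw_output)
    except json.JSONDecodeError:
        return None
    return data if conforms(data, template) else None
```

A gate like this sits after whatever inference call produces `raw_output`; rejecting malformed output at this point keeps bad extractions out of downstream systems regardless of which model produced them.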
I buy the direction. Teams need invoices, tables, and scans entering structured systems without a closed API hop more than they need another local chatbot. Docling, PaddleOCR, and Tesseract already cover pieces of this, but a VLM that unifies Markdown and schema extraction is cleaner for workflow owners. The Reddit post body returned a 403, so benchmarks, language coverage, and table accuracy could not be checked. “Runs on 4GB” is not the same as production throughput.
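For a sense of what a 4GB floor implies, here is a back-of-envelope estimate of weight memory alone. It assumes exactly 4e9 parameters and ignores quantization metadata, the KV cache, and image activations; the summary does not say which precision the 4GB figure refers to.

```python
# Assumption: exactly 4e9 parameters; the real count for a "4B" model may differ.
PARAMS = 4_000_000_000

# Bytes per weight for common storage precisions (quantization metadata ignored).
BYTES_PER_WEIGHT = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_gb(params, bytes_per_weight):
    """Weight memory in GB (10**9 bytes), counting weights only."""
    return params * bytes_per_weight / 1e9


for name, size in BYTES_PER_WEIGHT.items():
    print(f"{name}: {weight_gb(PARAMS, size):.1f} GB")
# fp16: 8.0 GB
# int8: 4.0 GB
# int4: 2.0 GB
```

Under these assumptions, only a roughly 4-bit format leaves headroom inside 4GB for anything besides the weights, which is one reason fitting in memory says little about throughput or accuracy.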
HKR breakdown
hook ✓ · knowledge ✓ · resonance ✓