numind/NuExtract3 · Hugging Face
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| NuExtract3 is a unified 4B vision-language reasoning model for document understanding. It combines strong structured information extraction with high-quality image-to-Markdown conversion, making it suitable for extraction pipelines, OCR, and RAG preprocessing for all types of documents such as scans, receipts, forms, invoices, contracts or tables. Overview
GGUF, NVFP4, MLX, VLLM, etc., already there https://huggingface.co/models?other=base_model:quantized:numind/NuExtract3 [link] [comments] |
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.