OCR, granite-docling-258m vs granite-docling-2stage-258m: has anyone actually noticed any improvements?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Granite Docling 2stage builds upon Granite Docling but introduces a key modification: it builds a dynamic prompt that precomputes the layout objects found within a page, making it more robust on out-of-distribution data.
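To make the two-stage idea concrete, here is a rough sketch of what "a dynamic prompt built from precomputed layout objects" could look like. Everything in it is an assumption for illustration: `LayoutObject`, `build_dynamic_prompt`, the base instruction string, and the prompt layout are hypothetical and are not taken from the model's actual implementation or prompt format.

```python
from dataclasses import dataclass


@dataclass
class LayoutObject:
    """Hypothetical stage-1 output: one detected region on the page."""
    label: str                       # e.g. "title", "text", "table"
    bbox: tuple[int, int, int, int]  # (x0, y0, x1, y1) in page pixels


# Placeholder instruction; the real model's prompt wording may differ.
BASE_INSTRUCTION = "Convert this page to docling."


def build_dynamic_prompt(objects: list[LayoutObject]) -> str:
    """Build the stage-2 prompt from the regions found in stage 1."""
    if not objects:
        # Nothing detected: fall back to the static prompt.
        return BASE_INSTRUCTION
    # Rough reading order: top-to-bottom, then left-to-right.
    ordered = sorted(objects, key=lambda o: (o.bbox[1], o.bbox[0]))
    lines = [
        f"{o.label} at {o.bbox[0]},{o.bbox[1]},{o.bbox[2]},{o.bbox[3]}"
        for o in ordered
    ]
    return BASE_INSTRUCTION + "\n" + "\n".join(lines)
```

The point of the sketch is only that the second stage sees a page-specific prompt rather than a fixed one, which is plausibly where the claimed robustness on unfamiliar layouts would come from.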
What do you think?