Nvidia LocateAnything - Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding. (10x faster than Qwen3-VL)
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
submitted by /u/Sporeboss
[link] [comments]
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.