r/LocalLLaMA · June 24, 2026 · 1 min read

Unlimited-OCR is now on ModelScope! A 3.3B multilingual OCR model for one-shot parsing across single images, multi-page documents, and PDFs. License: MIT

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Full-document parsing instead of cropped-region OCR

32K output length for long OCR sequences

Base and gundam image modes for different document layouts

Transformers inference + SGLang serving with OpenAI-compatible streaming requests

Built to push DeepSeek-OCR-style document parsing further.

Discussion (0)

No comments yet. Sign in and be the first to say something.