r/LocalLLaMA · June 30, 2026 · 1 min read

TurboOCR v3 — high-speed document OCR server (C++/CUDA), ~520 img/s on RTX 5090

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

TurboOCR is a self-hosted, high-speed document OCR server, runs fully local. Here's What's New in v3:

Speed:

Full pipeline now on the newest PP-OCRv6 models (up from v5): ~270 → ~520 img/s on FUNSD (v6 tiny, RTX 5090).
Still fully local, HTTP + gRPC.

Structured parsing (the main addition):

End-to-end now: layout → tables to HTML → formulas to LaTeX → reading-order Markdown.
Tables and formulas are strict per-request opt-in, so you only pay the cost when you actually need them.

Stack: C++, TensorRT FP16, multi-stream, gRPC/HTTP, direct PDF endpoint, PP-OCRv6.

Discussion (0)

No comments yet. Sign in and be the first to say something.