PaddleOCR Multilingual Document OCR and Structured Data Toolkit

PaddleOCR is a powerful, lightweight OCR toolkit developed by Baidu that converts documents and images into structured, AI-friendly data like JSON and Markdown. It supports 100+ languages with industry-leading accuracy, bridging the gap between images/PDFs and LLMs.

Installation

Requirements and caveats from upstream:

Comprehensive upgrade of the PP-OCRv5 C++ local deployment solution, now supporting both Linux and Windows, with feature parity and identical accuracy to the Python implementation.
The high-stability service-oriented deployment solution is now fully open-sourced, allowing users to customize Docker images and SDKs as required.

Basic usage or getting-started notes:

Documentation has been updated to include key metrics for commonly used configurations on mainstream hardware, such as inference latency and memory usage, providing deployment references for users.
🚀 Quick Start
For local usage, please refer to the following documentation based on your needs:
Source: https://github.com/PaddlePaddle/PaddleOCR
Extracted from upstream docs: https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/HEAD/README.md

PaddleOCR Multilingual Document OCR and Structured Data Toolkit

PaddleOCR Multilingual Document OCR and Structured Data Toolkit

Installation

🚀 Quick Start