📄 PDF OCR Extractor

Extract text from PDF documents using PaddleOCR + PyMuPDF

💡 Tip: This tool processes PDFs by rendering each page as a high-resolution image (300 DPI) and then applying OCR. For best results, use clear, well-scanned PDFs with good contrast.
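
As a rough illustration of the rendering step, here is a minimal sketch of how a page could be rasterized at 300 DPI with PyMuPDF. It is not necessarily how this tool does it: the function names (zoom_for_dpi, rendered_size, render_pages) are made up for the example. PDF page sizes are measured in points (72 per inch), so 300 DPI corresponds to a scale factor of 300 / 72. The OCR call itself is left out because its exact form depends on the installed PaddleOCR version.

```python
PDF_POINTS_PER_INCH = 72  # PDF page dimensions are expressed in points


def zoom_for_dpi(dpi):
    """Scale factor that converts PDF points to pixels at the given DPI."""
    return dpi / PDF_POINTS_PER_INCH


def rendered_size(width_pt, height_pt, dpi=300):
    """Pixel dimensions of a page of the given point size rendered at dpi."""
    zoom = zoom_for_dpi(dpi)
    return round(width_pt * zoom), round(height_pt * zoom)


def render_pages(pdf_path, dpi=300):
    """Yield one PNG byte string per page (hypothetical helper)."""
    # PyMuPDF is imported here so the helpers above work without it.
    import fitz

    zoom = zoom_for_dpi(dpi)
    with fitz.open(pdf_path) as doc:
        for page in doc:
            # The matrix scales the page from 72 DPI up to the target DPI.
            pix = page.get_pixmap(matrix=fitz.Matrix(zoom, zoom))
            yield pix.tobytes("png")


# A US Letter page (612 x 792 points) at 300 DPI:
print(rendered_size(612, 792))  # (2550, 3300)
```

Each PNG produced by render_pages could then be handed to the OCR engine. Note that image size grows with the square of the DPI, so large PDFs take noticeably longer at 300 DPI than at lower resolutions.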

🔗 API Usage

Endpoint: /predict

Input: PDF file

Output: Extracted text
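
A possible way to call the endpoint from Python is sketched below. It assumes the app is served by Gradio and that the gradio_client package is installed; neither is confirmed here. The server address, the file name, and the helper names extract_text and demo are placeholders for illustration only.

```python
def extract_text(client, pdf_file):
    """Send one PDF to the /predict endpoint and return its reply."""
    # `client` is anything with a Gradio-style predict() method.
    return client.predict(pdf_file, api_name="/predict")


def demo(server_url="http://localhost:7860", pdf_path="scan.pdf"):
    """Example call; the URL and file name are hypothetical."""
    # Assumption: the app is a Gradio app reachable at server_url.
    from gradio_client import Client, handle_file

    client = Client(server_url)
    return extract_text(client, handle_file(pdf_path))
```

Calling demo() with the real address of the running app should return the extracted text, if the assumptions above hold.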