BigOCR PDF is a utility integrated into the Linux desktop environment (optimized for BigLinux) that adds Optical Character Recognition (OCR) to PDF documents and image files. It converts scanned documents into searchable PDFs and extracts text from images or screen regions.
- Make PDFs Searchable: Convert scanned non-searchable PDFs into files where you can search, select, and copy text.
- Image OCR: Extract text directly from standard image files (JPG, PNG, etc.).
- Screen Capture Integration: Extract text from anywhere on your screen by selecting a rectangular region, which is useful for grabbing text from videos, protected websites, or UI elements.
- Batch Processing: Efficiently process multiple files at once directly from your file manager.
Scanned PDFs often lack a text layer. To fix this:
- Open your file manager.
- Select one or more PDF files.
- Right-click and select the "OCR" option.
- A new, searchable version of the file will be generated.
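For scripted or batch use outside the file manager, a similar result can be approximated by calling the OCRmyPDF command-line tool, which the dependency list names as the core OCR engine. The sketch below is an assumption about how such a call might look, not BigOCR PDF's actual implementation: the helper names `build_ocr_command` and `ocr_pdfs`, the `_ocr` output suffix, and the default language are all hypothetical.

```python
import subprocess
from pathlib import Path


def build_ocr_command(src: Path, language: str = "eng") -> list[str]:
    """Build an ocrmypdf command that writes '<name>_ocr.pdf' next to the input."""
    dst = src.with_name(f"{src.stem}_ocr.pdf")  # hypothetical naming scheme
    return [
        "ocrmypdf",
        "--skip-text",   # leave pages that already have a text layer untouched
        "-l", language,  # Tesseract language code, e.g. "eng" or "por"
        str(src),
        str(dst),
    ]


def ocr_pdfs(paths: list[str], language: str = "eng") -> list[Path]:
    """Run OCR on each PDF in turn and return the paths of the new files."""
    outputs = []
    for path in paths:
        cmd = build_ocr_command(Path(path), language)
        subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
        outputs.append(Path(cmd[-1]))
    return outputs
```

Calling `ocr_pdfs(["scan1.pdf", "scan2.pdf"])` would process the files one after another and leave the originals unchanged, since each result is written to a separate file.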
To extract text from an image file:
- Right-click on any image file.
- Select "Extract text from image (OCR)".
- The extracted text will be available for use.
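The image steps above can also be approximated from a script by calling the Tesseract command-line tool directly. This is a minimal sketch under stated assumptions, not the code behind the menu entry: `build_tesseract_command` and `extract_text` are hypothetical helper names, and the `eng` default assumes the English language data is installed.

```python
import subprocess


def build_tesseract_command(image_path: str, language: str = "eng") -> list[str]:
    """Build a tesseract call that prints recognized text to standard output."""
    # Passing "stdout" as the output base makes tesseract print the text
    # instead of writing a .txt file.
    return ["tesseract", image_path, "stdout", "-l", language]


def extract_text(image_path: str, language: str = "eng") -> str:
    """Run tesseract on one image and return the recognized text."""
    result = subprocess.run(
        build_tesseract_command(image_path, language),
        capture_output=True,  # collect stdout and stderr instead of printing them
        text=True,            # decode the output as str rather than bytes
        check=True,           # raise CalledProcessError if tesseract fails
    )
    return result.stdout.strip()
```

For example, `extract_text("screenshot.png")` would return the recognized text as a string, which could then be copied to the clipboard or saved to a file.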
For text that cannot be selected normally (e.g., inside a video or image on a website):
- Launch your screenshot tool (e.g., press Print Screen).
- Select the "Rectangular Region" tool.
- Highlight the area containing the text you want to copy.
- Click "Export" and choose "Extract text from image (OCR)".
Ensure you have the following system dependencies installed:
- Python 3.10 or higher
- GTK4 and Libadwaita
- OCRmyPDF (the core OCR engine)
- Tesseract OCR
- Ghostscript
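A quick way to see which of these dependencies are present is a small check script. The sketch below is illustrative only: the executable names (`ocrmypdf`, `tesseract`, `gs`) are assumptions about how the packages are installed on your system, `missing_dependencies` is a hypothetical helper, and GTK4 and Libadwaita are not checked here because detecting them depends on your distribution's Python bindings.

```python
import shutil
import sys

# Executable names are assumptions; some distributions may name them differently.
REQUIRED_TOOLS = {
    "OCRmyPDF": "ocrmypdf",
    "Tesseract OCR": "tesseract",
    "Ghostscript": "gs",
}


def missing_dependencies(which=shutil.which, version=sys.version_info) -> list[str]:
    """Return human-readable names of the dependencies that were not found."""
    missing = []
    if tuple(version[:2]) < (3, 10):
        missing.append("Python 3.10 or higher")
    for label, executable in REQUIRED_TOOLS.items():
        if which(executable) is None:  # not found on PATH
            missing.append(label)
    return missing
```

Running `print(missing_dependencies())` prints an empty list when everything the script checks is available. The `which` and `version` parameters exist only so the function can be exercised without touching the real system.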
To install the latest version from the repository:
# Clone the repository
git clone https://github.com/biglinux/bigocrpdf.git
cd bigocrpdf
# Install the package
pip install .