All-in-One Development Tool based on PaddlePaddle
-
Updated
Jun 12, 2026 - Python
All-in-One Development Tool based on PaddlePaddle
A Unified Toolkit for Deep Learning Based Document Image Analysis
Open-source batch OCR workbench — a free, local alternative to ABBYY FineReader. Powered by Ollama + GLM-OCR + PP-DocLayoutV3, ~0.5s/page on RTX 4090. Three-panel editor, layout-aware, PDF/image batch processing, Markdown/Word export. 批量OCR工作台,纯本地运行,免费平替ABBYY,适合书籍文档数字化。
[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
A pre labelled dataset for ui element / layout detection
ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents.
中文版面检测(Chinese layout detection),yolov8 is used to detect the layout of Chinese document images。
智能文本自动处理工具(Intelligent text automatic processing tool)。AutoText的功能主要有文本纠错,图片ocr、版面检测以及表格结构识别等。The main functions of this project include text error correction, ocr, layout-detection and table structure recognition.
pdfDet aims to simplify PDF layout detect tasks for users.
Docling plugin to integrate PP-DocLayout-V3 model into docling to enhance layout detection capabilities
Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.
A lightweight, type-safe, PaddlePaddle PP-DocLayoutV3 & V2 implementation in Bun/Node.js for document layout analysis in JavaScript environments.
利用c++加载yolov8模型,进行版面检测。yolov8-c++ is used to detect the layout of Chinese document images
Vision Based Document Layout Detection, Segmentation and context classification using MaskRCNN on Tensorflow-Keras, PyTorch & Detectron2.
Document layout analysis tool for extracting structured information from documents using computer vision
ABBYY FineReader PDF 15 Full Version Download | Unlocked Build | Pre-Activated Setup
A lightweight hybrid system for parsing and digitizing historical newspaper pages
A synchronous Python library that converts an academic-thesis PDF into a structured JSON document plus a tar bundle of cropped figures, tables, and formulas.
Language-agnostic OCR benchmark pipeline to discover document images, review/edit layouts in a web UI, run OCR extraction, and build high-quality evaluation datasets.
Add a description, image, and links to the layout-detection topic page so that developers can more easily learn about it.
To associate your repository with the layout-detection topic, visit your repo's landing page and select "manage topics."