3 projects
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
9,001
481
$16M
Tesseract.js
Tesseract.js is a JavaScript library that enables optical character recognition (OCR) in browsers and Node.js. It provides a pure JavaScript port of the Tesseract OCR engine, allowing developers to extract text from images directly in web applications without server-side processing.
1,207
275
$105K
OSS Document Scanner
An open-source document scanner application that enables capturing, processing and digitizing physical documents using a mobile device's camera. It provides features for automatic edge detection, perspective correction, and image enhancement to create high-quality scanned documents.
478
47
$4.5M