ocr

Projects with this topic

ALEXENDROS.me / DocuMind

DocuMind is an automatic document organization system for the Linux desktop, powered by local AI (Ollama/Llama3 or HuggingFace). It processes PDFs, images, video, audio, and code: it extracts text (with OCR), transcribes, analyzes content, and classifies and archives files according to ISO 15489 (invoices, legal, work, personal, multimedia). It detects duplicates, records an audit trail in SQLite, and prioritizes offline privacy.

Developed in Python 3.10+ with PyMuPDF, Tesseract, Vosk/Whisper, multiprocessing, and optimizations (xxHash, caching, GPU), it demonstrates expertise in integrating local and multimodal LLMs, parallel processing, and scalable modular architecture, and is evolving toward a PyQt6 GUI with drag-and-drop, full-text search, and RPM/Flatpak packaging.

Linux Python local-ai Document-Man... ollama ocr multimedia-p... desktop-app SQLite offline-ai automation pyqt6

0

Updated Mar 23, 2026

0 0 0 0


Jones Johnsson / cloudboys-portfolio

Cloud-native data engineering + ML POC: ingest Reddit images, run OCR, store results in BigQuery/Cloud Storage, and serve analytics via FastAPI + a React dashboard.

Data Enginee... gcp fastapi ocr Python React Docker Git devops Markdown

0

Updated Jan 25, 2026

0 0 0 0


jochre / jochre3-ocr

Jochre3 OCR engine with default implementation for Yiddish - completely new version of https://github.com/urieli/jochre

ocr yiddish

0

Updated Dec 07, 2025

0 1 1 1


outsource / exam-analysis

Python ocr

0

Updated Sep 27, 2024

0 0 0 0


carloj / AI-Lang

Packaging and improving Linux tools for preprocessing images and helping the scanning and digitizing AI ...

2023 AI-Languages updates for Sino-Korean and Tibetan (following 2020-21 work on Ukraine reports and Arabic texts)

AI ML language ocr Tesseract OC... tesseract

0

Updated Nov 01, 2023

0 0 29 29


encircle360-oss / docsrabbit / docsrabbit

A microservice that renders templates the way you want, scans and OCRs documents of many standard file types, converts documents easily, and creates thumbnails the way you want them.

pdf document generation API templating templates engine xls excel HTML ocr document2text

3

Updated Jul 06, 2023

3 1 0 3


Christos Angelopoulos / sudoku-solver-ocr

A Bash script and a C program that solve sudoku puzzles from a selected PNG file.

C Bash ocr sudoku

0

Updated Jul 03, 2023

0 0 0 0
