TestBuddy - Professional Screenshot to Text Converter

TestBuddy is a production-grade OCR application for Windows that converts screenshots directly into editable text. Built with PyQt6 and Tesseract OCR, it provides a seamless workflow for extracting text from images.

Features

Core Functionality

📸 Quick Screenshot Capture — Ctrl+Shift+S to launch screenshot tool
🔤 Accurate OCR — Uses industry-standard Tesseract OCR engine
📋 Easy Clipboard Integration — Ctrl+C to copy extracted text
🔧 Editable Output — Modify OCR results in the built-in editor
💾 Session Export — Save extracted text to file

Advanced Capabilities

Multi-Language Support — Configure OCR for 100+ languages
Persistent History — Automatic session tracking with full-text search
Configuration System — Customize all settings via INI file
Keyboard Shortcuts — Full keyboard support for power users
Activity Logging — Debug console with detailed operation logs
Type-Safe Codebase — Full Python type hints for maintainability

System Requirements

OS: Windows 10/11
Python: 3.8+
Tesseract-OCR: Required (see Installation)

Installation

1. Install Tesseract-OCR

Windows:

Download installer: https://github.com/UB-Mannheim/tesseract/wiki
Run: tesseract-ocr-w64-setup-v5.x.exe
Default path: C:\Program Files\Tesseract-OCR\tesseract.exe

2. Clone/Setup TestBuddy

cd c:\Users\idavi\Documents\Projects\testbuddy
pip install -r requirements.txt

3. Run Application

python main.py

First Run: The app creates testbuddy.ini with default settings.

Configuration

Edit testbuddy.ini to customize behavior:

[tesseract]
path = C:\Program Files\Tesseract-OCR\tesseract.exe
language = eng                    # ISO 639-3 code (eng, fra, deu, jpn, etc.)
psm = 6                           # Page Segmentation Mode
oem = 3                           # OCR Engine Mode

[ui]
window_width = 900
window_height = 600
window_always_on_top = True
theme = light                     # (light/dark planned)
splitter_ratio = 600

[behavior]
clipboard_poll_interval_ms = 500
log_buffer_size = 100
auto_copy_on_ocr = False

[history]
enable_history = True
max_entries = 100
file = testbuddy_history.json

[export]
format = txt
directory = exports

[logging]
file = testbuddy_debug.log
debug_mode = False

Usage

Basic Workflow

Click camera button (📷) or press Ctrl+Shift+S
Windows Snipping Tool opens — capture text area
OCR processes automatically when clipboard updated
Edit text in the editor if needed
Copy result — Ctrl+C or click "COPY TEXT" button
Export session — Save to file with button

Keyboard Shortcuts

Shortcut	Action
`Ctrl+Shift+S`	Take screenshot
`Ctrl+C`	Copy extracted text

History & Sessions

History Button: View last 10 OCR sessions with previews
Auto-Saved: All results saved to testbuddy_history.json
Search Ready: History entries can be searched programmatically

File Structure

testbuddy/
├── main.py                 # Main application
├── config.py              # Configuration management
├── history.py             # Session history & persistence
├── requirements.txt       # Python dependencies
├── testbuddy.ini         # Settings (generated)
├── testbuddy_history.json # History log (generated)
├── testbuddy_debug.log   # Activity log (generated)
├── exports/              # Exported sessions (generated)
└── README.md

Roadmap

Phase 2 (Features)

Phase 3 (Advanced)

Batch processing (multiple images)
CSV/JSON export formats
OCR result corrections & training
System tray integration

Phase 4 (Professional)

Unit tests & CI/CD
Packaging (setup.py, pyproject.toml)
Auto-updater
Windows installer (.msi)

Architecture

Module: `config.py`

ConfigManager: Loads/saves INI configuration
Config: Data class with all settings
Validates all settings and provides sensible defaults

Module: `history.py`

HistoryManager: Persistent JSON-based history
HistoryEntry: OCR session metadata & full text
Features: Add, search, delete, export statistics

Module: `main.py`

OCRWorker: QThread subclass for non-blocking OCR
SnapOCRApp: Main PyQt6 window
Integrates config, history, and all UI components

Troubleshooting

Tesseract Not Found

Error: File not found: C:\Program Files\Tesseract-OCR\tesseract.exe

Fix: Update tesseract.path in testbuddy.ini to your actual Tesseract location.

OCR Produces Empty Output

Check testbuddy_debug.log for errors
Try with simpler text image
Different PSM modes: 6 (uniform), 3 (auto), 11 (sparse)
Configure language: language = eng+fra for English+French

History File Corrupted

rm testbuddy_history.json  # Delete and restart app

Slow Clipboard Polling

Increase clipboard_poll_interval_ms in config (500ms default).

Development

Adding Type Hints

All functions are type-hinted for IDE support and debugging:

def process_image(image: Image.Image, language: str) -> str:
    """Process image and extract text."""

Extending History

from history import HistoryManager

history = HistoryManager()
history.add_entry("Extracted text", language="eng", tags=["important"])
results = history.search("keyword")

Custom OCR Config

Edit tesseract section in config.py:

config.ocr_psm = 6  # Layout-aware
config.ocr_oem = 3  # Use both legacy & LSTM engines

Performance Metrics

OCR Time: 1-3 seconds depending on image complexity
Memory Usage: ~150MB idle, ~300MB during OCR
History Storage: ~2KB per entry

License

[Add your license here]

Contributing

Contributions welcome! Areas for help:

Support

For issues, check:

testbuddy_debug.log for error details
Tesseract documentation: https://github.com/UB-Mannheim/tesseract/wiki
PyQt6 issues: https://www.riverbankcomputing.com/software/pyqt/

Version: 1.0.0
Last Updated: December 2025
Status: Production Ready (Phase 1 Complete)

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github		.github
__pycache__		__pycache__
build/main		build/main
dist/main		dist/main
BUILD_COMPLETE.md		BUILD_COMPLETE.md
COMPLETE_SUMMARY.md		COMPLETE_SUMMARY.md
CONFIGURATION.md		CONFIGURATION.md
DEVELOPMENT.md		DEVELOPMENT.md
INTEGRATION_SUMMARY.md		INTEGRATION_SUMMARY.md
LICENSE.txt		LICENSE.txt
PHASE1_SUMMARY.md		PHASE1_SUMMARY.md
PHASE2B_COMPLETE.md		PHASE2B_COMPLETE.md
PHASE2C_COMPLETE.md		PHASE2C_COMPLETE.md
PHASE2C_SUMMARY.md		PHASE2C_SUMMARY.md
PHASE2_INTEGRATION.md		PHASE2_INTEGRATION.md
PHASE2_SPEC_COMPLETION.md		PHASE2_SPEC_COMPLETION.md
PROJECT_COMPLETE.md		PROJECT_COMPLETE.md
QUICKSTART.md		QUICKSTART.md
QUICKSTART_V2.md		QUICKSTART_V2.md
README.md		README.md
README_DISTRIBUTION.md		README_DISTRIBUTION.md
README_V2.md		README_V2.md
app.py		app.py
build_windows.bat		build_windows.bat
build_windows.ps1		build_windows.ps1
config.py		config.py
document_intelligence.py		document_intelligence.py
export.py		export.py
history.py		history.py
icon.ico		icon.ico
main.py		main.py
requirements.txt		requirements.txt
run.py		run.py
snapocr_debug.log		snapocr_debug.log
test_phase2b.py		test_phase2b.py
test_phase2c.py		test_phase2c.py
test_suite.py		test_suite.py
testbuddy.ini		testbuddy.ini
testbuddy_debug.log		testbuddy_debug.log
testbuddy_history.json		testbuddy_history.json
testbuddy_installer.nsi		testbuddy_installer.nsi
ui_skeleton.py		ui_skeleton.py
undo_redo.py		undo_redo.py
validate.py		validate.py

Folders and files

Latest commit

History

Repository files navigation

TestBuddy - Professional Screenshot to Text Converter

Features

Core Functionality

Advanced Capabilities

System Requirements

Installation

1. Install Tesseract-OCR

2. Clone/Setup TestBuddy

3. Run Application

Configuration

Usage

Basic Workflow

Keyboard Shortcuts

History & Sessions

File Structure

Roadmap

Phase 2 (Features)

Phase 3 (Advanced)

Phase 4 (Professional)

Architecture

Module: config.py

Module: history.py

Module: main.py

Troubleshooting

Tesseract Not Found

OCR Produces Empty Output

History File Corrupted

Slow Clipboard Polling

Development

Adding Type Hints

Extending History

Custom OCR Config

Performance Metrics

License

Contributing

Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Module: `config.py`

Module: `history.py`

Module: `main.py`

Packages