微信气泡检测文字提取 · WeChat Bubble YOLO

有问题可以加我qq:`174335601`

环境

uv sync

文本提取需额外安装 OCR（按需）：

uv pip install rapidocr-onnxruntime onnxruntime
# GPU：uv pip install onnxruntime-gpu

0 · 截图

uv run python tools/wechat_window_loop_ocr.py --interval-ms 1000

1 · 标注（二选一）

网页标注 — 启动后打开 http://127.0.0.1:8810/

uv run python tools/fount/server.py

半自动标注

uv run python tools/label_review.py --weights runs/detect/wechat_bubbles-7/weights/best.pt

2 · 划分训练 / 验证集

uv run python tools/split_dataset.py --train-ratio 0.8

3 · 训练（或二次训练）

uv run python train_bubbles.py --model runs/detect/wechat_bubbles-7/weights/best.pt

不指定 --model 则从头训练新模型。

4 · 推理

uv run python predict_bubbles.py --weights runs/detect/wechat_bubbles-7/weights/best.pt --source test

5 · 文本提取

uv run python text_process.py

输出目录

目录	说明
`before_img/`	截图原图
`img/`、`labels/`	标注图片与 YOLO 标签
`dataset/`	训练 / 验证集
`runs/detect/.../weights/`	模型权重
`predict_out/`	推理结果
`text_json/`、`text_img/`	OCR 文本输出

技术栈

类别	技术
语言 / 环境	Python 3.11+、uv
深度学习	PyTorch、TorchVision、Ultralytics YOLOv8
图像	OpenCV、Pillow、NumPy
OCR	RapidOCR + ONNX Runtime
标注前端	HTML / CSS / Canvas + `http.server`
截图	Win32 API（ctypes）+ PIL

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
runs/detect		runs/detect
tools		tools
.gitignore		.gitignore
README.MD		README.MD
bubble_role.py		bubble_role.py
data.yaml		data.yaml
predict_bubbles.py		predict_bubbles.py
pyproject.toml		pyproject.toml
text_process.py		text_process.py
train_bubbles.py		train_bubbles.py
uv.lock		uv.lock
yolo26n.pt		yolo26n.pt
yolov8n.pt		yolov8n.pt
提示词.md		提示词.md
流程图.png		流程图.png
部分说明.md		部分说明.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

微信气泡检测文字提取 · WeChat Bubble YOLO

有问题可以加我qq:`174335601`

环境

0 · 截图

1 · 标注（二选一）

2 · 划分训练 / 验证集

3 · 训练（或二次训练）

4 · 推理

5 · 文本提取

输出目录

技术栈

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

微信气泡检测文字提取 · WeChat Bubble YOLO

有问题可以加我qq:174335601

环境

0 · 截图

1 · 标注（二选一）

2 · 划分训练 / 验证集

3 · 训练（或二次训练）

4 · 推理

5 · 文本提取

输出目录

技术栈

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

有问题可以加我qq:`174335601`

Packages