Skip to content

haohaaorg/shan-ocr-training-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Shan Language OCR training data

Shan language

Language code = shn

Two characters code = sh

Shan Wikipedia

Shan Wiktionary

Example websites that are using Shan scripts = https://shannews.org/ , http://shanunicode.com/

Shan syllable break = https://github.com/haohaaorg/shan-syllable-break

1000 images and labels to train Shan language OCR

These images are generated with nodejs text-to-image and jimp using the below two fonts and the main font is "Panglong". Each image output result is in labels.csv .

Fonts

  1. Panglong [ Shan Only ]
  2. Pyidaungsu [ Shan and Burmese ]

About

Shan language OCR training data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors