published under license Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)copy! share!
posted in category Creativity / Documents
posted at 24. Aug '21
Howto Pack Series Of Images Into DjVU + OCR
Useful for scanned books. Use Tesseract 4, Tesseract 3 is not good.
for i in *.jpg; do convert $i $i.pbm; done
for i in *.pbm; do cjb2 -clean $i $i.djvu; done
djvm -c secretbook.djvu *.djvu
ocrodjvu --engine=tesseract --in-place secretbook.djvu
Add Comment