0

I have a a DJVU file where I can select, copy, highlight the text of it, i.e. the DJVU is text readable or character recognizable. I want to convert this file to character recognizable pdf file. Is there online charge-free system or free software to that?

Thanks.

Michael
  • 103
  • 5

1 Answers1

2

I use DjVuLibre : DjVu

In DjVuLibre

To convert it to pdf : File > Export as… > PDF format

To convert pdf file to character recognizable pdf file :

you can use Acrobat Pro/Standard DC 2020 (direct link Acrobat Pro/Standard DC 2020(it comes with trial version)

in Acrobat > Tools > Scan and OCR > Recognize Text > in this file

IMG1 enter image description here

Or

you can use tesseract-ocr : tesseract-ocr download page.I use it with wsl/linux.Here is the bash script (it uses ghostscript && tesseract)

#!/bin/bash

notify-send "started gs" && pdftoppm -r 660 *.pdf outputname -png && for i in *png; do b=`basename "$i" .png`; tesseract "$i" "$b" pdf; done && gs -dBATCH -dNOPAUSE -q -sDEVICE=pdfwrite -dAutoRotatePages=/None -sOutputFile=finished.pdf outputname-*.pdf && notify-send "done" && notify-send "deleting left overs" && rm *png && rm outputname-*.pdf && notify-send "cleaned"
Madhubala
  • 1,781
  • 3
  • 11
  • 22