A handy tool for reading text from image files with an OCR engine. This app uses the Tesseract optical character recognition (OCR) engine to read the image file. It currently supports the .png, .jpg, .bmp, and .pbm image formats. Drop your image file, set the desired accuracy and input language, and then wait for the OCR engine to finish the job.
This web application performs optical character recognition on input image files. It uses the Tesseract JavaScript API, an Emscripten port of the well-known Tesseract OCR engine, which is written in C and C++. The app supports four accuracy levels and over 100 input languages for the OCR engine. Read on for more information about this web application.
As mentioned above, the app offers four accuracy levels: (1) Low, (2) Fast (shorter OCR time), (3) Best (better OCR accuracy), and (4) Moderate. When you choose an accuracy level, the OCR engine fetches the matching language data from the "https://github.com/naptha/tessdata/tree/gh-pages/" GitHub repo. Depending on the accuracy level, the time needed to load the language data may vary significantly. The quality of the OCR result depends heavily on the complexity of the image. For clean and readable images, recognition accuracy can reach 95%. The final text is rendered in a textbox, from which you can copy and paste it into other applications/environments.
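The way an accuracy level could select which language data to fetch can be sketched as follows. This is a minimal illustration, not the app's actual code: the folder names, the mapping of each level to a folder, the `.traineddata.gz` file naming, and the `BASE_URL` placeholder are all assumptions made for the example.

```typescript
// The four accuracy levels named in the text.
type AccuracyLevel = "low" | "fast" | "moderate" | "best";

// ASSUMPTION: these folder names and this level-to-folder mapping are
// illustrative guesses, not taken from the app itself.
const DATA_FOLDERS: Record<AccuracyLevel, string> = {
  low: "3.02",
  fast: "4.0.0_fast",
  moderate: "4.0.0",
  best: "4.0.0_best",
};

// ASSUMPTION: hypothetical placeholder host, not the app's real endpoint.
const BASE_URL = "https://example.com/tessdata";

// Build the URL of the language data file for a level and language code.
function languageDataUrl(level: AccuracyLevel, lang: string): string {
  // Accept only simple codes such as "eng" or "chi_sim", so that the
  // value cannot alter the path of the resulting URL.
  if (!/^[A-Za-z_]+$/.test(lang)) {
    throw new Error("Invalid language code: " + lang);
  }
  return `${BASE_URL}/${DATA_FOLDERS[level]}/${lang}.traineddata.gz`;
}
```

For example, `languageDataUrl("fast", "eng")` would resolve to the `eng` data file inside the folder assumed here for the Fast level. Keeping the mapping in one table means a level can be re-pointed at a different data set without touching the URL logic.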
The Image Reader app is also available as a browser extension. As an extension, it does not need an internet connection and can be used offline in your browser. Download links are available for three popular browsers: Chrome, Opera, and Firefox.
If you find a bug on this page, or have a feature you would like to see in this web app, please let us know by sending an email or through the discussion form below. Also, don't forget to check out the other web apps on webbrowsertools.com; we have many useful apps related to browser privacy and security.