RetroSearch Browse

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Showing content from https://github.com/naptha/tesseract.js/issues/877 below:

Character level recognition gives the same results as the word level recognition. · Issue #877 · naptha/tesseract.js · GitHub

Tesseract.js version (version number for npm/GitHub release, or specific commit for repo)
Latest release version 5.0.4

Describe the bug

A clear and concise description of what the bug is.

Running Tesseract.js code in 2 different PSM modes gives the same output.

Is tesseract configured to give word level outputs only?

Am I guessing it right that PSMs just refine the recognition scope, but do not affect the output because it will always will be in words?

Running in

SINGLE_CHAR

and

PSM_SINGLE_WORD

gives the same output from the same sample.

I want to sort the result character by character and in order to do that, I want the bbox data of each character detected to be extracted, and used further. Is this possible?

Device Version:

OS + Version: [e.g. iOS8.1, Windows 10]
Windows 11
Browser [e.g. chrome, safari] or Node version [e.g. Node v18]
Edge

RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4