To request support for a new language, please open a PR containing the following two files:
In the folder ppocr/utils/dict,
submit the dictionary file to this path and name it {language}_dict.txt.
It should contain a list of all characters used in the language. See the other files in that folder for the expected format.
In the folder ppocr/utils/corpus,
submit the corpus file to this path and name it {language}_corpus.txt.
It should contain a list of words in your language.
At least roughly 50,000 words per language are probably needed.
Of course, the more, the better.
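Before opening the PR, it may help to sanity-check the two files locally. The following is only a minimal sketch, not part of PaddleOCR: the helper names `build_dict`, `read_lines` and `check_submission` are hypothetical, and it assumes one character per line in the dict file and one word per line in the corpus file (please confirm against the existing files in ppocr/utils/dict). The 50,000-word threshold is the rough figure mentioned above.

```python
from pathlib import Path

MIN_WORDS = 50000  # rough minimum suggested in this issue, not a hard rule


def read_lines(path):
    """Read a UTF-8 text file and return its non-empty, stripped lines."""
    text = Path(path).read_text(encoding="utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]


def build_dict(words):
    """Return the sorted unique non-whitespace characters used by the words."""
    return sorted({ch for word in words for ch in word if not ch.isspace()})


def check_submission(language,
                     dict_dir="ppocr/utils/dict",
                     corpus_dir="ppocr/utils/corpus"):
    """Compare {language}_dict.txt with {language}_corpus.txt.

    Assumes one character per line in the dict and one word per line
    in the corpus (an assumption, not a documented requirement).
    """
    dict_path = Path(dict_dir) / f"{language}_dict.txt"
    corpus_path = Path(corpus_dir) / f"{language}_corpus.txt"

    chars = set(read_lines(dict_path))
    words = read_lines(corpus_path)

    # Characters that occur in the corpus but are absent from the dict.
    missing = sorted(set(build_dict(words)) - chars)

    return {
        "num_chars": len(chars),
        "num_words": len(words),
        "enough_words": len(words) >= MIN_WORDS,
        "missing_chars": missing,
    }
```

For example, `check_submission("xx")` run from the repository root would report how many words the corpus has and which corpus characters are missing from the dict; `build_dict` can be used to draft a dict file from a corpus when none exists yet.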
We are calling for contributions to add new language support to PaddleOCR. For anyone who might be interested in training a model for a new language, guidance on training the model is provided.
If your language has unique characteristics, please let me know in advance by any means, for example by sharing useful links, Wikipedia articles and so on.