tesseract:tldr:b3846

tesseract: List the ISO 639-2 codes of available languages.

tesseract

$ tesseract --list-langs

try on your machine

The command "tesseract --list-langs" is used to list all the languages supported by the Tesseract OCR (Optical Character Recognition) engine. Tesseract is a popular open-source OCR engine developed by Google, capable of recognizing and extracting text from images or scanned documents.

When executed, the command will provide a list of language codes for all the languages that Tesseract can detect. These language codes are used to specify the language during OCR processing, allowing Tesseract to accurately recognize and extract text in the specified language.

For example, the output of the command might look like this:

List of available languages (29):
afr
amh
ara
aze
bel
ben
bul
cat
ceb
ces
chi_sim
chi_tra
dan
deu
ell
eng
enm
epo
est
eus
fin
fra
frk
frm
gle
glg
grc
hin
hrv

This indicates that Tesseract supports a total of 29 languages, and their respective language codes are listed. You can use these language codes as an input parameter while performing OCR with Tesseract to ensure accurate recognition of text in the chosen language.

This explanation was created by an AI. In most cases those are correct. But please always be careful and never run a command you are not sure if it is safe.

back to the tesseract tool