The Solr language detection implementation is based on the Google Code language-detection project (http://code.google.com/p/language-detection), which supports the following 53 languages and includes some advanced support for CJK (Chinese, Japanese, Korean) text.
Language Code | Language |
---|---|
af | Afrikaans |
ar | Arabic |
bg | Bulgarian |
bn | Bengali |
cs | Czech |
da | Danish |
de | German |
el | Greek |
en | English |
es | Spanish |
et | Estonian |
fa | Persian |
fi | Finnish |
fr | French |
gu | Gujarati |
he | Hebrew |
hi | Hindi |
hr | Croatian |
hu | Hungarian |
id | Indonesian |
it | Italian |
ja | Japanese |
kn | Kannada |
ko | Korean |
lt | Lithuanian |
lv | Latvian |
mk | Macedonian |
ml | Malayalam |
mr | Marathi |
ne | Nepali |
nl | Dutch |
no | Norwegian |
pa | Punjabi |
pl | Polish |
pt | Portuguese |
ro | Romanian |
ru | Russian |
sk | Slovak |
sl | Slovene |
so | Somali |
sq | Albanian |
sv | Swedish |
sw | Swahili |
ta | Tamil |
te | Telugu |
th | Thai |
tl | Tagalog |
tr | Turkish |
uk | Ukrainian |
ur | Urdu |
vi | Vietnamese |
zh-cn | Simplified Chinese |
zh-tw | Traditional Chinese |
Table 6.19. Supported Languages
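
When a list of language codes is written by hand, for example to restrict which languages are accepted, it can be useful to check each code against the table above. The following is a minimal sketch in Python, not part of Solr or of the language-detection library: the helper name `unsupported_codes` is hypothetical, and the code set is simply copied from the table. It assumes codes should be compared case-insensitively, which is an assumption of this sketch rather than documented behavior.

```python
# Language codes copied from the "Supported Languages" table above (53 entries).
SUPPORTED_LANGUAGE_CODES = frozenset("""
af ar bg bn cs da de el en es et fa fi fr gu he hi hr hu id it ja kn ko
lt lv mk ml mr ne nl no pa pl pt ro ru sk sl so sq sv sw ta te th tl tr
uk ur vi zh-cn zh-tw
""".split())


def unsupported_codes(codes):
    """Return the codes, in input order, that do not appear in the table.

    Hypothetical helper for illustration only. Codes are trimmed and
    lower-cased before comparison (an assumption of this sketch).
    """
    return [c for c in codes if c.strip().lower() not in SUPPORTED_LANGUAGE_CODES]


if __name__ == "__main__":
    # "xx" is not in the table, so it is the only code reported.
    print(unsupported_codes(["en", "zh-CN", "xx"]))
```

Note that the two Chinese entries use region-qualified codes (`zh-cn`, `zh-tw`); a bare `zh` does not appear in the table and would be reported by this check.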