Here’s some data from our side quest: we scraped multilingual dictionaries to check word lengths.
| Language | Dictionary Size (words) | Avg Length | Most Common Length | 20% of Words ≤ Length | 50% of Words ≤ Length |
|---|---|---|---|---|---|
| English | 466,434 | 9.42 | 9 | 7 | 9 |
| Arabic | 5,691,498 | 7.97 | 8 | 7 | 8 |
| Hindi | 476,641 | 7.15 | 6 | 5 | 7 |
| Korean | 366,503 | 3.56 | 3 | 2 | 3 |
| Chinese | 406,588 | 3.35 | 3 | 2 | 3 |
~50% of Korean and Chinese dictionary words are 3 UTF-8 characters or fewer.
~20% of Korean and Chinese dictionary words are 2 UTF-8 characters or fewer.
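
For anyone who wants to reproduce numbers like the ones in the table, here is a rough sketch of how the columns could be computed from a word list. It is not the script we actually ran: the function name `length_stats` is made up for illustration, and it assumes that length means Unicode code points (what Python's `len()` counts), that "common length" is the mode, and that each percentile column is the smallest length covering at least that share of words.

```python
from collections import Counter
from statistics import mean


def length_stats(words):
    """Summarize word lengths for one dictionary (hypothetical helper).

    Assumption: length is measured in Unicode code points via len(),
    not in UTF-8 bytes or grapheme clusters.
    """
    lengths = sorted(len(w) for w in words)
    n = len(lengths)
    if n == 0:
        raise ValueError("word list is empty")

    def percentile(p):
        # Smallest length L such that at least p% of words have length <= L.
        # Integer ceiling of p * n / 100 avoids floating-point surprises.
        rank = (p * n + 99) // 100
        return lengths[max(rank, 1) - 1]

    return {
        "size": n,
        "avg": round(mean(lengths), 2),
        # Mode of the length distribution (ties resolved by first seen).
        "common": Counter(lengths).most_common(1)[0][0],
        "p20": percentile(20),
        "p50": percentile(50),
    }


if __name__ == "__main__":
    sample = ["a", "bb", "cc", "ddd", "eeee"]
    print(length_stats(sample))
```

If the real dataset counted bytes or grapheme clusters instead, the `len(w)` call is the only line that would need to change.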
It’d be really nice to give a huge discount for non-English ENS domains.
cc @raffy
