List of languages in which Google's models demonstrate high proficiency

Here is a representative list of languages where models like Gemini generally demonstrate high proficiency. This is not exhaustive, but covers many of the most widely spoken and digitally represented languages:

Languages with Generally High Proficiency:

  1. Afrikaans
  2. Albanian
  3. Amharic
  4. Arabic
  5. Armenian
  6. Azerbaijani
  7. Basque
  8. Belarusian
  9. Bengali
  10. Bosnian
  11. Bulgarian
  12. Burmese
  13. Catalan
  14. Cebuano
  15. Chinese (Simplified & Traditional)
  16. Corsican
  17. Croatian
  18. Czech
  19. Danish
  20. Dutch
  21. English (Typically the most proficient)
  22. Esperanto
  23. Estonian
  24. Filipino (Tagalog)
  25. Finnish
  26. French
  27. Frisian (West)
  28. Galician
  29. Georgian
  30. German
  31. Greek
  32. Gujarati
  33. Haitian Creole
  34. Hausa
  35. Hebrew
  36. Hindi
  37. Hmong
  38. Hungarian
  39. Icelandic
  40. Igbo
  41. Indonesian
  42. Irish
  43. Italian
  44. Japanese
  45. Javanese
  46. Kannada
  47. Kazakh
  48. Khmer
  49. Kinyarwanda
  50. Korean
  51. Kurdish (Kurmanji)
  52. Kyrgyz
  53. Lao
  54. Latin
  55. Latvian
  56. Lithuanian
  57. Luxembourgish
  58. Macedonian
  59. Malagasy
  60. Malay
  61. Malayalam
  62. Maltese
  63. Maori
  64. Marathi
  65. Mongolian
  66. Nepali
  67. Norwegian (Bokmål)
  68. Nyanja (Chichewa)
  69. Odia (Oriya)
  70. Pashto
  71. Persian (Farsi)
  72. Polish
  73. Portuguese (Portugal, Brazil)
  74. Punjabi (Gurmukhi, Shahmukhi)
  75. Romanian
  76. Russian
  77. Samoan
  78. Scottish Gaelic
  79. Serbian (Cyrillic & Latin)
  80. Sesotho
  81. Shona
  82. Sindhi
  83. Sinhala
  84. Slovak
  85. Slovenian
  86. Somali
  87. Spanish
  88. Sundanese
  89. Swahili
  90. Swedish
  91. Tajik
  92. Tamil
  93. Tatar
  94. Telugu
  95. Thai
  96. Turkish
  97. Turkmen
  98. Ukrainian
  99. Urdu
  100. Uyghur
  101. Uzbek
  102. Vietnamese
  103. Welsh
  104. Xhosa
  105. Yiddish
  106. Yoruba
  107. Zulu

Important Considerations:

  • Beyond this List: The model likely has some level of understanding (e.g., for translation) of many more languages and dialects not listed here, especially those closely related to the ones above or those with a reasonable amount of text available online.
  • Quality Varies: Performance in tasks like nuanced text generation, understanding complex idioms, or capturing cultural context will be significantly better for high-resource languages (such as English, Spanish, French, German, and Chinese) than for less digitally represented languages. The list above is alphabetical, so a language's position in it says nothing about quality.
  • Task-Dependent: Proficiency might be higher for translation between languages than for generating original, creative text in a less common language.

So, yes, your intuition that it’s more than 100 languages is correct when you count the full range of capabilities, from basic understanding and translation up to high-fidelity generation. The list above covers the languages where you can generally expect strong performance.