Text Classification
fastText
language-identification
kargaranamir commited on
Commit
85cd671
·
1 Parent(s): 6877193

update list of languages and License

Browse files
Files changed (2) hide show
  1. LICENSE +253 -0
  2. README.md +364 -37
LICENSE ADDED
@@ -0,0 +1,253 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Apache License 2.0 plus notices
2
+
3
+ ---
4
+
5
+ ### Additional Notice on Training Data and Model Behavior
6
+
7
+ This software includes a machine learning model for language identification
8
+ based on statistical representations (e.g., character n-grams and linear classification).
9
+
10
+ The model was trained on a mixture of publicly available data sources, which may
11
+ include materials subject to copyright or other rights.
12
+
13
+ The model does not store or encode expressive text content. Instead, it represents
14
+ aggregated statistical patterns (such as n-gram frequency weights) that are not
15
+ intended to reproduce original source material.
16
+
17
+ No training data is included or distributed with this software.
18
+
19
+ Due to its design, this model performs classification only and is not capable of
20
+ generating or reconstructing source texts.
21
+
22
+ ---
23
+
24
+ ### Good-Faith Notice
25
+
26
+ If you are a rights holder and believe that your content has been included in the
27
+ training data and wish to request its exclusion from future versions, please contact
28
+ the authors. Reasonable efforts will be made to review and address such requests.
29
+
30
+ ---
31
+
32
+ ### Limitation and Responsibility
33
+
34
+ No representation or warranty is made regarding the licensing status of the
35
+ underlying training data.
36
+
37
+ Users are solely responsible for ensuring that their use of this software complies
38
+ with applicable laws and licensing requirements.
39
+
40
+ ---
41
+
42
+ ### Disclaimer
43
+
44
+ Unless required by applicable law or agreed to in writing, this software is
45
+ provided "AS IS", WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express
46
+ or implied, including, without limitation, any warranties of TITLE,
47
+ NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A PARTICULAR PURPOSE.
48
+
49
+ ---
50
+
51
+ ### Full Apache License Text
52
+
53
+ Apache License
54
+ Version 2.0, January 2004
55
+ http://www.apache.org/licenses/
56
+
57
+ TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
58
+
59
+ 1. Definitions.
60
+
61
+ "License" shall mean the terms and conditions for use, reproduction,
62
+ and distribution as defined by Sections 1 through 9 of this document.
63
+
64
+ "Licensor" shall mean the copyright owner or entity authorized by
65
+ the copyright owner that is granting the License.
66
+
67
+ "Legal Entity" shall mean the union of the acting entity and all
68
+ other entities that control, are controlled by, or are under common
69
+ control with that entity. For the purposes of this definition,
70
+ "control" means (i) the power, direct or indirect, to cause the
71
+ direction or management of such entity, whether by contract or
72
+ otherwise, or (ii) ownership of fifty percent (50%) or more of the
73
+ outstanding shares, or (iii) beneficial ownership of such entity.
74
+
75
+ "You" (or "Your") shall mean an individual or Legal Entity
76
+ exercising permissions granted by this License.
77
+
78
+ "Source" form shall mean the preferred form for making modifications,
79
+ including but not limited to software source code, documentation
80
+ source, and configuration files.
81
+
82
+ "Object" form shall mean any form resulting from mechanical
83
+ transformation or translation of a Source form, including but
84
+ not limited to compiled object code, generated documentation,
85
+ and conversions to other media types.
86
+
87
+ "Work" shall mean the work of authorship, whether in Source or
88
+ Object form, made available under the License, as indicated by a
89
+ copyright notice that is included in or attached to the work
90
+ (an example is provided in the Appendix below).
91
+
92
+ "Derivative Works" shall mean any work, whether in Source or Object
93
+ form, that is based on (or derived from) the Work and for which the
94
+ editorial revisions, annotations, elaborations, or other modifications
95
+ represent, as a whole, an original work of authorship. For the purposes
96
+ of this License, Derivative Works shall not include works that remain
97
+ separable from, or merely link (or bind by name) to the interfaces of,
98
+ the Work and Derivative Works thereof.
99
+
100
+ "Contribution" shall mean any work of authorship, including
101
+ the original version of the Work and any modifications or additions
102
+ to that Work or Derivative Works thereof, that is intentionally
103
+ submitted to Licensor for inclusion in the Work by the copyright owner
104
+ or by an individual or Legal Entity authorized to submit on behalf of
105
+ the copyright owner. For the purposes of this definition, "submitted"
106
+ means any form of electronic, verbal, or written communication sent
107
+ to the Licensor or its representatives, including but not limited to
108
+ communication on electronic mailing lists, source code control systems,
109
+ and issue tracking systems that are managed by, or on behalf of, the
110
+ Licensor for the purpose of discussing and improving the Work, but
111
+ excluding communication that is conspicuously marked or otherwise
112
+ designated in writing by the copyright owner as "Not a Contribution."
113
+
114
+ "Contributor" shall mean Licensor and any individual or Legal Entity
115
+ on behalf of whom a Contribution has been received by Licensor and
116
+ subsequently incorporated within the Work.
117
+
118
+ 2. Grant of Copyright License. Subject to the terms and conditions of
119
+ this License, each Contributor hereby grants to You a perpetual,
120
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
121
+ copyright license to reproduce, prepare Derivative Works of,
122
+ publicly display, publicly perform, sublicense, and distribute the
123
+ Work and such Derivative Works in Source or Object form.
124
+
125
+ 3. Grant of Patent License. Subject to the terms and conditions of
126
+ this License, each Contributor hereby grants to You a perpetual,
127
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
128
+ (except as stated in this section) patent license to make, have made,
129
+ use, offer to sell, sell, import, and otherwise transfer the Work,
130
+ where such license applies only to those patent claims licensable
131
+ by such Contributor that are necessarily infringed by their
132
+ Contribution(s) alone or by combination of their Contribution(s)
133
+ with the Work to which such Contribution(s) was submitted. If You
134
+ institute patent litigation against any entity (including a
135
+ cross-claim or counterclaim in a lawsuit) alleging that the Work
136
+ or a Contribution incorporated within the Work constitutes direct
137
+ or contributory patent infringement, then any patent licenses
138
+ granted to You under this License for that Work shall terminate
139
+ as of the date such litigation is filed.
140
+
141
+ 4. Redistribution. You may reproduce and distribute copies of the
142
+ Work or Derivative Works thereof in any medium, with or without
143
+ modifications, and in Source or Object form, provided that You
144
+ meet the following conditions:
145
+
146
+ (a) You must give any other recipients of the Work or
147
+ Derivative Works a copy of this License; and
148
+
149
+ (b) You must cause any modified files to carry prominent notices
150
+ stating that You changed the files; and
151
+
152
+ (c) You must retain, in the Source form of any Derivative Works
153
+ that You distribute, all copyright, patent, trademark, and
154
+ attribution notices from the Source form of the Work,
155
+ excluding those notices that do not pertain to any part of
156
+ the Derivative Works; and
157
+
158
+ (d) If the Work includes a "NOTICE" text file as part of its
159
+ distribution, then any Derivative Works that You distribute must
160
+ include a readable copy of the attribution notices contained
161
+ within such NOTICE file, excluding those notices that do not
162
+ pertain to any part of the Derivative Works, in at least one
163
+ of the following places: within a NOTICE text file distributed
164
+ as part of the Derivative Works; within the Source form or
165
+ documentation, if provided along with the Derivative Works; or,
166
+ within a display generated by the Derivative Works, if and
167
+ wherever such third-party notices normally appear. The contents
168
+ of the NOTICE file are for informational purposes only and
169
+ do not modify the License. You may add Your own attribution
170
+ notices within Derivative Works that You distribute, alongside
171
+ or as an addendum to the NOTICE text from the Work, provided
172
+ that such additional attribution notices cannot be construed
173
+ as modifying the License.
174
+
175
+ You may add Your own copyright statement to Your modifications and
176
+ may provide additional or different license terms and conditions
177
+ for use, reproduction, or distribution of Your modifications, or
178
+ for any such Derivative Works as a whole, provided Your use,
179
+ reproduction, and distribution of the Work otherwise complies with
180
+ the conditions stated in this License.
181
+
182
+ 5. Submission of Contributions. Unless You explicitly state otherwise,
183
+ any Contribution intentionally submitted for inclusion in the Work
184
+ by You to the Licensor shall be under the terms and conditions of
185
+ this License, without any additional terms or conditions.
186
+ Notwithstanding the above, nothing herein shall supersede or modify
187
+ the terms of any separate license agreement you may have executed
188
+ with Licensor regarding such Contributions.
189
+
190
+ 6. Trademarks. This License does not grant permission to use the trade
191
+ names, trademarks, service marks, or product names of the Licensor,
192
+ except as required for reasonable and customary use in describing the
193
+ origin of the Work and reproducing the content of the NOTICE file.
194
+
195
+ 7. Disclaimer of Warranty. Unless required by applicable law or
196
+ agreed to in writing, Licensor provides the Work (and each
197
+ Contributor provides its Contributions) on an "AS IS" BASIS,
198
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
199
+ implied, including, without limitation, any warranties or conditions
200
+ of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
201
+ PARTICULAR PURPOSE. You are solely responsible for determining the
202
+ appropriateness of using or redistributing the Work and assume any
203
+ risks associated with Your exercise of permissions under this License.
204
+
205
+ 8. Limitation of Liability. In no event and under no legal theory,
206
+ whether in tort (including negligence), contract, or otherwise,
207
+ unless required by applicable law (such as deliberate and grossly
208
+ negligent acts) or agreed to in writing, shall any Contributor be
209
+ liable to You for damages, including any direct, indirect, special,
210
+ incidental, or consequential damages of any character arising as a
211
+ result of this License or out of the use or inability to use the
212
+ Work (including but not limited to damages for loss of goodwill,
213
+ work stoppage, computer failure or malfunction, or any and all
214
+ other commercial damages or losses), even if such Contributor
215
+ has been advised of the possibility of such damages.
216
+
217
+ 9. Accepting Warranty or Additional Liability. While redistributing
218
+ the Work or Derivative Works thereof, You may choose to offer,
219
+ and charge a fee for, acceptance of support, warranty, indemnity,
220
+ or other liability obligations and/or rights consistent with this
221
+ License. However, in accepting such obligations, You may act only
222
+ on Your own behalf and on Your sole responsibility, not on behalf
223
+ of any other Contributor, and only if You agree to indemnify,
224
+ defend, and hold each Contributor harmless for any liability
225
+ incurred by, or claims asserted against, such Contributor by reason
226
+ of your accepting any such warranty or additional liability.
227
+
228
+ END OF TERMS AND CONDITIONS
229
+
230
+ APPENDIX: How to apply the Apache License to your work.
231
+
232
+ To apply the Apache License to your work, attach the following
233
+ boilerplate notice, with the fields enclosed by brackets "[]"
234
+ replaced with your own identifying information. (Don't include
235
+ the brackets!) The text should be enclosed in the appropriate
236
+ comment syntax for the file format. We also recommend that a
237
+ file or class name and description of purpose be included on the
238
+ same "printed page" as the copyright notice for easier
239
+ identification within third-party archives.
240
+
241
+ Copyright [2023] [AUTHORS]
242
+
243
+ Licensed under the Apache License, Version 2.0 (the "License");
244
+ you may not use this file except in compliance with the License.
245
+ You may obtain a copy of the License at
246
+
247
+ http://www.apache.org/licenses/LICENSE-2.0
248
+
249
+ Unless required by applicable law or agreed to in writing, software
250
+ distributed under the License is distributed on an "AS IS" BASIS,
251
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
252
+ See the License for the specific language governing permissions and
253
+ limitations under the License.
README.md CHANGED
@@ -1,13 +1,20 @@
1
  ---
2
- license: apache-2.0
 
 
3
  language:
 
 
4
  - aak
5
  - aau
6
  - aaz
 
7
  - aba
 
8
  - abk
9
  - abn
10
  - abq
 
11
  - abt
12
  - abx
13
  - aby
@@ -28,12 +35,14 @@ language:
28
  - adi
29
  - adj
30
  - adl
 
31
  - ady
32
  - adz
33
  - aeb
34
  - aer
35
  - aeu
36
  - aey
 
37
  - afb
38
  - afh
39
  - afr
@@ -56,6 +65,7 @@ language:
56
  - aji
57
  - ajp
58
  - ajz
 
59
  - aka
60
  - akb
61
  - ake
@@ -71,6 +81,7 @@ language:
71
  - alt
72
  - aly
73
  - alz
 
74
  - ame
75
  - amf
76
  - amh
@@ -82,9 +93,11 @@ language:
82
  - amr
83
  - amu
84
  - amx
 
85
  - ang
86
  - anm
87
  - ann
 
88
  - anv
89
  - any
90
  - aoc
@@ -103,6 +116,8 @@ language:
103
  - apw
104
  - apy
105
  - apz
 
 
106
  - ara
107
  - arb
108
  - are
@@ -112,9 +127,11 @@ language:
112
  - arn
113
  - arp
114
  - arq
 
115
  - ars
116
  - ary
117
  - arz
 
118
  - asg
119
  - asm
120
  - aso
@@ -124,26 +141,34 @@ language:
124
  - atd
125
  - atg
126
  - ati
 
 
127
  - att
128
  - auc
129
  - aui
130
  - auy
 
131
  - ava
132
  - avk
 
133
  - avt
134
  - avu
135
  - awa
136
  - awb
137
  - awi
138
  - awx
 
139
  - aym
140
  - ayo
 
141
  - ayr
 
142
  - azb
143
  - aze
144
  - azg
145
  - azj
146
  - azz
 
147
  - bak
148
  - bal
149
  - bam
@@ -156,6 +181,8 @@ language:
156
  - bbb
157
  - bbc
158
  - bbj
 
 
159
  - bbr
160
  - bcc
161
  - bch
@@ -165,6 +192,8 @@ language:
165
  - bcw
166
  - bdd
167
  - bdh
 
 
168
  - bea
169
  - bef
170
  - bel
@@ -172,18 +201,23 @@ language:
172
  - ben
173
  - beq
174
  - ber
 
175
  - bex
176
  - bfd
177
  - bfo
178
  - bfz
 
179
  - bgr
180
  - bgs
 
181
  - bgz
182
  - bhg
183
  - bhl
184
  - bho
185
  - bhp
186
  - bhw
 
 
187
  - bib
188
  - big
189
  - bih
@@ -198,26 +232,35 @@ language:
198
  - bjr
199
  - bjv
200
  - bkd
 
201
  - bkq
202
  - bku
203
  - bkv
204
  - bla
205
  - blh
 
 
206
  - blw
207
  - blz
 
208
  - bmb
209
  - bmh
210
  - bmk
211
  - bmq
212
  - bmr
213
  - bmu
 
 
214
  - bnj
 
215
  - bnp
 
216
  - boa
217
  - bod
218
  - boj
219
  - bom
220
  - bon
 
221
  - bos
222
  - bov
223
  - box
@@ -227,9 +270,12 @@ language:
227
  - bqc
228
  - bqj
229
  - bqp
 
230
  - bre
 
231
  - bru
232
  - brx
 
233
  - bsc
234
  - bsn
235
  - bsp
@@ -248,6 +294,8 @@ language:
248
  - bul
249
  - bum
250
  - bus
 
 
251
  - bvr
252
  - bvy
253
  - bvz
@@ -265,6 +313,7 @@ language:
265
  - bzi
266
  - bzj
267
  - bzt
 
268
  - caa
269
  - cab
270
  - cac
@@ -279,7 +328,6 @@ language:
279
  - cat
280
  - cav
281
  - cax
282
- - cay
283
  - cbc
284
  - cbi
285
  - cbk
@@ -290,6 +338,9 @@ language:
290
  - cbv
291
  - cce
292
  - cco
 
 
 
293
  - ceb
294
  - ceg
295
  - cek
@@ -297,11 +348,11 @@ language:
297
  - cfm
298
  - cgc
299
  - cgg
 
300
  - cha
301
  - chd
302
  - che
303
  - chf
304
- - chg
305
  - chj
306
  - chk
307
  - chn
@@ -317,8 +368,8 @@ language:
317
  - cjp
318
  - cjs
319
  - cjv
320
- - cjy
321
  - ckb
 
322
  - cko
323
  - ckt
324
  - cle
@@ -328,11 +379,15 @@ language:
328
  - cmi
329
  - cmn
330
  - cmo
 
331
  - cnh
332
  - cni
 
333
  - cnl
 
334
  - cnt
335
  - cnw
 
336
  - coe
337
  - cof
338
  - cok
@@ -341,21 +396,24 @@ language:
341
  - cor
342
  - cos
343
  - cot
 
344
  - cpa
345
  - cpb
346
  - cpc
347
- - cpi
348
  - cpu
349
  - cpy
350
  - crh
351
  - cri
 
352
  - crk
 
353
  - crm
354
  - crn
355
  - crq
356
  - crs
357
  - crt
358
  - crx
 
359
  - csb
360
  - csk
361
  - cso
@@ -366,6 +424,7 @@ language:
366
  - cto
367
  - ctp
368
  - ctu
 
369
  - cub
370
  - cuc
371
  - cui
@@ -373,21 +432,27 @@ language:
373
  - cul
374
  - cut
375
  - cux
 
376
  - cwd
377
  - cwe
378
  - cwt
 
379
  - cya
380
  - cym
381
  - czt
 
382
  - daa
383
  - dad
384
  - daf
 
385
  - dah
386
  - dak
387
  - dan
388
  - dar
 
389
  - ddg
390
- - ddo
 
391
  - ded
392
  - des
393
  - deu
@@ -399,6 +464,7 @@ language:
399
  - dhg
400
  - dhm
401
  - dhv
 
402
  - dig
403
  - dik
404
  - din
@@ -411,55 +477,73 @@ language:
411
  - djk
412
  - djr
413
  - dks
 
414
  - dng
415
  - dnj
 
416
  - dob
 
417
  - dop
 
418
  - dow
419
  - drg
420
  - drt
421
  - dru
422
  - dsb
 
 
423
  - dtp
424
  - dts
 
425
  - dua
426
  - due
427
  - dug
428
  - duo
429
  - dur
 
430
  - dwr
431
  - dws
432
  - dww
433
  - dyi
434
  - dyo
435
  - dyu
 
436
  - dzo
437
  - ebk
 
438
  - efi
439
  - egl
440
  - eka
441
  - ekk
442
  - eko
 
443
  - ell
 
444
  - emi
445
  - eml
446
  - emp
447
- - emx
448
  - enb
449
  - eng
450
  - enl
451
  - enm
 
452
  - enx
 
453
  - epo
454
  - eri
 
455
  - ese
456
  - esi
457
  - esk
 
458
  - est
459
  - esu
 
460
  - eto
461
  - etr
462
  - etu
 
463
  - eus
464
  - eve
465
  - evn
@@ -467,29 +551,38 @@ language:
467
  - ewo
468
  - ext
469
  - eza
 
470
  - faa
471
  - fad
472
  - fai
473
  - fal
474
  - fan
475
  - fao
 
476
  - fas
477
  - fat
478
  - ffm
 
479
  - fij
480
  - fil
481
  - fin
 
 
482
  - fkv
483
  - fmp
 
 
484
  - fon
485
  - for
 
486
  - fra
 
487
  - frm
488
  - fro
 
489
  - frr
490
  - fry
491
  - fub
492
- - fuc
493
  - fud
494
  - fue
495
  - fuf
@@ -497,6 +590,8 @@ language:
497
  - fuq
498
  - fur
499
  - fuv
 
 
500
  - gaa
501
  - gag
502
  - gah
@@ -506,11 +601,11 @@ language:
506
  - gaz
507
  - gba
508
  - gbi
509
- - gbm
510
  - gbo
511
  - gbr
512
  - gcf
513
  - gcr
 
514
  - gde
515
  - gdg
516
  - gdn
@@ -526,12 +621,16 @@ language:
526
  - gjn
527
  - gkn
528
  - gkp
 
529
  - gla
530
  - gle
531
  - glg
532
  - glk
533
  - glv
 
534
  - gmv
 
 
535
  - gnb
536
  - gnd
537
  - gng
@@ -540,6 +639,7 @@ language:
540
  - goa
541
  - gof
542
  - gog
 
543
  - gom
544
  - gor
545
  - gos
@@ -550,6 +650,7 @@ language:
550
  - grt
551
  - gso
552
  - gsw
 
553
  - gub
554
  - guc
555
  - gud
@@ -564,18 +665,23 @@ language:
564
  - guo
565
  - guq
566
  - gur
 
567
  - guw
568
  - gux
569
  - guz
 
570
  - gvc
571
  - gvf
572
  - gvl
573
  - gvn
574
  - gwi
 
575
  - gxx
576
  - gya
577
  - gym
578
  - gyr
 
 
579
  - hae
580
  - hag
581
  - hak
@@ -587,11 +693,12 @@ language:
587
  - hbo
588
  - hbs
589
  - hch
590
- - hdn
591
  - heb
592
  - heg
593
  - heh
594
  - her
 
595
  - hif
596
  - hig
597
  - hil
@@ -606,15 +713,18 @@ language:
606
  - hnj
607
  - hnn
608
  - hns
 
609
  - hoc
610
  - hop
611
  - hot
 
612
  - hra
613
  - hrv
614
  - hrx
615
  - hsb
616
- - hsn
617
  - hto
 
618
  - hub
619
  - hui
620
  - hun
@@ -623,28 +733,36 @@ language:
623
  - huv
624
  - hvn
625
  - hwc
 
626
  - hye
627
  - hyw
 
 
628
  - ian
629
  - iba
630
  - ibg
631
  - ibo
632
  - icr
 
633
  - ido
634
  - idu
 
635
  - ifa
636
  - ifb
637
  - ife
638
  - ifk
639
  - ifu
640
  - ify
 
641
  - ige
642
  - ign
643
  - igs
 
644
  - iii
645
  - ijc
646
  - ike
647
  - ikk
 
648
  - ikw
649
  - ilb
650
  - ile
@@ -653,18 +771,23 @@ language:
653
  - ina
654
  - inb
655
  - ind
 
656
  - ino
 
657
  - iou
658
  - ipi
659
  - iqw
660
  - iri
661
  - irk
662
  - iry
 
663
  - isd
664
  - ish
665
  - isl
666
  - iso
 
667
  - ita
 
668
  - its
669
  - itv
670
  - ium
@@ -675,6 +798,8 @@ language:
675
  - izh
676
  - izr
677
  - izz
 
 
678
  - jac
679
  - jae
680
  - jam
@@ -689,10 +814,14 @@ language:
689
  - jpa
690
  - jpn
691
  - jra
 
 
692
  - jvn
 
693
  - kaa
694
  - kab
695
  - kac
 
696
  - kal
697
  - kam
698
  - kan
@@ -706,9 +835,13 @@ language:
706
  - kbd
707
  - kbh
708
  - kbm
 
709
  - kbp
710
  - kbq
711
  - kbr
 
 
 
712
  - kck
713
  - kdc
714
  - kde
@@ -716,44 +849,58 @@ language:
716
  - kdi
717
  - kdj
718
  - kdl
 
 
719
  - kea
720
  - kei
721
  - kek
722
  - ken
723
- - ket
 
724
  - kew
725
  - kex
726
  - kez
727
  - kff
 
728
  - kgf
729
  - kgk
730
  - kgp
 
731
  - kha
 
732
  - khk
733
  - khm
 
734
  - khs
735
  - khy
736
  - khz
 
737
  - kia
 
738
  - kik
739
  - kin
740
  - kir
741
  - kiu
742
  - kix
 
743
  - kjb
744
  - kje
745
  - kjh
746
  - kjs
 
747
  - kkc
748
  - kki
749
  - kkj
750
  - kkl
751
- - klj
 
752
  - kln
753
  - klt
754
  - klv
 
755
  - kma
756
  - kmb
 
757
  - kmg
758
  - kmh
759
  - kmk
@@ -762,6 +909,8 @@ language:
762
  - kmr
763
  - kms
764
  - kmu
 
 
765
  - knc
766
  - kne
767
  - knf
@@ -772,6 +921,7 @@ language:
772
  - knv
773
  - knx
774
  - kny
 
775
  - kog
776
  - koi
777
  - kom
@@ -779,14 +929,17 @@ language:
779
  - koo
780
  - kor
781
  - kos
 
782
  - kpf
783
  - kpg
784
  - kpj
 
785
  - kpr
786
  - kpv
787
  - kpw
788
  - kpx
789
  - kpz
 
790
  - kqc
791
  - kqe
792
  - kqf
@@ -802,6 +955,8 @@ language:
802
  - krj
803
  - krl
804
  - kru
 
 
805
  - ksb
806
  - ksc
807
  - ksd
@@ -817,6 +972,7 @@ language:
817
  - ktm
818
  - kto
819
  - ktu
 
820
  - kua
821
  - kub
822
  - kud
@@ -825,8 +981,11 @@ language:
825
  - kum
826
  - kup
827
  - kus
 
 
828
  - kvj
829
  - kvn
 
830
  - kwd
831
  - kwf
832
  - kwi
@@ -836,6 +995,7 @@ language:
836
  - kxc
837
  - kxm
838
  - kxw
 
839
  - kyc
840
  - kyf
841
  - kyg
@@ -846,22 +1006,26 @@ language:
846
  - kzf
847
  - kzj
848
  - kzn
849
- - laa
850
  - lac
851
  - lad
852
  - lai
853
  - laj
854
  - lam
855
  - lao
 
856
  - las
857
  - lat
858
  - lav
 
 
859
  - lbb
860
  - lbe
861
  - lbj
862
  - lbk
863
  - lch
864
  - lcm
 
865
  - ldi
866
  - ldn
867
  - lea
@@ -875,10 +1039,14 @@ language:
875
  - lex
876
  - lez
877
  - lfn
 
 
 
878
  - lgm
879
  - lhi
880
  - lhm
881
  - lhu
 
882
  - lia
883
  - lid
884
  - lif
@@ -887,37 +1055,49 @@ language:
887
  - lin
888
  - lip
889
  - lir
 
890
  - lit
891
  - liv
892
  - ljp
893
- - lkt
894
  - llb
895
  - lld
 
896
  - lln
897
  - lmk
898
  - lmo
899
  - lmp
 
 
 
900
  - lob
901
  - loe
902
  - log
 
903
  - lol
904
  - lom
905
  - loq
906
- - lou
907
  - loz
 
908
  - lsi
909
  - lsm
 
910
  - ltg
911
  - ltz
 
912
  - lua
913
  - lub
 
 
914
  - lue
915
  - lug
916
  - lun
917
  - luo
918
  - lus
919
  - lut
 
920
  - lvs
 
921
  - lwo
922
  - lww
923
  - lzh
@@ -971,28 +1151,33 @@ language:
971
  - men
972
  - meq
973
  - mer
 
974
  - meu
975
  - mev
976
  - mfa
977
  - mfe
 
978
  - mfh
979
  - mfi
980
  - mfk
981
  - mfq
982
  - mfy
983
  - mfz
 
984
  - mgc
985
  - mgh
986
  - mgm
987
  - mgo
988
  - mgr
989
  - mgv
 
990
  - mhi
991
  - mhl
992
  - mhr
993
  - mhw
994
  - mhx
995
  - mhy
 
996
  - mib
997
  - mic
998
  - mie
@@ -1001,8 +1186,10 @@ language:
1001
  - mih
1002
  - mik
1003
  - mil
 
1004
  - min
1005
  - mio
 
1006
  - miq
1007
  - mir
1008
  - mit
@@ -1010,11 +1197,13 @@ language:
1010
  - miz
1011
  - mjc
1012
  - mjw
 
1013
  - mkd
1014
  - mkl
1015
  - mkn
1016
  - mks
1017
  - mkz
 
1018
  - mlg
1019
  - mlh
1020
  - mlp
@@ -1023,13 +1212,13 @@ language:
1023
  - mmn
1024
  - mmo
1025
  - mmx
 
1026
  - mna
1027
  - mnb
1028
- - mnc
1029
  - mnf
1030
  - mni
1031
  - mnk
1032
- - mnr
1033
  - mnw
1034
  - mnx
1035
  - mny
@@ -1052,12 +1241,14 @@ language:
1052
  - mqb
1053
  - mqj
1054
  - mqy
 
1055
  - mrg
1056
  - mri
1057
  - mrj
1058
  - mrq
1059
  - mrv
1060
  - mrw
 
1061
  - msa
1062
  - msb
1063
  - msc
@@ -1065,6 +1256,7 @@ language:
1065
  - msk
1066
  - msm
1067
  - msy
 
1068
  - mta
1069
  - mtg
1070
  - mti
@@ -1074,6 +1266,8 @@ language:
1074
  - mua
1075
  - mug
1076
  - muh
 
 
1077
  - mur
1078
  - mus
1079
  - mux
@@ -1081,7 +1275,6 @@ language:
1081
  - mva
1082
  - mvn
1083
  - mvp
1084
- - mvv
1085
  - mwc
1086
  - mwf
1087
  - mwl
@@ -1096,6 +1289,7 @@ language:
1096
  - mxq
1097
  - mxt
1098
  - mxv
 
1099
  - mya
1100
  - myb
1101
  - myk
@@ -1117,13 +1311,12 @@ language:
1117
  - nah
1118
  - nak
1119
  - nan
1120
- - nan
1121
  - nap
1122
  - naq
1123
  - nas
1124
- - nau
1125
  - nav
1126
  - naw
 
1127
  - nba
1128
  - nbc
1129
  - nbe
@@ -1134,9 +1327,11 @@ language:
1134
  - nch
1135
  - ncj
1136
  - ncl
 
1137
  - nct
1138
  - ncu
1139
  - ncx
 
1140
  - ndc
1141
  - nde
1142
  - ndh
@@ -1145,17 +1340,19 @@ language:
1145
  - ndo
1146
  - ndp
1147
  - nds
 
1148
  - ndz
 
1149
  - neb
1150
  - nep
1151
  - new
1152
  - nfa
1153
  - nfr
 
1154
  - ngb
1155
  - ngc
1156
  - ngl
1157
  - ngp
1158
- - ngt
1159
  - ngu
1160
  - nhd
1161
  - nhe
@@ -1174,9 +1371,9 @@ language:
1174
  - nij
1175
  - nim
1176
  - nin
 
1177
  - niq
1178
  - niu
1179
- - niv
1180
  - niy
1181
  - njb
1182
  - njm
@@ -1184,18 +1381,21 @@ language:
1184
  - njo
1185
  - njz
1186
  - nka
 
1187
  - nki
1188
  - nko
 
1189
  - nla
1190
  - nlc
1191
  - nld
1192
- - nlv
1193
  - nma
1194
  - nmf
1195
  - nmh
1196
  - nmo
1197
  - nmw
1198
  - nmz
 
1199
  - nnb
1200
  - nng
1201
  - nnh
@@ -1204,8 +1404,10 @@ language:
1204
  - nnp
1205
  - nnq
1206
  - nnw
 
1207
  - noa
1208
  - nob
 
1209
  - nog
1210
  - non
1211
  - nop
@@ -1218,9 +1420,12 @@ language:
1218
  - npl
1219
  - npo
1220
  - npy
 
 
1221
  - nre
1222
  - nrf
1223
  - nri
 
1224
  - nsa
1225
  - nse
1226
  - nsm
@@ -1231,13 +1436,18 @@ language:
1231
  - nsu
1232
  - ntp
1233
  - ntr
 
 
1234
  - nus
1235
  - nuy
 
 
1236
  - nvm
1237
  - nwb
1238
  - nwi
1239
  - nwx
1240
  - nxd
 
1241
  - nya
1242
  - nyf
1243
  - nyk
@@ -1245,40 +1455,49 @@ language:
1245
  - nyo
1246
  - nyu
1247
  - nyy
 
1248
  - nzb
1249
  - nzi
1250
  - nzm
1251
- - oar
1252
  - obo
 
1253
  - oci
1254
- - ofs
1255
  - ogo
 
1256
  - ojb
1257
  - oji
1258
  - ojs
1259
  - oke
 
1260
  - okv
1261
  - old
 
 
 
1262
  - omw
1263
  - ong
1264
  - ons
1265
  - ood
1266
  - opm
 
1267
  - ori
1268
  - orm
1269
  - orv
1270
  - ory
1271
- - osp
1272
  - oss
1273
  - ota
 
1274
  - ote
1275
  - otm
1276
  - otn
 
1277
  - otq
1278
  - ots
1279
  - otw
1280
  - oym
1281
  - ozm
 
1282
  - pab
1283
  - pad
1284
  - pag
@@ -1300,35 +1519,40 @@ language:
1300
  - pdt
1301
  - pem
1302
  - pes
 
1303
  - pfe
1304
  - pfl
1305
  - phm
1306
- - phn
1307
  - pib
1308
  - pid
 
1309
  - pio
1310
  - pir
1311
  - pis
1312
  - pjt
1313
  - pkb
 
1314
  - plg
1315
- - pli
1316
  - pls
1317
  - plt
1318
  - plu
1319
  - plw
1320
  - pma
1321
  - pmf
 
1322
  - pms
1323
  - pmx
1324
  - pnb
1325
  - pne
 
 
1326
  - poe
1327
  - poh
1328
  - poi
1329
  - pol
1330
  - pon
1331
  - por
 
1332
  - pot
1333
  - pov
1334
  - poy
@@ -1340,14 +1564,22 @@ language:
1340
  - prg
1341
  - pri
1342
  - prk
 
1343
  - prs
 
1344
  - pse
 
 
1345
  - ptp
1346
  - ptu
1347
  - pua
 
1348
  - pus
1349
  - pwg
 
1350
  - pww
 
 
1351
  - qub
1352
  - quc
1353
  - que
@@ -1376,19 +1608,25 @@ language:
1376
  - qxl
1377
  - qxn
1378
  - qxo
1379
- - qxq
1380
  - qxr
1381
  - qya
1382
  - rad
1383
  - rai
1384
  - rap
1385
  - rar
 
 
1386
  - rcf
 
 
 
1387
  - rhg
1388
  - ria
1389
  - rif
1390
  - rim
 
1391
  - rkb
 
1392
  - rmc
1393
  - rme
1394
  - rml
@@ -1396,51 +1634,68 @@ language:
1396
  - rmo
1397
  - rmq
1398
  - rmy
 
1399
  - rnd
1400
  - rng
1401
  - rnl
 
1402
  - roh
1403
  - rom
1404
  - ron
1405
  - roo
1406
  - rop
 
1407
  - rro
1408
  - rtm
 
1409
  - rub
1410
  - rue
1411
  - ruf
 
1412
  - run
1413
  - rup
1414
  - rus
 
1415
  - rwo
1416
- - ryu
1417
  - sab
1418
  - sag
1419
  - sah
 
1420
  - san
1421
  - sas
1422
  - sat
 
1423
  - sba
1424
  - sbd
1425
  - sbe
1426
  - sbl
1427
  - sbs
1428
  - sby
 
1429
  - sck
1430
  - scn
1431
  - sco
 
1432
  - sda
 
1433
  - sdh
1434
  - sdo
 
 
1435
  - seh
 
1436
  - ses
1437
  - sey
1438
  - sfw
 
1439
  - sgb
 
1440
  - sgh
1441
  - sgs
1442
  - sgw
1443
  - sgz
 
1444
  - shi
1445
  - shk
1446
  - shn
@@ -1449,6 +1704,7 @@ language:
1449
  - shs
1450
  - shu
1451
  - shy
 
1452
  - sid
1453
  - sig
1454
  - sil
@@ -1456,18 +1712,27 @@ language:
1456
  - sin
1457
  - sja
1458
  - sjn
 
 
 
1459
  - skg
1460
  - skr
 
1461
  - sld
1462
  - slk
1463
  - sll
1464
  - slv
 
1465
  - sma
1466
  - sme
 
1467
  - smk
1468
  - sml
 
1469
  - smo
 
1470
  - smt
 
1471
  - sna
1472
  - snc
1473
  - snd
@@ -1476,6 +1741,7 @@ language:
1476
  - snp
1477
  - snw
1478
  - sny
 
1479
  - soe
1480
  - som
1481
  - sop
@@ -1488,7 +1754,9 @@ language:
1488
  - spp
1489
  - sps
1490
  - spy
 
1491
  - sqi
 
1492
  - srd
1493
  - sri
1494
  - srm
@@ -1496,13 +1764,16 @@ language:
1496
  - srp
1497
  - srq
1498
  - srr
 
1499
  - ssd
1500
  - ssg
1501
  - ssw
1502
  - ssx
 
1503
  - stn
1504
  - stp
1505
  - stq
 
1506
  - sua
1507
  - suc
1508
  - sue
@@ -1512,6 +1783,8 @@ language:
1512
  - sus
1513
  - sux
1514
  - suz
 
 
1515
  - swa
1516
  - swb
1517
  - swc
@@ -1527,6 +1800,8 @@ language:
1527
  - syl
1528
  - szb
1529
  - szl
 
 
1530
  - tab
1531
  - tac
1532
  - tah
@@ -1538,11 +1813,13 @@ language:
1538
  - tat
1539
  - tav
1540
  - taw
 
1541
  - tbc
1542
  - tbg
1543
  - tbk
1544
  - tbl
1545
  - tbo
 
1546
  - tby
1547
  - tbz
1548
  - tca
@@ -1553,6 +1830,7 @@ language:
1553
  - tcz
1554
  - tdt
1555
  - tdx
 
1556
  - ted
1557
  - tee
1558
  - tel
@@ -1562,13 +1840,17 @@ language:
1562
  - tet
1563
  - tew
1564
  - tfr
 
1565
  - tgk
1566
  - tgl
1567
  - tgo
1568
  - tgp
 
1569
  - tha
1570
  - thk
 
1571
  - thv
 
1572
  - tif
1573
  - tig
1574
  - tih
@@ -1577,24 +1859,29 @@ language:
1577
  - tir
1578
  - tiv
1579
  - tiy
 
1580
  - tke
1581
  - tkl
1582
  - tkr
1583
  - tku
 
1584
  - tlb
1585
  - tlf
1586
  - tlh
1587
  - tlj
1588
  - tll
1589
  - tly
 
1590
  - tmd
1591
  - tmr
1592
- - tmw
1593
  - tna
1594
  - tnc
1595
  - tnk
1596
  - tnn
1597
  - tnp
 
 
1598
  - tob
1599
  - toc
1600
  - tod
@@ -1610,22 +1897,29 @@ language:
1610
  - tpa
1611
  - tpi
1612
  - tpm
 
1613
  - tpp
1614
  - tpt
1615
  - tpw
1616
  - tpz
1617
  - tqb
 
 
1618
  - trc
1619
  - trn
1620
  - tro
1621
  - trp
1622
  - trq
 
 
 
1623
  - tsc
1624
  - tsg
1625
  - tsn
1626
  - tso
1627
  - tsw
1628
  - tsz
 
1629
  - ttc
1630
  - tte
1631
  - ttj
@@ -1636,17 +1930,21 @@ language:
1636
  - tuf
1637
  - tui
1638
  - tuk
 
1639
  - tum
1640
  - tuo
1641
  - tur
1642
  - tuv
1643
  - tvk
1644
  - tvl
 
 
1645
  - twi
1646
  - twu
1647
  - twx
1648
  - txq
1649
  - txu
 
1650
  - tyv
1651
  - tzh
1652
  - tzj
@@ -1657,10 +1955,14 @@ language:
1657
  - ubu
1658
  - udm
1659
  - udu
 
1660
  - uig
 
1661
  - ukr
1662
  - umb
 
1663
  - upv
 
1664
  - ura
1665
  - urb
1666
  - urd
@@ -1668,19 +1970,26 @@ language:
1668
  - uri
1669
  - urk
1670
  - urt
 
 
1671
  - usa
1672
  - usp
 
1673
  - uvh
1674
  - uvl
 
1675
  - uzb
1676
  - uzn
 
1677
  - vag
1678
  - vap
1679
  - var
 
1680
  - vec
1681
  - ven
1682
  - vep
1683
  - vgt
 
1684
  - vid
1685
  - vie
1686
  - viv
@@ -1688,10 +1997,13 @@ language:
1688
  - vmk
1689
  - vmw
1690
  - vmy
 
1691
  - vol
 
1692
  - vro
1693
  - vun
1694
  - vut
 
1695
  - waj
1696
  - wal
1697
  - wap
@@ -1714,15 +2026,18 @@ language:
1714
  - wln
1715
  - wls
1716
  - wlv
 
1717
  - wmt
1718
  - wmw
1719
  - wnc
1720
  - wnu
 
1721
  - wob
1722
  - wol
1723
  - wos
1724
  - wrk
1725
  - wrs
 
1726
  - wsk
1727
  - wuu
1728
  - wuv
@@ -1732,15 +2047,16 @@ language:
1732
  - xbi
1733
  - xbr
1734
  - xed
 
1735
  - xho
1736
  - xla
1737
  - xmf
 
1738
  - xmv
1739
  - xnn
1740
  - xog
1741
  - xon
1742
  - xpe
1743
- - xqa
1744
  - xrb
1745
  - xsb
1746
  - xsi
@@ -1750,6 +2066,7 @@ language:
1750
  - xtd
1751
  - xtm
1752
  - xtn
 
1753
  - xuo
1754
  - yaa
1755
  - yad
@@ -1759,17 +2076,21 @@ language:
1759
  - yao
1760
  - yap
1761
  - yaq
 
 
 
1762
  - ybb
1763
  - yby
1764
  - ycn
1765
  - ydd
 
1766
  - yid
1767
  - yim
1768
  - yka
1769
- - ykg
1770
  - yle
1771
  - yli
1772
  - yml
 
1773
  - yom
1774
  - yon
1775
  - yor
@@ -1803,11 +2124,13 @@ language:
1803
  - zdj
1804
  - zea
1805
  - zgh
 
1806
  - zho
1807
  - zia
1808
  - ziw
1809
  - zlm
1810
  - zne
 
1811
  - zom
1812
  - zos
1813
  - zpa
@@ -1829,10 +2152,13 @@ language:
1829
  - zsr
1830
  - ztq
1831
  - zty
 
1832
  - zul
 
1833
  - zyb
1834
  - zyp
1835
  - zza
 
1836
  tags:
1837
  - text-classification
1838
  - language-identification
@@ -1840,6 +2166,7 @@ library_name: fasttext
1840
  datasets:
1841
  - cis-lmu/GlotSparse
1842
  - cis-lmu/GlotStoryBook
 
1843
  metrics:
1844
  - f1
1845
  ---
@@ -1850,10 +2177,10 @@ metrics:
1850
 
1851
  ## Description
1852
 
1853
- **GlotLID** is a Fasttext language identification (LID) model that supports more than **1600 languages**.
1854
 
1855
 
1856
- **Latest:** GlotLID is now updated to **V3**. V3 supports **2114 labels** (three-letter ISO codes with script). For more details on the supported languages and performance, as well as significant changes from previous versions, please refer to [https://github.com/cisnlp/GlotLID/blob/main/languages-v3.md](https://github.com/cisnlp/GlotLID/blob/main/languages-v3.md).
1857
 
1858
 
1859
  - **Demo:** [huggingface](https://huggingface.co/spaces/cis-lmu/glotlid-space)
@@ -1893,7 +2220,7 @@ If you are not a fan of huggingface_hub, then download the model directyly:
1893
 
1894
  ## License
1895
 
1896
- The model is distributed under the Apache License, Version 2.0.
1897
 
1898
  ## Version
1899
 
 
1
  ---
2
+ license: other
3
+ license_name: apache-2.0-plus-notices
4
+ license_link: LICENSE
5
  language:
6
+ - aah
7
+ - aai
8
  - aak
9
  - aau
10
  - aaz
11
+ - ab
12
  - aba
13
+ - abi
14
  - abk
15
  - abn
16
  - abq
17
+ - abs
18
  - abt
19
  - abx
20
  - aby
 
35
  - adi
36
  - adj
37
  - adl
38
+ - adx
39
  - ady
40
  - adz
41
  - aeb
42
  - aer
43
  - aeu
44
  - aey
45
+ - af
46
  - afb
47
  - afh
48
  - afr
 
65
  - aji
66
  - ajp
67
  - ajz
68
+ - ak
69
  - aka
70
  - akb
71
  - ake
 
81
  - alt
82
  - aly
83
  - alz
84
+ - am
85
  - ame
86
  - amf
87
  - amh
 
93
  - amr
94
  - amu
95
  - amx
96
+ - an
97
  - ang
98
  - anm
99
  - ann
100
+ - anp
101
  - anv
102
  - any
103
  - aoc
 
116
  - apw
117
  - apy
118
  - apz
119
+ - aqz
120
+ - ar
121
  - ara
122
  - arb
123
  - are
 
127
  - arn
128
  - arp
129
  - arq
130
+ - arr
131
  - ars
132
  - ary
133
  - arz
134
+ - as
135
  - asg
136
  - asm
137
  - aso
 
141
  - atd
142
  - atg
143
  - ati
144
+ - atj
145
+ - atq
146
  - att
147
  - auc
148
  - aui
149
  - auy
150
+ - av
151
  - ava
152
  - avk
153
+ - avn
154
  - avt
155
  - avu
156
  - awa
157
  - awb
158
  - awi
159
  - awx
160
+ - ay
161
  - aym
162
  - ayo
163
+ - ayp
164
  - ayr
165
+ - az
166
  - azb
167
  - aze
168
  - azg
169
  - azj
170
  - azz
171
+ - ba
172
  - bak
173
  - bal
174
  - bam
 
181
  - bbb
182
  - bbc
183
  - bbj
184
+ - bbk
185
+ - bbo
186
  - bbr
187
  - bcc
188
  - bch
 
192
  - bcw
193
  - bdd
194
  - bdh
195
+ - bdq
196
+ - be
197
  - bea
198
  - bef
199
  - bel
 
201
  - ben
202
  - beq
203
  - ber
204
+ - bew
205
  - bex
206
  - bfd
207
  - bfo
208
  - bfz
209
+ - bg
210
  - bgr
211
  - bgs
212
+ - bgt
213
  - bgz
214
  - bhg
215
  - bhl
216
  - bho
217
  - bhp
218
  - bhw
219
+ - bhz
220
+ - bi
221
  - bib
222
  - big
223
  - bih
 
232
  - bjr
233
  - bjv
234
  - bkd
235
+ - bkl
236
  - bkq
237
  - bku
238
  - bkv
239
  - bla
240
  - blh
241
+ - blk
242
+ - blt
243
  - blw
244
  - blz
245
+ - bm
246
  - bmb
247
  - bmh
248
  - bmk
249
  - bmq
250
  - bmr
251
  - bmu
252
+ - bmv
253
+ - bn
254
  - bnj
255
+ - bno
256
  - bnp
257
+ - bo
258
  - boa
259
  - bod
260
  - boj
261
  - bom
262
  - bon
263
+ - bor
264
  - bos
265
  - bov
266
  - box
 
270
  - bqc
271
  - bqj
272
  - bqp
273
+ - br
274
  - bre
275
+ - brh
276
  - bru
277
  - brx
278
+ - bs
279
  - bsc
280
  - bsn
281
  - bsp
 
294
  - bul
295
  - bum
296
  - bus
297
+ - bvc
298
+ - bvd
299
  - bvr
300
  - bvy
301
  - bvz
 
313
  - bzi
314
  - bzj
315
  - bzt
316
+ - ca
317
  - caa
318
  - cab
319
  - cac
 
328
  - cat
329
  - cav
330
  - cax
 
331
  - cbc
332
  - cbi
333
  - cbk
 
338
  - cbv
339
  - cce
340
  - cco
341
+ - ccp
342
+ - cdf
343
+ - ce
344
  - ceb
345
  - ceg
346
  - cek
 
348
  - cfm
349
  - cgc
350
  - cgg
351
+ - ch
352
  - cha
353
  - chd
354
  - che
355
  - chf
 
356
  - chj
357
  - chk
358
  - chn
 
368
  - cjp
369
  - cjs
370
  - cjv
 
371
  - ckb
372
+ - ckm
373
  - cko
374
  - ckt
375
  - cle
 
379
  - cmi
380
  - cmn
381
  - cmo
382
+ - cmr
383
  - cnh
384
  - cni
385
+ - cnk
386
  - cnl
387
+ - cnr
388
  - cnt
389
  - cnw
390
+ - co
391
  - coe
392
  - cof
393
  - cok
 
396
  - cor
397
  - cos
398
  - cot
399
+ - cou
400
  - cpa
401
  - cpb
402
  - cpc
 
403
  - cpu
404
  - cpy
405
  - crh
406
  - cri
407
+ - crj
408
  - crk
409
+ - crl
410
  - crm
411
  - crn
412
  - crq
413
  - crs
414
  - crt
415
  - crx
416
+ - cs
417
  - csb
418
  - csk
419
  - cso
 
424
  - cto
425
  - ctp
426
  - ctu
427
+ - cu
428
  - cub
429
  - cuc
430
  - cui
 
432
  - cul
433
  - cut
434
  - cux
435
+ - cv
436
  - cwd
437
  - cwe
438
  - cwt
439
+ - cy
440
  - cya
441
  - cym
442
  - czt
443
+ - da
444
  - daa
445
  - dad
446
  - daf
447
+ - dag
448
  - dah
449
  - dak
450
  - dan
451
  - dar
452
+ - dbq
453
  - ddg
454
+ - ddn
455
+ - de
456
  - ded
457
  - des
458
  - deu
 
464
  - dhg
465
  - dhm
466
  - dhv
467
+ - did
468
  - dig
469
  - dik
470
  - din
 
477
  - djk
478
  - djr
479
  - dks
480
+ - dln
481
  - dng
482
  - dnj
483
+ - dnw
484
  - dob
485
+ - doi
486
  - dop
487
+ - dos
488
  - dow
489
  - drg
490
  - drt
491
  - dru
492
  - dsb
493
+ - dsh
494
+ - dtb
495
  - dtp
496
  - dts
497
+ - dty
498
  - dua
499
  - due
500
  - dug
501
  - duo
502
  - dur
503
+ - dv
504
  - dwr
505
  - dws
506
  - dww
507
  - dyi
508
  - dyo
509
  - dyu
510
+ - dz
511
  - dzo
512
  - ebk
513
+ - ee
514
  - efi
515
  - egl
516
  - eka
517
  - ekk
518
  - eko
519
+ - el
520
  - ell
521
+ - eme
522
  - emi
523
  - eml
524
  - emp
525
+ - en
526
  - enb
527
  - eng
528
  - enl
529
  - enm
530
+ - enq
531
  - enx
532
+ - eo
533
  - epo
534
  - eri
535
+ - es
536
  - ese
537
  - esi
538
  - esk
539
+ - ess
540
  - est
541
  - esu
542
+ - et
543
  - eto
544
  - etr
545
  - etu
546
+ - eu
547
  - eus
548
  - eve
549
  - evn
 
551
  - ewo
552
  - ext
553
  - eza
554
+ - fa
555
  - faa
556
  - fad
557
  - fai
558
  - fal
559
  - fan
560
  - fao
561
+ - far
562
  - fas
563
  - fat
564
  - ffm
565
+ - fi
566
  - fij
567
  - fil
568
  - fin
569
+ - fit
570
+ - fj
571
  - fkv
572
  - fmp
573
+ - fmu
574
+ - fo
575
  - fon
576
  - for
577
+ - fr
578
  - fra
579
+ - frd
580
  - frm
581
  - fro
582
+ - frp
583
  - frr
584
  - fry
585
  - fub
 
586
  - fud
587
  - fue
588
  - fuf
 
590
  - fuq
591
  - fur
592
  - fuv
593
+ - fy
594
+ - ga
595
  - gaa
596
  - gag
597
  - gah
 
601
  - gaz
602
  - gba
603
  - gbi
 
604
  - gbo
605
  - gbr
606
  - gcf
607
  - gcr
608
+ - gd
609
  - gde
610
  - gdg
611
  - gdn
 
621
  - gjn
622
  - gkn
623
  - gkp
624
+ - gl
625
  - gla
626
  - gle
627
  - glg
628
  - glk
629
  - glv
630
+ - gmh
631
  - gmv
632
+ - gn
633
+ - gna
634
  - gnb
635
  - gnd
636
  - gng
 
639
  - goa
640
  - gof
641
  - gog
642
+ - goh
643
  - gom
644
  - gor
645
  - gos
 
650
  - grt
651
  - gso
652
  - gsw
653
+ - gu
654
  - gub
655
  - guc
656
  - gud
 
665
  - guo
666
  - guq
667
  - gur
668
+ - guu
669
  - guw
670
  - gux
671
  - guz
672
+ - gv
673
  - gvc
674
  - gvf
675
  - gvl
676
  - gvn
677
  - gwi
678
+ - gwr
679
  - gxx
680
  - gya
681
  - gym
682
  - gyr
683
+ - ha
684
+ - hac
685
  - hae
686
  - hag
687
  - hak
 
693
  - hbo
694
  - hbs
695
  - hch
696
+ - he
697
  - heb
698
  - heg
699
  - heh
700
  - her
701
+ - hi
702
  - hif
703
  - hig
704
  - hil
 
713
  - hnj
714
  - hnn
715
  - hns
716
+ - ho
717
  - hoc
718
  - hop
719
  - hot
720
+ - hr
721
  - hra
722
  - hrv
723
  - hrx
724
  - hsb
725
+ - ht
726
  - hto
727
+ - hu
728
  - hub
729
  - hui
730
  - hun
 
733
  - huv
734
  - hvn
735
  - hwc
736
+ - hy
737
  - hye
738
  - hyw
739
+ - hz
740
+ - ia
741
  - ian
742
  - iba
743
  - ibg
744
  - ibo
745
  - icr
746
+ - id
747
  - ido
748
  - idu
749
+ - ie
750
  - ifa
751
  - ifb
752
  - ife
753
  - ifk
754
  - ifu
755
  - ify
756
+ - ig
757
  - ige
758
  - ign
759
  - igs
760
+ - ii
761
  - iii
762
  - ijc
763
  - ike
764
  - ikk
765
+ - ikt
766
  - ikw
767
  - ilb
768
  - ile
 
771
  - ina
772
  - inb
773
  - ind
774
+ - inh
775
  - ino
776
+ - io
777
  - iou
778
  - ipi
779
  - iqw
780
  - iri
781
  - irk
782
  - iry
783
+ - is
784
  - isd
785
  - ish
786
  - isl
787
  - iso
788
+ - it
789
  - ita
790
+ - itl
791
  - its
792
  - itv
793
  - ium
 
798
  - izh
799
  - izr
800
  - izz
801
+ - ja
802
+ - jaa
803
  - jac
804
  - jae
805
  - jam
 
814
  - jpa
815
  - jpn
816
  - jra
817
+ - jun
818
+ - jv
819
  - jvn
820
+ - ka
821
  - kaa
822
  - kab
823
  - kac
824
+ - kak
825
  - kal
826
  - kam
827
  - kan
 
835
  - kbd
836
  - kbh
837
  - kbm
838
+ - kbo
839
  - kbp
840
  - kbq
841
  - kbr
842
+ - kby
843
+ - kca
844
+ - kcg
845
  - kck
846
  - kdc
847
  - kde
 
849
  - kdi
850
  - kdj
851
  - kdl
852
+ - kdp
853
+ - kdr
854
  - kea
855
  - kei
856
  - kek
857
  - ken
858
+ - keo
859
+ - ker
860
  - kew
861
  - kex
862
  - kez
863
  - kff
864
+ - kg
865
  - kgf
866
  - kgk
867
  - kgp
868
+ - kgr
869
  - kha
870
+ - khg
871
  - khk
872
  - khm
873
+ - khq
874
  - khs
875
  - khy
876
  - khz
877
+ - ki
878
  - kia
879
+ - kij
880
  - kik
881
  - kin
882
  - kir
883
  - kiu
884
  - kix
885
+ - kj
886
  - kjb
887
  - kje
888
  - kjh
889
  - kjs
890
+ - kk
891
  - kkc
892
  - kki
893
  - kkj
894
  - kkl
895
+ - kl
896
+ - kle
897
  - kln
898
  - klt
899
  - klv
900
+ - km
901
  - kma
902
  - kmb
903
+ - kmd
904
  - kmg
905
  - kmh
906
  - kmk
 
909
  - kmr
910
  - kms
911
  - kmu
912
+ - kmy
913
+ - kn
914
  - knc
915
  - kne
916
  - knf
 
921
  - knv
922
  - knx
923
  - kny
924
+ - ko
925
  - kog
926
  - koi
927
  - kom
 
929
  - koo
930
  - kor
931
  - kos
932
+ - kpe
933
  - kpf
934
  - kpg
935
  - kpj
936
+ - kpq
937
  - kpr
938
  - kpv
939
  - kpw
940
  - kpx
941
  - kpz
942
+ - kqa
943
  - kqc
944
  - kqe
945
  - kqf
 
955
  - krj
956
  - krl
957
  - kru
958
+ - krx
959
+ - ks
960
  - ksb
961
  - ksc
962
  - ksd
 
972
  - ktm
973
  - kto
974
  - ktu
975
+ - ktz
976
  - kua
977
  - kub
978
  - kud
 
981
  - kum
982
  - kup
983
  - kus
984
+ - kv
985
+ - kvg
986
  - kvj
987
  - kvn
988
+ - kw
989
  - kwd
990
  - kwf
991
  - kwi
 
995
  - kxc
996
  - kxm
997
  - kxw
998
+ - ky
999
  - kyc
1000
  - kyf
1001
  - kyg
 
1006
  - kzf
1007
  - kzj
1008
  - kzn
1009
+ - la
1010
  - lac
1011
  - lad
1012
  - lai
1013
  - laj
1014
  - lam
1015
  - lao
1016
+ - lap
1017
  - las
1018
  - lat
1019
  - lav
1020
+ - law
1021
+ - lb
1022
  - lbb
1023
  - lbe
1024
  - lbj
1025
  - lbk
1026
  - lch
1027
  - lcm
1028
+ - lcp
1029
  - ldi
1030
  - ldn
1031
  - lea
 
1039
  - lex
1040
  - lez
1041
  - lfn
1042
+ - lg
1043
+ - lgg
1044
+ - lgl
1045
  - lgm
1046
  - lhi
1047
  - lhm
1048
  - lhu
1049
+ - li
1050
  - lia
1051
  - lid
1052
  - lif
 
1055
  - lin
1056
  - lip
1057
  - lir
1058
+ - lis
1059
  - lit
1060
  - liv
1061
  - ljp
1062
+ - lki
1063
  - llb
1064
  - lld
1065
+ - llg
1066
  - lln
1067
  - lmk
1068
  - lmo
1069
  - lmp
1070
+ - ln
1071
+ - lnd
1072
+ - lo
1073
  - lob
1074
  - loe
1075
  - log
1076
+ - lok
1077
  - lol
1078
  - lom
1079
  - loq
 
1080
  - loz
1081
+ - lrc
1082
  - lsi
1083
  - lsm
1084
+ - lt
1085
  - ltg
1086
  - ltz
1087
+ - lu
1088
  - lua
1089
  - lub
1090
+ - luc
1091
+ - lud
1092
  - lue
1093
  - lug
1094
  - lun
1095
  - luo
1096
  - lus
1097
  - lut
1098
+ - lv
1099
  - lvs
1100
+ - lwg
1101
  - lwo
1102
  - lww
1103
  - lzh
 
1151
  - men
1152
  - meq
1153
  - mer
1154
+ - met
1155
  - meu
1156
  - mev
1157
  - mfa
1158
  - mfe
1159
+ - mfg
1160
  - mfh
1161
  - mfi
1162
  - mfk
1163
  - mfq
1164
  - mfy
1165
  - mfz
1166
+ - mg
1167
  - mgc
1168
  - mgh
1169
  - mgm
1170
  - mgo
1171
  - mgr
1172
  - mgv
1173
+ - mh
1174
  - mhi
1175
  - mhl
1176
  - mhr
1177
  - mhw
1178
  - mhx
1179
  - mhy
1180
+ - mi
1181
  - mib
1182
  - mic
1183
  - mie
 
1186
  - mih
1187
  - mik
1188
  - mil
1189
+ - mim
1190
  - min
1191
  - mio
1192
+ - mip
1193
  - miq
1194
  - mir
1195
  - mit
 
1197
  - miz
1198
  - mjc
1199
  - mjw
1200
+ - mk
1201
  - mkd
1202
  - mkl
1203
  - mkn
1204
  - mks
1205
  - mkz
1206
+ - ml
1207
  - mlg
1208
  - mlh
1209
  - mlp
 
1212
  - mmn
1213
  - mmo
1214
  - mmx
1215
+ - mn
1216
  - mna
1217
  - mnb
 
1218
  - mnf
1219
  - mni
1220
  - mnk
1221
+ - mns
1222
  - mnw
1223
  - mnx
1224
  - mny
 
1241
  - mqb
1242
  - mqj
1243
  - mqy
1244
+ - mr
1245
  - mrg
1246
  - mri
1247
  - mrj
1248
  - mrq
1249
  - mrv
1250
  - mrw
1251
+ - ms
1252
  - msa
1253
  - msb
1254
  - msc
 
1256
  - msk
1257
  - msm
1258
  - msy
1259
+ - mt
1260
  - mta
1261
  - mtg
1262
  - mti
 
1266
  - mua
1267
  - mug
1268
  - muh
1269
+ - mui
1270
+ - mup
1271
  - mur
1272
  - mus
1273
  - mux
 
1275
  - mva
1276
  - mvn
1277
  - mvp
 
1278
  - mwc
1279
  - mwf
1280
  - mwl
 
1289
  - mxq
1290
  - mxt
1291
  - mxv
1292
+ - my
1293
  - mya
1294
  - myb
1295
  - myk
 
1311
  - nah
1312
  - nak
1313
  - nan
 
1314
  - nap
1315
  - naq
1316
  - nas
 
1317
  - nav
1318
  - naw
1319
+ - nb
1320
  - nba
1321
  - nbc
1322
  - nbe
 
1327
  - nch
1328
  - ncj
1329
  - ncl
1330
+ - ncq
1331
  - nct
1332
  - ncu
1333
  - ncx
1334
+ - nd
1335
  - ndc
1336
  - nde
1337
  - ndh
 
1340
  - ndo
1341
  - ndp
1342
  - nds
1343
+ - ndy
1344
  - ndz
1345
+ - ne
1346
  - neb
1347
  - nep
1348
  - new
1349
  - nfa
1350
  - nfr
1351
+ - ng
1352
  - ngb
1353
  - ngc
1354
  - ngl
1355
  - ngp
 
1356
  - ngu
1357
  - nhd
1358
  - nhe
 
1371
  - nij
1372
  - nim
1373
  - nin
1374
+ - nio
1375
  - niq
1376
  - niu
 
1377
  - niy
1378
  - njb
1379
  - njm
 
1381
  - njo
1382
  - njz
1383
  - nka
1384
+ - nkf
1385
  - nki
1386
  - nko
1387
+ - nl
1388
  - nla
1389
  - nlc
1390
  - nld
1391
+ - nlg
1392
  - nma
1393
  - nmf
1394
  - nmh
1395
  - nmo
1396
  - nmw
1397
  - nmz
1398
+ - nn
1399
  - nnb
1400
  - nng
1401
  - nnh
 
1404
  - nnp
1405
  - nnq
1406
  - nnw
1407
+ - no
1408
  - noa
1409
  - nob
1410
+ - nod
1411
  - nog
1412
  - non
1413
  - nop
 
1420
  - npl
1421
  - npo
1422
  - npy
1423
+ - nqo
1424
+ - nr
1425
  - nre
1426
  - nrf
1427
  - nri
1428
+ - nrm
1429
  - nsa
1430
  - nse
1431
  - nsm
 
1436
  - nsu
1437
  - ntp
1438
  - ntr
1439
+ - ntu
1440
+ - nuj
1441
  - nus
1442
  - nuy
1443
+ - nuz
1444
+ - nv
1445
  - nvm
1446
  - nwb
1447
  - nwi
1448
  - nwx
1449
  - nxd
1450
+ - ny
1451
  - nya
1452
  - nyf
1453
  - nyk
 
1455
  - nyo
1456
  - nyu
1457
  - nyy
1458
+ - nza
1459
  - nzb
1460
  - nzi
1461
  - nzm
 
1462
  - obo
1463
+ - oc
1464
  - oci
 
1465
  - ogo
1466
+ - oj
1467
  - ojb
1468
  - oji
1469
  - ojs
1470
  - oke
1471
+ - oku
1472
  - okv
1473
  - old
1474
+ - olo
1475
+ - om
1476
+ - omb
1477
  - omw
1478
  - ong
1479
  - ons
1480
  - ood
1481
  - opm
1482
+ - or
1483
  - ori
1484
  - orm
1485
  - orv
1486
  - ory
1487
+ - os
1488
  - oss
1489
  - ota
1490
+ - otd
1491
  - ote
1492
  - otm
1493
  - otn
1494
+ - oto
1495
  - otq
1496
  - ots
1497
  - otw
1498
  - oym
1499
  - ozm
1500
+ - pa
1501
  - pab
1502
  - pad
1503
  - pag
 
1519
  - pdt
1520
  - pem
1521
  - pes
1522
+ - pez
1523
  - pfe
1524
  - pfl
1525
  - phm
 
1526
  - pib
1527
  - pid
1528
+ - pih
1529
  - pio
1530
  - pir
1531
  - pis
1532
  - pjt
1533
  - pkb
1534
+ - pl
1535
  - plg
 
1536
  - pls
1537
  - plt
1538
  - plu
1539
  - plw
1540
  - pma
1541
  - pmf
1542
+ - pmq
1543
  - pms
1544
  - pmx
1545
  - pnb
1546
  - pne
1547
+ - pnt
1548
+ - pny
1549
  - poe
1550
  - poh
1551
  - poi
1552
  - pol
1553
  - pon
1554
  - por
1555
+ - pos
1556
  - pot
1557
  - pov
1558
  - poy
 
1564
  - prg
1565
  - pri
1566
  - prk
1567
+ - prq
1568
  - prs
1569
+ - ps
1570
  - pse
1571
+ - pss
1572
+ - pt
1573
  - ptp
1574
  - ptu
1575
  - pua
1576
+ - pui
1577
  - pus
1578
  - pwg
1579
+ - pwn
1580
  - pww
1581
+ - pxm
1582
+ - qu
1583
  - qub
1584
  - quc
1585
  - que
 
1608
  - qxl
1609
  - qxn
1610
  - qxo
 
1611
  - qxr
1612
  - qya
1613
  - rad
1614
  - rai
1615
  - rap
1616
  - rar
1617
+ - rav
1618
+ - raw
1619
  - rcf
1620
+ - rej
1621
+ - rel
1622
+ - rgu
1623
  - rhg
1624
  - ria
1625
  - rif
1626
  - rim
1627
+ - rjs
1628
  - rkb
1629
+ - rm
1630
  - rmc
1631
  - rme
1632
  - rml
 
1634
  - rmo
1635
  - rmq
1636
  - rmy
1637
+ - rn
1638
  - rnd
1639
  - rng
1640
  - rnl
1641
+ - ro
1642
  - roh
1643
  - rom
1644
  - ron
1645
  - roo
1646
  - rop
1647
+ - row
1648
  - rro
1649
  - rtm
1650
+ - ru
1651
  - rub
1652
  - rue
1653
  - ruf
1654
+ - rug
1655
  - run
1656
  - rup
1657
  - rus
1658
+ - rw
1659
  - rwo
1660
+ - sa
1661
  - sab
1662
  - sag
1663
  - sah
1664
+ - saj
1665
  - san
1666
  - sas
1667
  - sat
1668
+ - say
1669
  - sba
1670
  - sbd
1671
  - sbe
1672
  - sbl
1673
  - sbs
1674
  - sby
1675
+ - sc
1676
  - sck
1677
  - scn
1678
  - sco
1679
+ - sd
1680
  - sda
1681
+ - sdc
1682
  - sdh
1683
  - sdo
1684
+ - sdq
1685
+ - se
1686
  - seh
1687
+ - sel
1688
  - ses
1689
  - sey
1690
  - sfw
1691
+ - sg
1692
  - sgb
1693
+ - sgc
1694
  - sgh
1695
  - sgs
1696
  - sgw
1697
  - sgz
1698
+ - sh
1699
  - shi
1700
  - shk
1701
  - shn
 
1704
  - shs
1705
  - shu
1706
  - shy
1707
+ - si
1708
  - sid
1709
  - sig
1710
  - sil
 
1712
  - sin
1713
  - sja
1714
  - sjn
1715
+ - sjo
1716
+ - sju
1717
+ - sk
1718
  - skg
1719
  - skr
1720
+ - sl
1721
  - sld
1722
  - slk
1723
  - sll
1724
  - slv
1725
+ - sm
1726
  - sma
1727
  - sme
1728
+ - smj
1729
  - smk
1730
  - sml
1731
+ - smn
1732
  - smo
1733
+ - sms
1734
  - smt
1735
+ - sn
1736
  - sna
1737
  - snc
1738
  - snd
 
1741
  - snp
1742
  - snw
1743
  - sny
1744
+ - so
1745
  - soe
1746
  - som
1747
  - sop
 
1754
  - spp
1755
  - sps
1756
  - spy
1757
+ - sq
1758
  - sqi
1759
+ - sr
1760
  - srd
1761
  - sri
1762
  - srm
 
1764
  - srp
1765
  - srq
1766
  - srr
1767
+ - ss
1768
  - ssd
1769
  - ssg
1770
  - ssw
1771
  - ssx
1772
+ - st
1773
  - stn
1774
  - stp
1775
  - stq
1776
+ - su
1777
  - sua
1778
  - suc
1779
  - sue
 
1783
  - sus
1784
  - sux
1785
  - suz
1786
+ - sv
1787
+ - sw
1788
  - swa
1789
  - swb
1790
  - swc
 
1800
  - syl
1801
  - szb
1802
  - szl
1803
+ - szy
1804
+ - ta
1805
  - tab
1806
  - tac
1807
  - tah
 
1813
  - tat
1814
  - tav
1815
  - taw
1816
+ - tay
1817
  - tbc
1818
  - tbg
1819
  - tbk
1820
  - tbl
1821
  - tbo
1822
+ - tbw
1823
  - tby
1824
  - tbz
1825
  - tca
 
1830
  - tcz
1831
  - tdt
1832
  - tdx
1833
+ - te
1834
  - ted
1835
  - tee
1836
  - tel
 
1840
  - tet
1841
  - tew
1842
  - tfr
1843
+ - tg
1844
  - tgk
1845
  - tgl
1846
  - tgo
1847
  - tgp
1848
+ - th
1849
  - tha
1850
  - thk
1851
+ - thl
1852
  - thv
1853
+ - ti
1854
  - tif
1855
  - tig
1856
  - tih
 
1859
  - tir
1860
  - tiv
1861
  - tiy
1862
+ - tk
1863
  - tke
1864
  - tkl
1865
  - tkr
1866
  - tku
1867
+ - tl
1868
  - tlb
1869
  - tlf
1870
  - tlh
1871
  - tlj
1872
  - tll
1873
  - tly
1874
+ - tmc
1875
  - tmd
1876
  - tmr
1877
+ - tn
1878
  - tna
1879
  - tnc
1880
  - tnk
1881
  - tnn
1882
  - tnp
1883
+ - tnr
1884
+ - to
1885
  - tob
1886
  - toc
1887
  - tod
 
1897
  - tpa
1898
  - tpi
1899
  - tpm
1900
+ - tpn
1901
  - tpp
1902
  - tpt
1903
  - tpw
1904
  - tpz
1905
  - tqb
1906
+ - tqo
1907
+ - tr
1908
  - trc
1909
  - trn
1910
  - tro
1911
  - trp
1912
  - trq
1913
+ - trs
1914
+ - trv
1915
+ - ts
1916
  - tsc
1917
  - tsg
1918
  - tsn
1919
  - tso
1920
  - tsw
1921
  - tsz
1922
+ - tt
1923
  - ttc
1924
  - tte
1925
  - ttj
 
1930
  - tuf
1931
  - tui
1932
  - tuk
1933
+ - tul
1934
  - tum
1935
  - tuo
1936
  - tur
1937
  - tuv
1938
  - tvk
1939
  - tvl
1940
+ - tw
1941
+ - twb
1942
  - twi
1943
  - twu
1944
  - twx
1945
  - txq
1946
  - txu
1947
+ - ty
1948
  - tyv
1949
  - tzh
1950
  - tzj
 
1955
  - ubu
1956
  - udm
1957
  - udu
1958
+ - ug
1959
  - uig
1960
+ - uk
1961
  - ukr
1962
  - umb
1963
+ - und
1964
  - upv
1965
+ - ur
1966
  - ura
1967
  - urb
1968
  - urd
 
1970
  - uri
1971
  - urk
1972
  - urt
1973
+ - urw
1974
+ - ury
1975
  - usa
1976
  - usp
1977
+ - uth
1978
  - uvh
1979
  - uvl
1980
+ - uz
1981
  - uzb
1982
  - uzn
1983
+ - uzs
1984
  - vag
1985
  - vap
1986
  - var
1987
+ - ve
1988
  - vec
1989
  - ven
1990
  - vep
1991
  - vgt
1992
+ - vi
1993
  - vid
1994
  - vie
1995
  - viv
 
1997
  - vmk
1998
  - vmw
1999
  - vmy
2000
+ - vo
2001
  - vol
2002
+ - vot
2003
  - vro
2004
  - vun
2005
  - vut
2006
+ - wa
2007
  - waj
2008
  - wal
2009
  - wap
 
2026
  - wln
2027
  - wls
2028
  - wlv
2029
+ - wlx
2030
  - wmt
2031
  - wmw
2032
  - wnc
2033
  - wnu
2034
+ - wo
2035
  - wob
2036
  - wol
2037
  - wos
2038
  - wrk
2039
  - wrs
2040
+ - wsg
2041
  - wsk
2042
  - wuu
2043
  - wuv
 
2047
  - xbi
2048
  - xbr
2049
  - xed
2050
+ - xh
2051
  - xho
2052
  - xla
2053
  - xmf
2054
+ - xmm
2055
  - xmv
2056
  - xnn
2057
  - xog
2058
  - xon
2059
  - xpe
 
2060
  - xrb
2061
  - xsb
2062
  - xsi
 
2066
  - xtd
2067
  - xtm
2068
  - xtn
2069
+ - xum
2070
  - xuo
2071
  - yaa
2072
  - yad
 
2076
  - yao
2077
  - yap
2078
  - yaq
2079
+ - yas
2080
+ - yat
2081
+ - yaz
2082
  - ybb
2083
  - yby
2084
  - ycn
2085
  - ydd
2086
+ - yi
2087
  - yid
2088
  - yim
2089
  - yka
 
2090
  - yle
2091
  - yli
2092
  - yml
2093
+ - yo
2094
  - yom
2095
  - yon
2096
  - yor
 
2124
  - zdj
2125
  - zea
2126
  - zgh
2127
+ - zh
2128
  - zho
2129
  - zia
2130
  - ziw
2131
  - zlm
2132
  - zne
2133
+ - zoc
2134
  - zom
2135
  - zos
2136
  - zpa
 
2152
  - zsr
2153
  - ztq
2154
  - zty
2155
+ - zu
2156
  - zul
2157
+ - zxx
2158
  - zyb
2159
  - zyp
2160
  - zza
2161
+
2162
  tags:
2163
  - text-classification
2164
  - language-identification
 
2166
  datasets:
2167
  - cis-lmu/GlotSparse
2168
  - cis-lmu/GlotStoryBook
2169
+ - cis-lmu/glotlid-corpus
2170
  metrics:
2171
  - f1
2172
  ---
 
2177
 
2178
  ## Description
2179
 
2180
+ **GlotLID** is a Fasttext language identification (LID) model that supports more than **2000 labels**.
2181
 
2182
 
2183
+ **Latest:** GlotLID is now updated to **V3**. V3 supports **2102 labels** (three-letter ISO codes with script). For more details on the supported languages and performance, as well as significant changes from previous versions, please refer to [https://github.com/cisnlp/GlotLID/blob/main/languages-v3.md](https://github.com/cisnlp/GlotLID/blob/main/languages-v3.md).
2184
 
2185
 
2186
  - **Demo:** [huggingface](https://huggingface.co/spaces/cis-lmu/glotlid-space)
 
2220
 
2221
  ## License
2222
 
2223
+ The model is distributed under the Apache License, Version 2.0 plus notices (see LICENSE file for full terms).
2224
 
2225
  ## Version
2226