Commit | Line | Data |
---|---|---|
ea1898dd EK |
1 | AkH-14-a1 acél ; The "AkH" tests are from: |
2 | AkH-14-a1 cukor ; | |
3 | AkH-14-a1 csók ; A magyar helyesírás szabályai, 12. kiadás | |
4 | AkH-14-a1 gép ; [The Rules of Hungarian Orthography, 12th edition] | |
5 | AkH-14-a1 hideg ; | |
6 | AkH-14-a1 kettő ; often referred to as akadémiai helyesírás (AkH.) [academic orthography] | |
7 | AkH-14-a1 Nagy ; | |
8 | AkH-14-a1 nyúl ; http://helyesiras.mta.hu/helyesiras/default/akh12 | |
9 | AkH-14-a1 olasz ; | |
10 | AkH-14-a1 öröm ; Alphabetical ordering described in #14-16. | |
11 | AkH-14-a1 remény | |
12 | AkH-14-a1 sokáig ; #14-a1: Sort based on first letter. | |
13 | AkH-14-a1 szabad | |
14 | AkH-14-a1 Tamás | |
15 | AkH-14-a1 vásárol | |
16 | AkH-14-a2 jácint ; #14-a2: If no other difference, lowercase initial precedes uppercase. | |
17 | AkH-14-a2 Jácint | |
18 | AkH-14-a2 opera | |
19 | AkH-14-a2 Opera | |
20 | AkH-14-a2 szűcs | |
21 | AkH-14-a2 Szűcs | |
22 | AkH-14-a2 viola | |
23 | AkH-14-a2 Viola | |
24 | AkH-14-a3 cudar ; #14-a3: Compound letters (cs, dz, dzs, gy, ly, ny, sz, ty, zs) | |
25 | AkH-14-a3 cukor ; are sorted separately, after their first letter: | |
26 | AkH-14-a3 cuppant ; a b c cs d dz dzs e f g gy h ... l ly m n ny o ... s sz t ty u ... z zs | |
27 | AkH-14-a3 csalit | |
28 | AkH-14-a3 csata | |
29 | AkH-14-a3 Csepel | |
30 | AkH-14-a3 Zoltán | |
31 | AkH-14-a3 zongora | |
32 | AkH-14-a3 zúdul | |
33 | AkH-14-a3 zsalu | |
34 | AkH-14-a3 zseni | |
35 | AkH-14-a3 Zsigmond | |
36 | AkH-14-b1 lom ; #14-b1: The first difference matters. | |
37 | AkH-14-b1 lomb | |
38 | AkH-14-b1 lombik | |
39 | AkH-14-b1 Lontay | |
40 | AkH-14-b1 lovagol | |
41 | AkH-14-b1 pirinkó | |
42 | AkH-14-b1 pirinyó | |
43 | AkH-14-b1 pirít | |
44 | AkH-14-b1 pirkad | |
45 | AkH-14-b1 Piroska | |
46 | AkH-14-b1 tükör | |
47 | AkH-14-b1 Tünde | |
48 | AkH-14-b1 tünemény | |
49 | AkH-14-b1 tüntet | |
50 | AkH-14-b1 tüzér | |
51 | AkH-14-b2 kas ; #14-b2: If a compound letter is pronounced long, only the first letter | |
52 | AkH-14-b2 Kasmír ; is duplicated in writing: <cs><cs> becomes ccs, <dzs><dzs> is ddzs etc. | |
53 | AkH-14-b2 Kassák ; (unless it's at the boundary of a compound word where it's written out twice). | |
54 | AkH-14-b2 kastély ; Sort according to the actual tokens, not the shorthand written form. | |
55 | AkH-14-b2 kasza ; <k><a><sz><a> | |
56 | AkH-14-b2 kaszinó ; <k><a><sz><i><n><ó> | |
57 | AkH-14-b2 kassza ; <k><a><sz><sz><a> | |
58 | AkH-14-b2 kaszt ; <k><a><sz><t> | |
59 | AkH-14-b2 mennek | |
60 | AkH-14-b2 mennének | |
61 | AkH-14-b2 menü | |
62 | AkH-14-b2 menza | |
63 | AkH-14-b2 meny ; <m><e><ny> | |
64 | AkH-14-b2 Menyhért ; <M><e><ny><h><é><r><t> | |
65 | AkH-14-b2 mennybolt ; <m><e><ny><ny><b><o><l><t> | |
66 | AkH-14-b2 mennyi ; <m><e><ny><ny><i> | |
67 | AkH-14-b2 nagy ; <n><a><gy> | |
68 | AkH-14-b2 naggyá ; <n><a><gy><gy><á> | |
69 | AkH-14-b2 nagygyakorlat ; <n><a><gy><gy><a><k><o><r><l><a><t> (compound word: nagy+gyakorlat) | |
70 | AkH-14-b2 naggyal ; <n><a><gy><gy><a><l> | |
71 | AkH-14-b2 nagyít ; <n><a><gy><í><t> | |
72 | AkH-14-b2 nagyobb | |
73 | AkH-14-b2 nagyol | |
74 | AkH-14-b2 nagyoll | |
75 | AkH-14-c1 ír ; #14-c1: Vowels collate equally in pairs: a-á, e-é, i-í, o-ó, ö-ő, u-ú, ü-ű. | |
76 | AkH-14-c1 Irak | |
77 | AkH-14-c1 iram | |
78 | AkH-14-c1 Irán | |
79 | AkH-14-c1 írandó | |
80 | AkH-14-c1 iránt | |
81 | AkH-14-c1 író | |
82 | AkH-14-c1 iroda | |
83 | AkH-14-c1 irónia | |
84 | AkH-14-c2 Eger ; #14-c2: Short vowel (unaccented, or with diaeresis) comes first if that's the only difference. | |
85 | AkH-14-c2 egér | |
86 | AkH-14-c2 egyfelé | |
87 | AkH-14-c2 egyféle | |
88 | AkH-14-c2 elöl | |
89 | AkH-14-c2 elől | |
90 | AkH-14-c2 kerek | |
91 | AkH-14-c2 kerék | |
92 | AkH-14-c2 keres | |
93 | AkH-14-c2 kérés | |
94 | AkH-14-c2 koros | |
95 | AkH-14-c2 kóros | |
96 | AkH-14-c2 szel | |
97 | AkH-14-c2 szél | |
98 | AkH-14-c2 szeles | |
99 | AkH-14-c2 széles | |
100 | AkH-14-c2 szüret | |
101 | AkH-14-c2 szűret | |
102 | AkH-14-d1 kis részben ; #14-d1: Spaces, hyphens are ignored. | |
103 | AkH-14-d1 kissé | |
104 | AkH-14-d1 Kiss Ernő | |
105 | AkH-14-d1 kis sorozat | |
106 | AkH-14-d1 kissorozat-gyártás | |
107 | AkH-14-d1 kis számban | |
108 | AkH-14-d1 kistányér | |
109 | AkH-14-d1 kis virág | |
110 | AkH-14-d1 márvány | |
111 | AkH-14-d1 márványkő | |
112 | AkH-14-d1 márvány sírkő | |
113 | AkH-14-d1 Márvány-tenger | |
114 | AkH-14-d1 márványtömb | |
115 | AkH-14-d1 Márvány Zsolt | |
116 | AkH-14-d1 másféle | |
117 | AkH-14-d1 másol | |
118 | AkH-14-d1 tiszafa | |
119 | AkH-14-d1 Tiszahát | |
120 | AkH-14-d1 Tisza Kálmán | |
121 | AkH-14-d1 Tisza menti | |
122 | AkH-14-d1 Tiszántúl | |
123 | AkH-14-d1 Tisza-part | |
124 | AkH-14-d1 tiszavirág | |
125 | AkH-14-d1 tiszt | |
126 | AkH-15 cérna ; #15: Foreign accents are ignored, unless they're the only difference, | |
127 | AkH-15 Černý ; in which case they are sorted after the Hungarian ones (in unspecified order). | |
128 | AkH-15 Champagne | |
129 | AkH-15 Cholnoky | |
130 | AkH-15 címez | |
131 | AkH-15 cukor | |
132 | AkH-15 Czuczor | |
133 | AkH-15 csapat | |
134 | AkH-15 Gaal | |
135 | AkH-15 galamb | |
136 | AkH-15 Gärtner | |
137 | AkH-15 gáz | |
138 | AkH-15 geodézia | |
139 | AkH-15 Georges | |
140 | AkH-15 góc | |
141 | AkH-15 Goethe | |
142 | AkH-15 moshat | |
143 | AkH-15 mosna | |
144 | AkH-15 Mošna | |
145 | AkH-15 mosópor | |
146 | AkH-15 Møsstrand | |
147 | AkH-15 mostan | |
148 | AkH-15 munka | |
149 | AkH-15 Muñoz | |
150 | alphabet a ; All the remaining tests were added by glibc. | |
151 | alphabet á | |
152 | alphabet aa ; a = á unless that's the only difference in which case a < á. | |
153 | alphabet aá ; (Same for e = é, i = í, o = ó, ö = ő, u = ú, ü = ű below.) | |
154 | alphabet áa ; Differences in accents matter from left to right. | |
155 | alphabet áá | |
156 | alphabet áp | |
157 | alphabet aq | |
158 | alphabet b | |
159 | alphabet c | |
160 | alphabet cz ; <c><z> | |
161 | alphabet cs ; <cs> -- or rarely <c><s>, can't tell for sure, assume <cs>. | |
162 | alphabet csc ; <cs><c> | |
163 | alphabet ccs ; <cs><cs> -- or rarely <c><cs>, can't tell for sure, assume <cs><cs>. | |
164 | alphabet cscs ; <cs><cs> -- Make sure ccs and cscs don't collate as equal, see bug 13547. | |
165 | alphabet ccsa ; <cs><cs><a> -- The order of ccs and cscs is not specified in the rules and is arbitrarily chosen by glibc. | |
166 | alphabet cscsa ; <cs><cs><a> | |
167 | alphabet csd ; <cs><d> -- (These comments also apply to all other compound letters below.) | |
168 | alphabet d | |
169 | alphabet dz ; <dz> | |
170 | alphabet dzd ; <dz><d> | |
171 | alphabet ddz ; <dz><dz> | |
172 | alphabet dzdz ; <dz><dz> | |
173 | alphabet ddza ; <dz><dz><a> | |
174 | alphabet dzdza ; <dz><dz><a> | |
175 | alphabet dzdzs ; <dz><dzs> | |
176 | alphabet dze ; <dz><e> | |
177 | alphabet dzz ; <dz><z> | |
178 | alphabet dzs ; <dzs> | |
179 | alphabet dzsdz ; <dzs><dz> | |
180 | alphabet ddzs ; <dzs><dzs> | |
181 | alphabet dzsdzs ; <dzs><dzs> | |
182 | alphabet ddzsa ; <dzs><dzs><a> | |
183 | alphabet dzsdzsa ; <dzs><dzs><a> | |
184 | alphabet dzse ; <dzs><e> | |
185 | alphabet e | |
186 | alphabet é | |
187 | alphabet ee | |
188 | alphabet eé | |
189 | alphabet ée | |
190 | alphabet éé | |
191 | alphabet ép | |
192 | alphabet eq | |
193 | alphabet f | |
194 | alphabet g | |
195 | alphabet gz ; <g><z> | |
196 | alphabet gy ; <gy> | |
197 | alphabet gyg ; <gy><g> | |
198 | alphabet ggy ; <gy><gy> | |
199 | alphabet gygy ; <gy><gy> | |
200 | alphabet ggya ; <gy><gy><a> | |
201 | alphabet gygya ; <gy><gy><a> | |
202 | alphabet gyh ; <gy><h> | |
203 | alphabet h | |
204 | alphabet i | |
205 | alphabet í | |
206 | alphabet ii | |
207 | alphabet ií | |
208 | alphabet íi | |
209 | alphabet íí | |
210 | alphabet íp | |
211 | alphabet iq | |
212 | alphabet j | |
213 | alphabet k | |
214 | alphabet l | |
215 | alphabet lz ; <l><z> | |
216 | alphabet ly ; <ly> | |
217 | alphabet lyl ; <ly><l> | |
218 | alphabet lly ; <ly><ly> | |
219 | alphabet lyly ; <ly><ly> | |
220 | alphabet llya ; <ly><ly><a> | |
221 | alphabet lylya ; <ly><ly><a> | |
222 | alphabet lym ; <ly><m> | |
223 | alphabet m | |
224 | alphabet n | |
225 | alphabet nz ; <n><z> | |
226 | alphabet ny ; <ny> | |
227 | alphabet nyn ; <ny><n> | |
228 | alphabet nny ; <ny><ny> | |
229 | alphabet nyny ; <ny><ny> | |
230 | alphabet nnya ; <ny><ny><a> | |
231 | alphabet nynya ; <ny><ny><a> | |
232 | alphabet nyo ; <ny><o> | |
233 | alphabet o | |
234 | alphabet ó | |
235 | alphabet oo | |
236 | alphabet oó | |
237 | alphabet óo | |
238 | alphabet óó | |
239 | alphabet óp | |
240 | alphabet oq | |
241 | alphabet ö ; ö = ő (unless that's the only difference), but these come strictly after o and ó. | |
242 | alphabet ő | |
243 | alphabet öö | |
244 | alphabet öő | |
245 | alphabet őö | |
246 | alphabet őő | |
247 | alphabet őp | |
248 | alphabet öq | |
249 | alphabet p | |
250 | alphabet q | |
251 | alphabet r | |
252 | alphabet s | |
253 | alphabet sz ; <sz> | |
254 | alphabet szs ; <sz><s> | |
255 | alphabet ssz ; <sz><sz> | |
256 | alphabet szsz ; <sz><sz> | |
257 | alphabet ssza ; <sz><sz><a> | |
258 | alphabet szsza ; <sz><sz><a> | |
259 | alphabet szt ; <sz><t> | |
260 | alphabet t | |
261 | alphabet tz ; <t><z> | |
262 | alphabet ty ; <ty> | |
263 | alphabet tyt ; <ty><t> | |
264 | alphabet tty ; <ty><ty> | |
265 | alphabet tyty ; <ty><ty> | |
266 | alphabet ttya ; <ty><ty><a> | |
267 | alphabet tytya ; <ty><ty><a> | |
268 | alphabet tyu ; <ty><u> | |
269 | alphabet u | |
270 | alphabet ú | |
271 | alphabet úp | |
272 | alphabet uq | |
273 | alphabet uu | |
274 | alphabet uú | |
275 | alphabet úu | |
276 | alphabet úú | |
277 | alphabet ü ; ü = ű (unless that's the only difference), but these come strictly after u and ú. | |
278 | alphabet ű | |
279 | alphabet űp | |
280 | alphabet üq | |
281 | alphabet üü | |
282 | alphabet üű | |
283 | alphabet űü | |
284 | alphabet űű | |
285 | alphabet v | |
286 | alphabet w | |
287 | alphabet x | |
288 | alphabet y | |
289 | alphabet z | |
290 | alphabet zz ; <z><z> | |
291 | alphabet zs ; <zs> | |
292 | alphabet zsz ; <zs><z> | |
293 | alphabet zzs ; <zs><zs> | |
294 | alphabet zszs ; <zs><zs> | |
295 | alphabet zzsa ; <zs><zs><a> | |
296 | alphabet zszsa ; <zs><zs><a> | |
297 | case a ; #14-a2 specifies that if the same word appears in lowercase as well as with | |
298 | case A ; uppercase initial, the lowercase one is to be sorted first. | |
299 | case á ; Arbitrarily extend this to all other weird combinations of upper- and lowercases in compound letters. | |
300 | case Á | |
301 | case cs ; <cs> | |
302 | case cS | |
303 | case Cs | |
304 | case CS | |
305 | case ccs ; <cs><cs> | |
306 | case ccS | |
307 | case cCs | |
308 | case cCS | |
309 | case Ccs | |
310 | case CcS | |
311 | case CCs | |
312 | case CCS | |
313 | case dz ; <dz> | |
314 | case dZ | |
315 | case Dz | |
316 | case DZ | |
317 | case ddz ; <dz><dz> | |
318 | case ddZ | |
319 | case dDz | |
320 | case dDZ | |
321 | case Ddz | |
322 | case DdZ | |
323 | case DDz | |
324 | case DDZ | |
325 | case dzs ; <dzs> | |
326 | case dzS | |
327 | case dZs | |
328 | case dZS | |
329 | case Dzs | |
330 | case DzS | |
331 | case DZs | |
332 | case DZS | |
333 | case ddzs ; <dzs><dzs> | |
334 | case ddzS | |
335 | case ddZs | |
336 | case ddZS | |
337 | case dDzs | |
338 | case dDzS | |
339 | case dDZs | |
340 | case dDZS | |
341 | case Ddzs | |
342 | case DdzS | |
343 | case DdZs | |
344 | case DdZS | |
345 | case DDzs | |
346 | case DDzS | |
347 | case DDZs | |
348 | case DDZS | |
349 | case e | |
350 | case E | |
351 | case é | |
352 | case É | |
353 | case gy ; <gy> | |
354 | case gY | |
355 | case Gy | |
356 | case GY | |
357 | case ggy ; <gy><gy> | |
358 | case ggY | |
359 | case gGy | |
360 | case gGY | |
361 | case Ggy | |
362 | case GgY | |
363 | case GGy | |
364 | case GGY | |
365 | case i | |
366 | case I | |
367 | case í | |
368 | case Í | |
369 | case ly ; <ly> | |
370 | case lY | |
371 | case Ly | |
372 | case LY | |
373 | case lly ; <ly><ly> | |
374 | case llY | |
375 | case lLy | |
376 | case lLY | |
377 | case Lly | |
378 | case LlY | |
379 | case LLy | |
380 | case LLY | |
381 | case ny ; <ny> | |
382 | case nY | |
383 | case Ny | |
384 | case NY | |
385 | case nny ; <ny><ny> | |
386 | case nnY | |
387 | case nNy | |
388 | case nNY | |
389 | case Nny | |
390 | case NnY | |
391 | case NNy | |
392 | case NNY | |
393 | case o | |
394 | case O | |
395 | case ó | |
396 | case Ó | |
397 | case ö | |
398 | case Ö | |
399 | case ő | |
400 | case Ő | |
401 | case sz ; <sz> | |
402 | case sZ | |
403 | case Sz | |
404 | case SZ | |
405 | case ssz ; <sz><sz> | |
406 | case ssZ | |
407 | case sSz | |
408 | case sSZ | |
409 | case Ssz | |
410 | case SsZ | |
411 | case SSz | |
412 | case SSZ | |
413 | case ty ; <ty> | |
414 | case tY | |
415 | case Ty | |
416 | case TY | |
417 | case tty ; <ty><ty> | |
418 | case ttY | |
419 | case tTy | |
420 | case tTY | |
421 | case Tty | |
422 | case TtY | |
423 | case TTy | |
424 | case TTY | |
425 | case u | |
426 | case U | |
427 | case ú | |
428 | case Ú | |
429 | case ü | |
430 | case Ü | |
431 | case ű | |
432 | case Ű | |
433 | case zs ; <zs> | |
434 | case zS | |
435 | case Zs | |
436 | case ZS | |
437 | case zzs ; <zs><zs> | |
438 | case zzS | |
439 | case zZs | |
440 | case zZS | |
441 | case Zzs | |
442 | case ZzS | |
443 | case ZZs | |
444 | case ZZS | |
445 | foreign-a1 á ; More thorough tests for foreign accents (#15). | |
446 | foreign-a1 à ; Each test consists of 4 lines. The foreign accent is in the middle two. | |
447 | foreign-a1 àp ; That is, on their own they come after the Hungarian accent, but a | |
448 | foreign-a1 áq ; subsequent difference (p and q) overrides this. | |
449 | foreign-a2 á | |
450 | foreign-a2 â | |
451 | foreign-a2 âp | |
452 | foreign-a2 áq | |
453 | foreign-a3 á | |
454 | foreign-a3 ã | |
455 | foreign-a3 ãp | |
456 | foreign-a3 áq | |
457 | foreign-a4 á | |
458 | foreign-a4 ä | |
459 | foreign-a4 äp | |
460 | foreign-a4 áq | |
461 | foreign-a5 á | |
462 | foreign-a5 å | |
463 | foreign-a5 åp | |
464 | foreign-a5 áq | |
465 | foreign-a6 á | |
466 | foreign-a6 ă | |
467 | foreign-a6 ăp | |
468 | foreign-a6 áq | |
469 | foreign-c1 c | |
470 | foreign-c1 ç | |
471 | foreign-c1 çp | |
472 | foreign-c1 cq | |
473 | foreign-d1 d | |
474 | foreign-d1 đ | |
475 | foreign-d1 đp | |
476 | foreign-d1 dq | |
477 | foreign-e1 é | |
478 | foreign-e1 è | |
479 | foreign-e1 èp | |
480 | foreign-e1 éq | |
481 | foreign-e2 é | |
482 | foreign-e2 ê | |
483 | foreign-e2 êp | |
484 | foreign-e2 éq | |
485 | foreign-e3 é | |
486 | foreign-e3 ë | |
487 | foreign-e3 ëp | |
488 | foreign-e3 éq | |
489 | foreign-e4 é | |
490 | foreign-e4 ě | |
491 | foreign-e4 ěp | |
492 | foreign-e4 éq | |
493 | foreign-i1 í | |
494 | foreign-i1 ì | |
495 | foreign-i1 ìp | |
496 | foreign-i1 íq | |
497 | foreign-i2 í | |
498 | foreign-i2 î | |
499 | foreign-i2 îp | |
500 | foreign-i2 íq | |
501 | foreign-i3 í | |
502 | foreign-i3 ï | |
503 | foreign-i3 ïp | |
504 | foreign-i3 íq | |
505 | foreign-l1 l | |
506 | foreign-l1 ł | |
507 | foreign-l1 łp | |
508 | foreign-l1 lq | |
509 | foreign-n1 n | |
510 | foreign-n1 ñ | |
511 | foreign-n1 ñp | |
512 | foreign-n1 nq | |
513 | foreign-n2 n | |
514 | foreign-n2 ň | |
515 | foreign-n2 ňp | |
516 | foreign-n2 nq | |
517 | foreign-o1 ó ; The rules are not explicit whether foreign accents on top of o or u | |
518 | foreign-o1 ò ; should be sorted among o-ó and u-ú, or among ö-ő and ü-ű, but the | |
519 | foreign-o1 òp ; AkH #15 example with Møsstrand implicitly shows that it's the former. | |
520 | foreign-o1 óq | |
521 | foreign-o2 ó | |
522 | foreign-o2 ô | |
523 | foreign-o2 ôp | |
524 | foreign-o2 óq | |
525 | foreign-o3 ó | |
526 | foreign-o3 õ | |
527 | foreign-o3 õp | |
528 | foreign-o3 óq | |
529 | foreign-o4 ó | |
530 | foreign-o4 ø | |
531 | foreign-o4 øp | |
532 | foreign-o4 óq | |
533 | foreign-r1 r | |
534 | foreign-r1 ř | |
535 | foreign-r1 řp | |
536 | foreign-r1 rq | |
537 | foreign-s1 s | |
538 | foreign-s1 š | |
539 | foreign-s1 šp | |
540 | foreign-s1 sq | |
541 | foreign-u1 ú | |
542 | foreign-u1 ù | |
543 | foreign-u1 ùp | |
544 | foreign-u1 úq | |
545 | foreign-u2 ú | |
546 | foreign-u2 û | |
547 | foreign-u2 ûp | |
548 | foreign-u2 úq | |
549 | foreign-u3 ú | |
550 | foreign-u3 ũ | |
551 | foreign-u3 ũp | |
552 | foreign-u3 úq | |
553 | foreign-u4 ú | |
554 | foreign-u4 ů | |
555 | foreign-u4 ůp | |
556 | foreign-u4 úq | |
557 | foreign-y1 y | |
558 | foreign-y1 ÿ | |
559 | foreign-y1 ÿp | |
560 | foreign-y1 yq |
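
The comments on rules #14-a through #14-d describe a multi-level comparison: split each word into letters (treating cs, dz, dzs, gy, ly, ny, sz, ty, zs as single letters and expanding doubled shorthand such as ccs or ssz), compare base letters first, then vowel length, then case. The following is a minimal Python sketch of that idea, not glibc's implementation. The names `tokenize` and `sort_key` are hypothetical, and the sketch makes several simplifying assumptions: it always prefers the compound-letter reading (as the test comments do for ambiguous cases), it does not handle foreign accents (#15), it attaches the uppercase flag to both halves of a doubled shorthand, and it does not implement the tie-breaker that keeps `ccs` and `cscs` from collating as equal.

```python
# Hungarian compound letters; "dzs" is listed first because it contains "dz".
COMPOUND = ["dzs", "cs", "dz", "gy", "ly", "ny", "sz", "ty", "zs"]

# Primary alphabet order; each long vowel shares the slot of its short pair.
ALPHABET = ["a", "b", "c", "cs", "d", "dz", "dzs", "e", "f", "g", "gy", "h",
            "i", "j", "k", "l", "ly", "m", "n", "ny", "o", "ö", "p", "q", "r",
            "s", "sz", "t", "ty", "u", "ü", "v", "w", "x", "y", "z", "zs"]
RANK = {letter: i for i, letter in enumerate(ALPHABET)}

LONG_TO_SHORT = {"á": "a", "é": "e", "í": "i", "ó": "o",
                 "ő": "ö", "ú": "u", "ű": "ü"}
IGNORED = {" ", "-"}  # rule #14-d1

# Written forms mapped to the letters they stand for. The doubled shorthand
# repeats only the first character ("ccs" means <cs><cs>). Longest form first.
PATTERNS = sorted(
    [(c[0] + c, (c, c)) for c in COMPOUND] + [(c, (c,)) for c in COMPOUND],
    key=lambda p: len(p[0]),
    reverse=True,
)


def tokenize(word):
    """Split a word into (lowercase letter, written_with_uppercase) pairs."""
    lower = word.lower()
    out = []
    i = 0
    while i < len(lower):
        for written, letters in PATTERNS:
            if lower.startswith(written, i):
                break
        else:
            # No compound form matches here: take a single character.
            written, letters = lower[i], (lower[i],)
        has_upper = word[i:i + len(written)] != written
        out.extend((letter, has_upper) for letter in letters)
        i += len(written)
    return out


def sort_key(word):
    """Three-level key: base letters, then vowel length, then case."""
    primary, secondary, tertiary = [], [], []
    for letter, has_upper in tokenize(word):
        if letter in IGNORED:
            continue
        base = LONG_TO_SHORT.get(letter, letter)
        primary.append(RANK[base])        # KeyError for non-Hungarian letters
        secondary.append(letter != base)  # short vowel sorts before long
        tertiary.append(has_upper)        # lowercase sorts before uppercase
    return (primary, secondary, tertiary)


if __name__ == "__main__":
    words = ["kaszt", "kassza", "kaszinó", "kasza", "kastély", "Kassák"]
    print(sorted(words, key=sort_key))
```

Spaces and hyphens are dropped only after tokenizing, so that `kis számban` is read as `<s><sz>` rather than as the doubled shorthand `ssz`. Because the key is a tuple of lists, Python compares the primary level in full before looking at vowel length or case, which is what makes `Eger` sort before `egér` while `kerék` still sorts before `keres`.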