These statistics are programmatically generated with several tools. Most of them are made for this purpose. Some of them are generic. For more tool information, see the Making stats page. For frequency lists, see the Frequency lists page.

If you want to edit the stats here, please use the tools presented on the Making stats page for the sake of consistency. If you can't use them, find a friend or acquaintance who can. Above all, do not edit the stats here directly based on manual statistical analysis.

Scroll down to the bottom of the page for explanations of what the columns mean.

Higher % values are easier. Higher Hayashi values are easier.

Click a column header to sort by it.

script name kanji (unique) kanji (2+) chars per line characters lexemes 1k words BCCWJ 3k BCCWJ 5k VN 5k VN 5k sans 50 Core6k Core6k sans 250 chars per sentence Hayashi modified Hayashi modified Hayashi 2
aete mushisuru 1988 1725 16.3762 543761 296504 77.81% 76.69% 82.87% 89.56% 90.75% 82.16% 84.51% 10.23 79.29 75.43 94.98
ao no kanata 2130 1847 22.9081 938614 514246 80.91% 80.55% 86.19% 91.11% 91.33% 85.06% 87.35% 12.06 77.03 78.86 100.03
astelight shuushuubako 2983 2598 57.2095 518364 293732 63.25% 63.90% 70.79% 78.01% 78.48% 69.23% 71.06% 27.62 63.27 54.36 75.20
axanael 1862 1594 15.4847 485875 223616 80.70% 77.31% 83.35% 87.93% 87.93% 81.32% 85.41% 7.28 78.51 71.61 105.74
biman1 1751 1395 27.6612 197611 105809 80.16% 67.80% 73.57% 86.80% 90.23% 73.76% 81.36% 8.67 74.04 66.39 96.25
chronobox trials 1266 915 23.9832 66548 36876 84.27% 75.82% 82.23% 91.66% 91.98% 80.85% 86.08% 7.82 79.95 86.98 97.01
chronobox 1986 1738 25.8646 521122 289167 79.05% 74.48% 79.76% 89.57% 90.19% 77.78% 84.16% 7.67 77.31 81.83 96.55
cloverpoint 2259 2005 23.7604 906755 495514 74.50% 74.87% 81.53% 87.93% 88.96% 80.27% 83.23% 9.57 76.06 76.10 97.29
daitoshokan fandisk 2036 1761 18.9773 496826 273779 76.85% 76.26% 82.50% 89.49% 90.78% 81.63% 84.78% 11.70 75.63 77.29 93.86
daitoshokan 2313 2058 18.6452 1161599 647307 77.03% 78.47% 84.29% 89.88% 90.52% 82.72% 85.59% 12.15 75.33 77.94 92.81
dies irae 2799 2570 32.8806 1812969 989522 72.88% 74.24% 80.20% 87.05% 87.05% 77.48% 79.57% 17.03 70.99 66.98 89.46
dracuriot 2219 1949 27.3986 1265085 708912 80.75% 79.20% 85.36% 91.58% 92.07% 82.94% 86.58% 10.61 77.97 78.22 99.85
eustia 2455 2216 17.8062 1052020 588786 77.37% 75.85% 82.02% 88.42% 90.10% 80.39% 84.49% 11.88 77.03 72.77 95.02
fate stay night 2559 2330 26.1082 1861769 1041505 76.92% 74.00% 80.94% 87.36% 88.58% 78.64% 83.81% 14.64 73.38 63.75 90.06
flowers1+2+3 2413 2180 27.1889 893604 523210 76.47% 75.24% 81.60% 87.15% 87.67% 80.60% 83.27% 14.46 72.30 64.66 87.41
flowers1 1980 1692 27.0263 330543 192965 80.04% 77.94% 83.80% 89.10% 89.89% 83.62% 85.69% 13.29 73.86 71.02 90.08
flowers2 2033 1734 27.0760 269271 156333 77.12% 73.61% 80.10% 85.76% 86.99% 78.60% 82.93% 15.16 70.90 61.11 86.31
flowers3 2085 1809 27.4825 293790 173912 77.20% 73.75% 80.54% 86.24% 86.84% 79.07% 83.24% 15.32 71.91 60.74 85.44
flyable heart 1850 1632 20.9062 864065 473839 84.75% 84.43% 89.31% 94.22% 94.22% 87.34% 90.12% 11.52 81.91 92.23 102.48
fortunearterial 2187 1946 15.6327 949197 521921 77.95% 78.56% 83.80% 90.58% 91.32% 82.64% 85.96% 11.34 78.89 84.29 97.44
fureraba 2084 1854 22.7575 1055501 573300 80.20% 80.92% 86.78% 92.03% 92.03% 85.34% 87.22% 10.29 77.43 75.35 99.08
gensou no idea 2482 2227 29.0149 865797 490491 71.81% 75.59% 81.86% 87.37% 87.66% 79.82% 81.95% 13.64 77.20 69.51 92.38
hanachirasu 2151 1746 27.5822 154411 88671 70.98% 70.24% 77.10% 81.51% 82.46% 74.56% 77.16% 17.13 72.73 74.77 80.07
hanahira 948 655 16.6173 51485 26301 89.69% 80.27% 85.94% 90.98% 91.99% 86.91% 89.37% 8.90 83.48 92.80 109.85
hoshimemo 2081 1865 18.2732 977540 551636 79.96% 76.26% 83.08% 88.08% 89.72% 81.47% 87.06% 11.35 81.14 87.04 100.12
imakoi 1447 1158 27.9471 159416 84295 83.97% 75.03% 80.99% 92.12% 92.95% 79.37% 84.68% 11.63 80.79 70.53 100.81
inganock 2066 1822 26.5465 438336 244564 80.00% 76.18% 81.56% 85.67% 87.15% 78.39% 84.02% 10.61 75.52 63.57 86.62
itsusora 2466 2185 21.8124 646806 361699 76.57% 76.22% 82.71% 88.76% 89.16% 80.70% 83.50% 12.63 82.27 89.75 94.14
jingai makyou 2496 2235 27.4848 906760 509375 74.05% 73.50% 80.09% 88.01% 88.63% 79.78% 82.05% 14.52 73.92 73.86 88.72
kagerou 2939 2601 43.2278 853370 492836 70.10% 64.17% 72.15% 80.71% 83.75% 70.17% 75.16% 25.96 73.15 84.25 86.36
kajiri akebono 2862 2573 37.1914 1128495 634907 73.57% 72.03% 78.23% 84.22% 85.21% 74.81% 78.15% 19.09 81.44 85.32 86.51
kamimaho 2161 1947 23.0001 969307 532684 81.47% 78.91% 83.99% 90.74% 92.16% 82.45% 86.40% 15.30 77.13 85.20 92.65
katahane 1866 1662 24.3593 694494 345501 80.65% 79.76% 86.17% 89.90% 91.46% 84.18% 87.72% 11.16 79.93 71.30 100.92
kaziklu 2071 1728 38.8561 225348 123625 76.01% 75.09% 80.83% 87.52% 87.73% 77.63% 80.95% 20.81 70.55 65.37 94.41
leyline1+2+3 2020 1797 22.8403 1210036 683088 83.77% 79.82% 85.95% 92.37% 92.95% 83.58% 88.26% 10.78 81.54 88.31 98.27
leyline1 1597 1326 21.5464 353024 198447 86.53% 80.88% 86.67% 92.39% 94.16% 83.96% 89.35% 10.31 84.43 94.05 99.87
leyline2 1749 1474 23.2131 453826 256489 84.40% 80.03% 86.26% 92.32% 92.60% 83.96% 88.71% 11.09 80.98 87.24 98.41
leyline3 1662 1444 23.6588 403186 228152 83.65% 78.65% 84.96% 92.39% 92.69% 82.81% 87.69% 10.86 80.12 84.34 96.78
magical charming 2119 1804 21.1496 699095 377716 80.59% 80.31% 85.93% 91.43% 92.36% 84.02% 86.71% 8.85 78.52 83.03 105.02
majokoi 1971 1728 18.5559 629301 346800 81.58% 81.27% 86.82% 91.86% 91.86% 84.92% 87.36% 9.44 82.25 95.66 107.73
muramasa 3071 2797 17.8912 1400558 756342 67.73% 70.71% 77.17% 81.38% 82.12% 74.01% 76.32% 10.84 72.65 68.82 80.45
nanarin 1852 1730 21.1745 750968 409014 81.01% 77.74% 83.71% 91.24% 93.10% 82.23% 86.37% 8.32 77.11 89.50 103.14
nanatsuiro 1544 1313 20.2550 487860 271395 88.11% 81.86% 88.03% 93.23% 94.66% 85.63% 91.84% 11.23 81.84 91.07 102.35
princessfrontier 2343 2099 19.6364 907832 464695 75.18% 73.72% 80.46% 86.43% 87.43% 79.00% 82.70% 10.68 80.06 72.56 98.75
satsukoi 1883 1588 15.2492 319473 174578 79.09% 76.05% 81.90% 90.04% 91.52% 81.05% 84.89% 10.61 78.77 77.24 93.61
senrenbanka 2223 1920 23.3741 1149829 641863 82.31% 79.98% 85.26% 91.85% 92.42% 84.56% 88.01% 10.78 79.37 84.69 101.01
sensinkan bansenzin 2645 2379 36.9887 923229 522288 74.09% 76.37% 82.02% 87.96% 87.96% 79.00% 81.16% 19.45 76.24 82.61 90.81
sensinkan hatimyouzin 2859 2549 38.3709 1250645 705201 72.20% 74.50% 80.42% 86.74% 86.74% 77.20% 79.08% 20.14 77.19 82.38 89.19
sharnoth fvr 2125 1887 27.3698 532052 291647 78.43% 76.84% 82.88% 87.96% 88.46% 79.88% 83.33% 11.94 73.20 60.90 90.37
shirokuma 2433 2129 24.7324 1082543 581305 73.21% 72.68% 80.15% 84.87% 86.28% 78.16% 82.25% 10.86 77.96 70.51 102.15
shugaten 1764 1522 17.4341 413763 221918 79.89% 76.76% 82.82% 87.78% 89.67% 81.89% 85.87% 10.80 82.19 78.21 108.03
silverio vendetta 2759 2488 43.5207 954884 514962 67.45% 68.63% 75.37% 82.61% 82.95% 71.85% 74.02% 22.03 67.75 59.59 81.91
simulacre 1772 1516 24.3116 302739 180391 82.94% 82.18% 87.04% 92.66% 93.77% 86.18% 88.88% 11.31 80.97 89.60 95.51
snowwhite 1547 1247 33.6587 189737 108130 85.14% 81.08% 86.79% 92.21% 93.90% 85.73% 89.42% 18.43 78.11 80.48 97.50
soramitsu 2178 1982 25.9613 1101991 613527 78.67% 78.50% 84.47% 91.83% 92.40% 83.32% 85.72% 10.43 77.01 81.54 97.11
sourire 1895 1638 24.4430 690684 372704 83.08% 80.24% 86.10% 92.64% 94.54% 84.93% 88.83% 9.94 78.64 89.80 100.02
subahibi 2239 2031 22.0398 1127394 626396 79.38% 79.87% 85.64% 90.63% 90.92% 83.71% 86.25% 7.91 78.25 78.89 97.00
sukinara 2070 1840 24.3595 1377391 725057 81.45% 78.52% 84.09% 90.35% 92.14% 83.03% 87.21% 12.30 80.24 89.38 108.70
tarareba 1798 1554 26.0075 488678 275749 82.64% 77.42% 83.19% 91.59% 93.05% 81.15% 86.84% 11.79 77.55 72.70 99.03
trinoline 1955 1693 21.8147 545133 296702 82.20% 81.05% 86.10% 92.60% 93.75% 84.75% 87.91% 11.26 76.65 80.29 96.84
tsujidou 2270 2027 16.9892 1280257 680942 74.63% 74.96% 81.59% 87.14% 87.47% 80.27% 82.74% 9.67 83.16 75.49 104.50
tsuriotsu 2391 2104 38.4322 1093199 641966 80.34% 79.81% 85.84% 89.11% 91.64% 84.31% 87.65% 14.58 78.91 78.84 97.40
tsuushinbo 1869 1620 25.2615 656951 360210 83.05% 78.87% 84.69% 92.64% 93.87% 83.67% 87.46% 10.47 83.27 98.16 107.74
twinklecrusaders 2587 2314 23.2599 1609755 871249 75.35% 75.45% 81.79% 86.75% 87.34% 79.63% 83.30% 12.01 77.72 69.56 100.25

Kanji (unique): The number of kanji codepoints that occur at least once in the entire script.

Kanji (2+): Same, but at least twice, not at least once.

Chars per line: Number of characters per line in file, after stripping whitespace from the front/back of the line, excluding " " from the line, and ignoring blank lines, in that order.

Characters: Total characters in the file, ignoring characters from r'『』「」[]()()【】〈〉《》«»‹›〚〛〘〙{}{} ―-~。、…‥\n\r'. That includes ignoring various whitespace characters.

Lexemes: Number of lexeme events in the script according to kuromoji-unidic with a slightly modified dictionary. Lexemes are similar to words. 限り is a lexeme. とした may be interpreted as three separate lexemes. Does not ignore any lexemes the parser understands, not even some names.

1k words: How much of the script is covered by the top 1k most common non-grammatical lexemes from that particular script. Grammatical lexemes are particles, true auxiliary verbs like ます, interjections, and symbols. Grammatical lexemes are ignored both when finding the top 1k most common lexemes and when calculating coverage.

BCCWJ Nk: Coverage based on the top N most common words from the BCCWJ frequency list, which was generated using mecab-unidic. The coverage value ignores grammatical lexemes, BCCWJ does not; in other words, hundreds of grammatical lexemes are inflating the BCCWJ word count required to reach a given coverage level.

VN 5k: Coverage based on the top 5k most common words from a frequency list generated from VNs. Subject to massive change at any time as more VNs are added. This frequency list excludes grammatical lexemes.

VN 5k sans 50: Above, but ignoring any words in the top 50 words for that script that are not in the top 5k in the frequency list. This is slightly different from pretending that any top 50 words are known. If the entire script consisted of top 50 words that were not in the top 5k in the frequency list, the coverage would be undefined.

Core6k: Coverage based on the lexemes the analyzer recognizes from Core6k, with several hundred manual corrections. Inherently going to cover less than VN 5k, no matter what, because VN 5k is derived from the scripts it's ranking.

Core6k sans 250: Like VN 5k sans 50, but with Core6k and the top 250 from the script.

Chars per sentence: Like chars per line, but attempts to identify sentence boundaries.

Hayashi: An estimate of structural complexity intended for the school grade level of textbooks and reading material. For more information: http://www.lrec-conf.org/proceedings/lrec2008/pdf/165_paper.pdf

modified Hayashi: Above, but recalibrated to the ratio of kanji/hiragana/katakana in each VN's script, adjusting for a flaw in the design of the original Hayashi metric. This recalibration is fuzzy, and causes the scale to have a different linear correlation.

modified Hayashi 2: Same, but ignoring the contribution of katakana sequences entirely.

These fields might be axed in the future:

1k words

BCCWJ 3k

Chars per line (superseded by Chars per sentence)

Hayashi (inherently broken, only useful when comparing to stats elsewhere)