Size: 13146
Comment: white album and parfait
|
Size: 13195
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 14: | Line 14: |
script name; kanji\n(unique); kanji\n(2+); chars\nper line; characters; lexemes; 1k words; BCCWJ 3k; BCCWJ 5k; VN 5k; VN 5k\nsans 50; Core6k; Core6k\nsans 250; chars per\nsentence; aete mushisuru; 1988; 1725; 16.3762; 543761; 296504; 77.81%; 76.69%; 82.87%; 89.61%; 90.80%; 82.16%; 84.51%; 10.23; 79.29; 75.43; 94.98 | script name; kanji\n(unique); kanji\n(2+); chars\nper line; characters; lexemes; 1k words; BCCWJ 3k; BCCWJ 5k; VN 5k; VN 5k\nsans 50; Core6k; Core6k\nsans 250; chars per\nsentence; Hayashi; modified\nHayashi; modified\nHayashi 2 aete mushisuru; 1988; 1725; 16.3762; 543761; 296504; 77.81%; 76.69%; 82.87%; 89.61%; 90.80%; 82.16%; 84.51%; 10.23; 79.29; 75.43; 94.98 |
These statistics are programmatically generated with several tools. Most of them are made for this purpose. Some of them are generic. For more tool information, see the Making stats page. For frequency lists, see the Frequency lists page.
If you want to edit the stats here, please use the tools presented on the Making stats page for the sake of consistency. If you can't use them, find a friend or acquaintance who can. Above all, do not edit the stats here directly based on manual statistical analysis.
Scroll down to the bottom of the page for explanations of what the columns mean.
Higher % values are easier. Higher Hayashi values are easier.
Click a column header to sort by it.
Add requests to this page: Requests
script name | kanji | kanji | chars | characters | lexemes | 1k words | BCCWJ 3k | BCCWJ 5k | VN 5k | VN 5k | Core6k | Core6k | chars per | Hayashi | modified | modified |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
aete mushisuru | 1988 | 1725 | 16.3762 | 543761 | 296504 | 77.81% | 76.69% | 82.87% | 89.61% | 90.80% | 82.16% | 84.51% | 10.23 | 79.29 | 75.43 | 94.98 |
ao no kanata | 2130 | 1847 | 22.9081 | 938614 | 514246 | 80.91% | 80.55% | 86.19% | 91.16% | 91.38% | 85.06% | 87.35% | 12.06 | 77.03 | 78.86 | 100.03 |
astelight shuushuubako | 2983 | 2598 | 57.2095 | 518364 | 293732 | 63.25% | 63.90% | 70.79% | 77.91% | 78.38% | 69.23% | 71.06% | 27.62 | 63.27 | 54.36 | 75.20 |
axanael | 1862 | 1594 | 15.4847 | 485875 | 223616 | 80.70% | 77.31% | 83.35% | 87.96% | 87.96% | 81.32% | 85.41% | 7.28 | 78.51 | 71.61 | 105.74 |
biman1 | 1751 | 1395 | 27.6612 | 197611 | 105809 | 80.16% | 67.80% | 73.57% | 86.77% | 90.21% | 73.76% | 81.36% | 8.67 | 74.04 | 66.39 | 96.25 |
chronobox trials | 1266 | 915 | 23.9832 | 66548 | 36876 | 84.27% | 75.82% | 82.23% | 91.56% | 91.89% | 80.85% | 86.08% | 7.82 | 79.95 | 86.98 | 97.01 |
chronobox | 1986 | 1738 | 25.8646 | 521122 | 289167 | 79.05% | 74.48% | 79.76% | 89.55% | 90.16% | 77.78% | 84.16% | 7.67 | 77.31 | 81.83 | 96.55 |
cloverpoint | 2259 | 2005 | 23.7604 | 906755 | 495514 | 74.50% | 74.87% | 81.53% | 87.95% | 88.98% | 80.27% | 83.23% | 9.57 | 76.06 | 76.10 | 97.29 |
daitoshokan fandisk | 2036 | 1761 | 18.9773 | 496826 | 273779 | 76.85% | 76.26% | 82.50% | 89.54% | 90.83% | 81.63% | 84.78% | 11.70 | 75.63 | 77.29 | 93.86 |
daitoshokan | 2313 | 2058 | 18.6452 | 1161599 | 647307 | 77.03% | 78.47% | 84.29% | 89.95% | 90.59% | 82.72% | 85.59% | 12.15 | 75.33 | 77.94 | 92.81 |
dies irae | 2799 | 2570 | 32.8806 | 1812969 | 989522 | 72.88% | 74.24% | 80.20% | 87.01% | 87.01% | 77.48% | 79.57% | 17.03 | 70.99 | 66.98 | 89.46 |
dracuriot | 2219 | 1949 | 27.3986 | 1265085 | 708912 | 80.75% | 79.20% | 85.36% | 91.59% | 92.08% | 82.94% | 86.58% | 10.61 | 77.97 | 78.22 | 99.85 |
eustia | 2455 | 2216 | 17.8062 | 1052020 | 588786 | 77.37% | 75.85% | 82.02% | 88.33% | 90.01% | 80.39% | 84.49% | 11.88 | 77.03 | 72.77 | 95.02 |
fate stay night | 2559 | 2330 | 26.1082 | 1861769 | 1041505 | 76.92% | 74.00% | 80.94% | 87.38% | 88.60% | 78.64% | 83.81% | 14.64 | 73.38 | 63.75 | 90.06 |
flowers1+2+3 | 2413 | 2180 | 27.1889 | 893604 | 523210 | 76.47% | 75.24% | 81.60% | 87.36% | 87.89% | 80.60% | 83.27% | 14.46 | 72.30 | 64.66 | 87.41 |
flowers1 | 1980 | 1692 | 27.0263 | 330543 | 192965 | 80.04% | 77.94% | 83.80% | 89.50% | 89.97% | 83.62% | 85.69% | 13.29 | 73.86 | 71.02 | 90.08 |
flowers2 | 2033 | 1734 | 27.0760 | 269271 | 156333 | 77.12% | 73.61% | 80.10% | 85.74% | 86.97% | 78.60% | 82.93% | 15.16 | 70.90 | 61.11 | 86.31 |
flowers3 | 2085 | 1809 | 27.4825 | 293790 | 173912 | 77.20% | 73.75% | 80.54% | 86.46% | 87.07% | 79.07% | 83.24% | 15.32 | 71.91 | 60.74 | 85.44 |
flyable heart | 1850 | 1632 | 20.9062 | 864065 | 473839 | 84.75% | 84.43% | 89.31% | 94.22% | 94.22% | 87.34% | 90.12% | 11.52 | 81.91 | 92.23 | 102.48 |
fortunearterial | 2187 | 1946 | 15.6327 | 949197 | 521921 | 77.95% | 78.56% | 83.80% | 90.59% | 91.34% | 82.64% | 85.96% | 11.34 | 78.89 | 84.29 | 97.44 |
fureraba | 2084 | 1854 | 22.7575 | 1055501 | 573300 | 80.20% | 80.92% | 86.78% | 92.06% | 92.06% | 85.34% | 87.22% | 10.29 | 77.43 | 75.35 | 99.08 |
gensou no idea | 2482 | 2227 | 29.0149 | 865797 | 490491 | 71.81% | 75.59% | 81.86% | 87.34% | 87.64% | 79.82% | 81.95% | 13.64 | 77.20 | 69.51 | 92.38 |
hanachirasu | 2151 | 1746 | 27.5822 | 154411 | 88671 | 70.98% | 70.24% | 77.10% | 81.46% | 82.41% | 74.56% | 77.16% | 17.13 | 72.73 | 74.77 | 80.07 |
hanahira | 948 | 655 | 16.6173 | 51485 | 26301 | 89.69% | 80.27% | 85.94% | 91.09% | 92.11% | 86.91% | 89.37% | 8.90 | 83.48 | 92.80 | 109.85 |
hoshimemo | 2081 | 1865 | 18.2732 | 977540 | 551636 | 79.96% | 76.26% | 83.08% | 88.06% | 89.70% | 81.47% | 87.06% | 11.35 | 81.14 | 87.04 | 100.12 |
imakoi | 1447 | 1158 | 27.9471 | 159416 | 84295 | 83.97% | 75.03% | 80.99% | 92.11% | 92.94% | 79.37% | 84.68% | 11.63 | 80.79 | 70.53 | 100.81 |
inganock | 2066 | 1822 | 26.5465 | 438336 | 244564 | 80.00% | 76.18% | 81.56% | 85.64% | 87.12% | 78.39% | 84.02% | 10.61 | 75.52 | 63.57 | 86.62 |
itsusora | 2466 | 2185 | 21.8124 | 646806 | 361699 | 76.57% | 76.22% | 82.71% | 88.73% | 89.13% | 80.70% | 83.50% | 12.63 | 82.27 | 89.75 | 94.14 |
jingai makyou | 2496 | 2235 | 27.4848 | 906760 | 509375 | 74.05% | 73.50% | 80.09% | 88.01% | 88.63% | 79.78% | 82.05% | 14.52 | 73.92 | 73.86 | 88.72 |
kagerou | 2939 | 2601 | 43.2278 | 853370 | 492836 | 70.10% | 64.17% | 72.15% | 80.65% | 83.69% | 70.17% | 75.16% | 25.96 | 73.15 | 84.25 | 86.36 |
kajiri akebono | 2862 | 2573 | 37.1914 | 1128495 | 634907 | 73.57% | 72.03% | 78.23% | 84.17% | 85.17% | 74.81% | 78.15% | 19.09 | 81.44 | 85.32 | 86.51 |
kamimaho | 2161 | 1947 | 23.0001 | 969307 | 532684 | 81.47% | 78.91% | 83.99% | 90.76% | 92.18% | 82.45% | 86.40% | 15.30 | 77.13 | 85.20 | 92.65 |
katahane | 1866 | 1662 | 24.3593 | 694494 | 345501 | 80.65% | 79.76% | 86.17% | 89.91% | 91.47% | 84.18% | 87.72% | 11.16 | 79.93 | 71.30 | 100.92 |
kaziklu | 2071 | 1728 | 38.8561 | 225348 | 123625 | 76.01% | 75.09% | 80.83% | 87.50% | 87.72% | 77.63% | 80.95% | 20.81 | 70.55 | 65.37 | 94.41 |
leyline1+2+3 | 2020 | 1797 | 22.8403 | 1210036 | 683088 | 83.77% | 79.82% | 85.95% | 92.37% | 92.95% | 83.58% | 88.26% | 10.78 | 81.54 | 88.31 | 98.27 |
leyline1 | 1597 | 1326 | 21.5464 | 353024 | 198447 | 86.53% | 80.88% | 86.67% | 92.42% | 94.19% | 83.96% | 89.35% | 10.31 | 84.43 | 94.05 | 99.87 |
leyline2 | 1749 | 1474 | 23.2131 | 453826 | 256489 | 84.40% | 80.03% | 86.26% | 92.32% | 92.61% | 83.96% | 88.71% | 11.09 | 80.98 | 87.24 | 98.41 |
leyline3 | 1662 | 1444 | 23.6588 | 403186 | 228152 | 83.65% | 78.65% | 84.96% | 92.37% | 92.67% | 82.81% | 87.69% | 10.86 | 80.12 | 84.34 | 96.78 |
magical charming | 2119 | 1804 | 21.1496 | 699095 | 377716 | 80.59% | 80.31% | 85.93% | 91.41% | 92.34% | 84.02% | 86.71% | 8.85 | 78.52 | 83.03 | 105.02 |
majokoi | 1971 | 1728 | 18.5559 | 629301 | 346800 | 81.58% | 81.27% | 86.82% | 91.85% | 91.85% | 84.92% | 87.36% | 9.44 | 82.25 | 95.66 | 107.73 |
muramasa | 3071 | 2797 | 17.8912 | 1400558 | 756342 | 67.73% | 70.71% | 77.17% | 81.28% | 82.02% | 74.01% | 76.32% | 10.84 | 72.65 | 68.82 | 80.45 |
nanarin | 1852 | 1730 | 21.1745 | 750968 | 409014 | 81.01% | 77.74% | 83.71% | 91.23% | 93.09% | 82.23% | 86.37% | 8.32 | 77.11 | 89.50 | 103.14 |
nanatsuiro | 1544 | 1313 | 20.2550 | 487860 | 271395 | 88.11% | 81.86% | 88.03% | 93.21% | 94.64% | 85.63% | 91.84% | 11.23 | 81.84 | 91.07 | 102.35 |
parfait | 2135 | 1888 | 23.0168 | 753595 | 420245 | 77.87% | 77.40% | 83.27% | 90.13% | 90.56% | 82.73% | 85.41% | 10.81 | 69.75 | 77.09 | 95.46 |
princessfrontier | 2343 | 2099 | 19.6364 | 907832 | 464695 | 75.18% | 73.72% | 80.46% | 86.47% | 87.46% | 79.00% | 82.70% | 10.68 | 80.06 | 72.56 | 98.75 |
satsukoi | 1883 | 1588 | 15.2492 | 319473 | 174578 | 79.09% | 76.05% | 81.90% | 90.07% | 91.56% | 81.05% | 84.89% | 10.61 | 78.77 | 77.24 | 93.61 |
senrenbanka | 2223 | 1920 | 23.3741 | 1149829 | 641863 | 82.31% | 79.98% | 85.26% | 91.85% | 92.42% | 84.56% | 88.01% | 10.78 | 79.37 | 84.69 | 101.01 |
sensinkan bansenzin | 2645 | 2379 | 36.9887 | 923229 | 522288 | 74.09% | 76.37% | 82.02% | 87.95% | 87.95% | 79.00% | 81.16% | 19.45 | 76.24 | 82.61 | 90.81 |
sensinkan hatimyouzin | 2859 | 2549 | 38.3709 | 1250645 | 705201 | 72.20% | 74.50% | 80.42% | 86.67% | 86.67% | 77.20% | 79.08% | 20.14 | 77.19 | 82.38 | 89.19 |
sharnoth fvr | 2125 | 1887 | 27.3698 | 532052 | 291647 | 78.43% | 76.84% | 82.88% | 87.97% | 88.47% | 79.88% | 83.33% | 11.94 | 73.20 | 60.90 | 90.37 |
shirokuma | 2433 | 2129 | 24.7324 | 1082543 | 581305 | 73.21% | 72.68% | 80.15% | 84.91% | 86.32% | 78.16% | 82.25% | 10.86 | 77.96 | 70.51 | 102.15 |
shugaten | 1764 | 1522 | 17.4341 | 413763 | 221918 | 79.89% | 76.76% | 82.82% | 87.90% | 89.79% | 81.89% | 85.87% | 10.80 | 82.19 | 78.21 | 108.03 |
silverio vendetta | 2759 | 2488 | 43.5207 | 954884 | 514962 | 67.45% | 68.63% | 75.37% | 82.54% | 82.88% | 71.85% | 74.02% | 22.03 | 67.75 | 59.59 | 81.91 |
simulacre | 1772 | 1516 | 24.3116 | 302739 | 180391 | 82.94% | 82.18% | 87.04% | 92.48% | 93.59% | 86.18% | 88.88% | 11.31 | 80.97 | 89.60 | 95.51 |
snowwhite | 1547 | 1247 | 33.6587 | 189737 | 108130 | 85.14% | 81.08% | 86.79% | 92.21% | 93.90% | 85.73% | 89.42% | 18.43 | 78.11 | 80.48 | 97.50 |
soramitsu | 2178 | 1982 | 25.9613 | 1101991 | 613527 | 78.67% | 78.50% | 84.47% | 91.80% | 92.37% | 83.32% | 85.72% | 10.43 | 77.01 | 81.54 | 97.11 |
sourire | 1895 | 1638 | 24.4430 | 690684 | 372704 | 83.08% | 80.24% | 86.10% | 92.68% | 94.58% | 84.93% | 88.83% | 9.94 | 78.64 | 89.80 | 100.02 |
subahibi | 2239 | 2031 | 22.0398 | 1127394 | 626396 | 79.38% | 79.87% | 85.64% | 90.60% | 90.89% | 83.71% | 86.25% | 7.91 | 78.25 | 78.89 | 97.00 |
sukinara | 2070 | 1840 | 24.3595 | 1377391 | 725057 | 81.45% | 78.52% | 84.09% | 90.36% | 92.16% | 83.03% | 87.21% | 12.30 | 80.24 | 89.38 | 108.70 |
tarareba | 1798 | 1554 | 26.0075 | 488678 | 275749 | 82.64% | 77.42% | 83.19% | 91.59% | 93.05% | 81.15% | 86.84% | 11.79 | 77.55 | 72.70 | 99.03 |
trinoline | 1955 | 1693 | 21.8147 | 545133 | 296702 | 82.20% | 81.05% | 86.10% | 92.60% | 93.75% | 84.75% | 87.91% | 11.26 | 76.65 | 80.29 | 96.84 |
tsujidou | 2270 | 2027 | 16.9892 | 1280257 | 680942 | 74.63% | 74.96% | 81.59% | 87.17% | 87.51% | 80.27% | 82.74% | 9.67 | 83.16 | 75.49 | 104.50 |
tsuriotsu | 2391 | 2104 | 38.4322 | 1093199 | 641966 | 80.34% | 79.81% | 85.84% | 89.13% | 91.66% | 84.31% | 87.65% | 14.58 | 78.91 | 78.84 | 97.40 |
tsuushinbo | 1869 | 1620 | 25.2615 | 656951 | 360210 | 83.05% | 78.87% | 84.69% | 92.64% | 93.87% | 83.67% | 87.46% | 10.47 | 83.27 | 98.16 | 107.74 |
twinklecrusaders | 2587 | 2314 | 23.2599 | 1609755 | 871249 | 75.35% | 75.45% | 81.79% | 86.76% | 87.36% | 79.63% | 83.30% | 12.01 | 77.72 | 69.56 | 100.25 |
white_album | 1832 | 1489 | 18.5374 | 492329 | 271621 | 84.95% | 83.69% | 88.74% | 91.89% | 93.31% | 87.28% | 91.21% | 11.62 | 77.16 | 85.50 | 101.85 |
Kanji (unique): The number of kanji codepoints that occur at least once in the entire script.
Kanji (2+): Same, but at least twice, not at least once.
Chars per line: Number of characters per line in file, after stripping whitespace from the front/back of the line, excluding " " from the line, and ignoring blank lines, in that order.
Characters: Total characters in the file, ignoring characters from r'『』「」[]()()【】〈〉《》«»‹›〚〛〘〙{}{} ―-~。、…‥\n\r'. That includes ignoring various whitespace characters.
Lexemes: Number of lexeme events in the script according to kuromoji-unidic with a slightly modified dictionary. Lexemes are similar to words. 限り is a lexeme. とした may be interpreted as three separate lexemes. Does not ignore any lexemes the parser understands, not even some names.
1k words: How much of the script is covered by the top 1k most common non-grammatical lexemes from that particular script. Grammatical lexemes are particles, true auxiliary verbs like ます, interjections, and symbols. Grammatical lexemes are ignored both when finding the top 1k most common lexemes and when calculating coverage.
BCCWJ Nk: Coverage based on the top N most common words from the BCCWJ frequency list, which was generated using mecab-unidic. The coverage value ignores grammatical lexemes, BCCWJ does not; in other words, hundreds of grammatical lexemes are inflating the BCCWJ word count required to reach a given coverage level.
VN 5k: Coverage based on the top 5k most common words from a frequency list generated from VNs. Subject to massive change at any time as more VNs are added. This frequency list excludes grammatical lexemes.
VN 5k sans 50: Above, but ignoring any words in the top 50 words for that script that are not in the top 5k in the frequency list. This is slightly different from pretending that any top 50 words are known. If the entire script consisted of top 50 words that were not in the top 5k in the frequency list, the coverage would be undefined.
Core6k: Coverage based on the lexemes the analyzer recognizes from Core6k, with several hundred manual corrections. Inherently going to cover less than VN 5k, no matter what, because VN 5k is derived from the scripts it's ranking.
Core6k sans 250: Like VN 5k sans 50, but with Core6k and the top 250 from the script.
Chars per sentence: Like chars per line, but attempts to identify sentence boundaries.
Hayashi: An estimate of structural complexity intended for the school grade level of textbooks and reading material. For more information: http://www.lrec-conf.org/proceedings/lrec2008/pdf/165_paper.pdf
modified Hayashi: Above, but recalibrated to the ratio of kanji/hiragana/katakana in each VN's script, adjusting for a flaw in the design of the original Hayashi metric. This recalibration is fuzzy, and causes the scale to have a different linear correlation.
modified Hayashi 2: Same, but ignoring the contribution of katakana sequences entirely.
These fields might be axed in the future:
1k words
BCCWJ 3k
Chars per line (superseded by Chars per sentence)
Hayashi (inherently broken, only useful when comparing to stats elsewhere)