Bill Moore's Japanese Font Mini-Tutorial Page.


Copyright © 1997 G. William Moore, MD, PhD.


This file may be copied and distributed without restriction.

  • THE JAPANESE WRITING SYSTEM.
  • JAPANESE KANJI. Written Japanese employs four "alphabets". Traditional written Japanese, or KANJI, was borrowed from the Chinese in the seventh century, and consists of a set of ideographs, or characters, each of which corresponds to an idea or concept. Unlike Western alphabets, in which each letter usually corresponds to a few sounds, Japanese Kanji do not necessarily have a unique, associated sound. Although written Chinese now has over 65,000 available ideographs, since World War II, the Japanese Kanji ideograph set has been limited by the Japan Ministry of Education to fewer than 2000, with an additional 5000 or so ideographs used in older literary works. The Kanji system is used to represent major concepts in the Japanese language, such as anatomical sites and diseases.
  • JAPANESE KANA. The two traditional phonetic Japanese "alphabets", or kanas, are: HIRAGANA and KATAKANA. Each of these phonetic alphabets, or more accurately syllabaries, correspond to 46 syllables in the Japanese language. In addition to the 46 basic Japanese syllables, there are additional diacritical marks which modify and expand the basic sounds of the two kana systems. For didactic purposes, the sounds for the two kana systems may be represented in a 10 x 5 table, as follows:
           a    i    u    e    o
          ka   ki   ku   ke   ko
          sa  shi   su   se   so
          ta  chi  tsu   te   to
          na   ni   nu   ne   no
          ha   hi   fu   he   ho
          ma   mi   mu   me   mo
          ya        yu        yo
          ra   ri   ru   re   ro
          wa                  wo    n
    

    Hiragana is used to represent grammatical particles (prepositions, auxiliary verbs, etc.). Katakana is used to represent loan-words from non-Japanese languages. Many high-technology terms, such as computer, monitor, keyboard, etc., are English loan-words transliterated in Katakana.
  • LEARNING THE JAPANESE KANA. as follows:
           a    i    u    e    o
          ka   ki   ku   ke   ko
    
    
    
    

  • JAPANESE ROMAJI. Finally, Japanese may be written phonetically with the Roman alphabet, namely, ROMAJI. Although every educated Japanese person can read and write Romaji, one cannot write unambiguous Japanese purely in Romaji, because many distinct concepts have the same sound, and must be disambiguated in written text by the much richer system of Kanji ideographs. Experts claim that the post-World-War-II technology explosion in Japan has, if anything, increased the Japanese dependence on concept-rich (but Westerner-difficult) Kanji ideographs.
  • FOREIGN LANGUAGE FONTS ON THE COMPUTER.
    All computer text consists internally of a stream of BYTES, or numbers ranging in value from 0 to 255 (i.e., 2x2x2x2x2x2x2x2 possibilities). The assignment table which points each byte-number to a text-character on your computer monitor or on a printout is called a FONT. Since early computer software was largely developed in the USA, the 26 letters of the Roman alphabet, upper- and lower-case, as well as numerals, punctuation, and device-controls (e.g., space, new-line, new-page, back-space, etc.) were assigned to byte-numbers ranging between 0 and 127. This assignment table is known as the American Code for Information Interchange (ASCII), and has been officially adopted by the International Standards Organization (ISO). For example in ASCII, A=65, B=66, C=67,..., a=97, b=98, c=99,..., backspace=8, newpage=12, etc. However, the remaining byte-numbers, ranging between 128 and 255, are not even sufficient to accommodate all the European languages using Roman alphabets, with their various accents, umlauts, circumflexes, cedillas, tildes, etc. Consider the following, partial list of languages requiring alphabetic fonts:
    Afrikaans   Akkadian   Albanian   American-Indian   Amharic   Arabic   
    Armenian   Aymara   Basque   Belgium   Belorussian   Bemba   Bengali   
    Blackfoot   Bohemian   Brazilian   Bretan   Bretonian   Bulgarian   
    Burmese   Byelorussian   Cambodian   Catalan   Chamarra   Cherokee   
    Chinese   Coptic   Cree   Croatian   Crow   Cyrillic   Czech   Danish   
    Dutch   Esperanto   Estonian   Farcese   Farsi   Fijian   Finnish   
    Flemish   French   Frisian   Gaelic   Ganda   Georgian   German   
    Glagolitic   Greek   Guarani   Gujarati   Hawaiian   Hebrew   
    Hieroglyphics   Hindi   Hungarian   Icelandic   Indonesian   Inuit   
    Inuktitut   Irish   Italian   Japanese   Kango   Kannada   Khmer   
    Kikuyu   Korean   Kpeile   Kwakwala   Laotian   Lappish   Latin   
    Latvian   Lithuanian   Littera   Luxembourgian   Macedonian   Malagasy   
    Malay   Malayalam   Malinke   Maltese   Maori   Marathi   Mayan   
    Mohawk   Moldavian   Mongolian   Nahauati   Nepali   Netherlands   
    Norwegian   Nyanja   Nynorsk   Oriya   Papiamento   Pashto   Persian   
    Polish   Portuguese   Provencal   Punjabi   Quechua   Roman   Ruanda   
    Rumanian   Rundi   Runic   Russian   Samoan   Sanskrit   Serbian   
    Sinhalese   Sioux   Slavonic   Slovak   Slovenian   Somalia   Sotho   
    Spanish   Sudanese   Swahili   Swazi   Swedish   Swiss   Syriac   
    Tagalog   Tahitian   Tamil   Telugu   Tewa   Thai   Tibetan   Tongan   
    Tswana   Turkish   Ukrainian   Urdu   Vietnamese   Visayan   Welsh   
    Wendish   Wolof   Xhosa   Yukon   
    
    Among these languages, three Chinese-based writing systems require more than 256 letters: Chinese, Japanese, and Traditional Korean.
  • JAPANESE FONTS ON THE COMPUTER.
    Since the Japanese writing system requires more than 256 letters, there are several font systems for Japanese which employ two consecutive bytes (i.e., which accommodate 256 x 256 = 65,536 assignments.) Strictly speaking, Chinese does not fit into a 65,536-character font, but many Chinese characters occur so infrequently in text that they may be composed on-the-fly on an as-needed basis. The most popular Japanese fonts are: JIS (=Japan Industrial Standard), Shift-JIS (SJIS), and EUC (=European Union Characters). On the Internet, Shift-JIS appears to be the most popular Japanese font. This tutorial employs Shift-JIS. If you display a SJIS output on a computer without an installed SJIS Japanese font, then all the Japanese words in the output will appear as gibberish.
  • INSTALLING A JAPANESE FONT ON THE COMPUTER. Currently, you must purchase a SJIS Japanese font, for prices ranging from $50 to several hundred dollars. Our office uses the UnionWay SJIS Japanese Font, but many other vendors are advertised on the Internet.
    When you obtain your font, you should install it according to the vendor's instructions. In Windows 97, you typically begin by clicking on START, then SETTINGS, then CONTROL PANEL, then double-clicking on FONT, then clicking on INSTALL NEW FONT, then entering the filename of your new font. Then go into your Internet Browser (Netscape, Microsoft, etc.) and change your fontname under OPTIONS.
    Caution: Most one-byte fonts occupy only about 10 kilobytes on your computer RAM memory, but two-byte fonts may occupy up to 7 megabytes, and may challenge your computer's memory and slow down your Internet browser.
  • SAMPLE JAPANESE WORDS IN THE SJIS FONT.
    CHI BLOOD 血
    CHI THOUSAND 千
    CHII SMALL 小
    HIGASHI EAST 東
    HITO PERSON 人
    INU DOG 犬
    JIN BENEVOLENCE 仁
    KAWA RIVER 川
    MIZU WATER 水
    MO SAY 申
    NI TWO 二
    O KING 壬
    ONNA WOMAN 女
    OTOKO GENTLEMAN 男
    SHIN FAITH 信
    SHIN FOREST 森
    SHIN TRUST 信
    SO CHILD 子
    TAN DAWN 旦
    TO TEN 十
    USHI COW 牛
    YAMA MOUNTAIN 山
  • RETURN TO BILL MOORE'S HOME PAGE. Return to Bill Moore's Home Page
  • For additional information, contact Bill Moore at:
    gwmoore@erols.com