Skip to content
Sections
>> Trisquel >> Paketit >> etiona >> misc >> unidic-mecab
etiona  ] [  nabia  ] [  aramo  ]
[ Source: unidic-mecab  ]

Paketti: unidic-mecab (2.2.0-1)

Dictionary for Mecab (Corpus of Contemporary Written Japanese)

unidic-mecab is a dictionary for Mecab (Japanese morphological analysis implementation), based on corpus of Contemporary Written Japanese (upstream publish it as unidic-cwj).

 * All entries are based on the definition of "SUW (short-unit word)" that is
   specified by NINJAL (The National Institute for Japanese Language and
   Linguistics), which provides word segmentation in uniform size suited for
   linguistic research.
 * It has three-layered structure with
    - lemma
    - form
    - spelling
   And it can provide a clear distinction of two types of word variant:
   spelling variant and form variant.
 * It is useful for research of Speech processing since it can be added
   accent and shift in sound information.

Imuroi unidic-mecab

Imurointi kaikille saataville arkkitehtuureille
Arkkitehtuuri Paketin koko Koko asennettuna Tiedostot
all 94,664.1 kt1444023 kt [tiedostoluettelo]