Package: unidic-mecab (2.2.0-1)
Links for unidic-mecab
Trisquel Resources:
Download Source Package unidic-mecab:
Maintainer:
Original Maintainers:
- Natural Language Processing, Japanese (Mail Archive)
- Hideki Yamane
External Resources:
- Homepage [unidic.ninjal.ac.jp]
Similar packages:
Dictionary for MeCab (Corpus of Contemporary Written Japanese)
unidic-mecab is a dictionary for MeCab (a Japanese morphological analysis implementation), based on the Corpus of Contemporary Written Japanese (upstream publishes it as unidic-cwj).
* All entries are based on the definition of "SUW (short-unit word)" specified by NINJAL (The National Institute for Japanese Language and Linguistics), which provides word segmentation in units of uniform size suited to linguistic research.
* It has a three-layered structure:
  - lemma
  - form
  - spelling
  This gives a clear distinction between two types of word variant: spelling variants and form variants.
* It is useful for speech processing research, since accent and sound-shift information can be added to it.
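The three-layered structure described above can be pictured with a minimal Python sketch. It is not the dictionary's actual file format or any MeCab API: the `Lemma` and `Form` classes, the `classify_variants` helper, and the sample entry are all hypothetical and chosen only for illustration. The idea is that two surface spellings under the same form are spelling variants, while spellings under different forms of the same lemma are form variants.

```python
from dataclasses import dataclass, field


@dataclass
class Form:
    """Form layer: one pronunciation variant of a lemma (hypothetical model)."""
    reading: str
    spellings: list[str] = field(default_factory=list)  # spelling layer


@dataclass
class Lemma:
    """Lemma layer: the headword that groups all forms (hypothetical model)."""
    headword: str
    forms: list[Form] = field(default_factory=list)


def classify_variants(lemma: Lemma, a: str, b: str):
    """Classify two spellings of one lemma.

    Returns "same", "spelling variant", "form variant",
    or None if either spelling is not listed under the lemma.
    """
    # Map each spelling to the reading of the form it belongs to.
    index = {s: f.reading for f in lemma.forms for s in f.spellings}
    if a not in index or b not in index:
        return None
    if a == b:
        return "same"
    return "spelling variant" if index[a] == index[b] else "form variant"


# Illustrative entry only; not copied from the dictionary files.
YAHARI = Lemma(
    headword="矢張り",
    forms=[
        Form(reading="ヤハリ", spellings=["やはり", "矢張り"]),
        Form(reading="ヤッパリ", spellings=["やっぱり"]),
    ],
)

if __name__ == "__main__":
    print(classify_variants(YAHARI, "やはり", "矢張り"))
    print(classify_variants(YAHARI, "やはり", "やっぱり"))
```

In this sketch the lookup is a plain dictionary built per call, which is enough to show the distinction; it assumes each spelling appears under only one form of a lemma.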
Download unidic-mecab
Architecture | Package Size | Installed Size | Files |
---|---|---|---|
all | 94,664.1 kB | 1,444,023 kB | [list of files] |