Skip to content
Sections
>> Trisquel >> Balíky >> etiona >> python >> python3-jieba
etiona  ] [  nabia  ] [  aramo  ]
[ Zdroj: python-jieba  ]

Balík: python3-jieba (0.39-1)

Jieba Chinese text segmenter (Python 3)

"Jieba" (Chinese for "to stutter")is a high-accuracy Chinese text segmenteran based on HMM-model and Viterbi algorithm. It uses dynamic programming to find the most probable combination based on the word frequency.

It supports three types of segmentation mode:

 * Accurate Mode attempts to cut the sentence into the most accurate
   segmentations, which is suitable for text analysis.
 * Full Mode gets all the possible words from the sentence. Fast but not
   accurate.
 * Search Engine Mode, based on the Accurate Mode, attempts to cut long words
   into several short words, which can raise the recall rate. Suitable for
   search engines.
Traditional Chinese and customized dictionaries are also supported.

This package installs the library for Python 3.

Ostatné balíky súvisiace s balíkom python3-jieba

  • závisí
  • odporúča
  • navrhuje
  • dep: python3
    interactive high-level object-oriented language (default python3 version)

Stiahnuť python3-jieba

Stiahnuť pre všetky dostupné architektúry
Architektúra Veľkosť balíka Nainštalovaná veľkosť Súbory
all 4,814.2 kB24715 kB [zoznam súborov]