Paket: frog (0.13.7-1build2)
Länkar för frog
Trisquelresurser:
Hämta källkodspaketet frog:
Ansvarig:
Original Maintainers:
- Debian Science Team (E-postarkiv)
- Maarten van Gompel
- Ko van der Sloot
Externa resurser:
- Hemsida [languagemachines.github.io]
Liknande paket:
tagger and parser for natural languages (runtime)
Memory-Based Learning (MBL) is a machine-learning method applicable to a wide range of tasks in Natural Language Processing (NLP).
Frog is a modular system integrating a morphosyntactic tagger, lemmatizer, morphological analyzer, and dependency parser for natural languages. It is based upon it's predecessor TADPOLE (TAgger, Dependency Parser, and mOrphoLogical analyzEr). Using Memory-Based Learning techniques, frog tokenizes, tags, lemmatizes, and morphologically segments word tokens in incoming UTF-8 text files, and assigns a dependency graph to each sentence. Frog is particularly targeted at the increasing need for fast, automatic NLP systems applicable to very large (multi-million to billion word) document collections that are becoming available due to the progressive digitization of both new and old textual data. Up to now, frog has only been tested and used using corpora of Dutch natural language (see the frogdata package for samples).
Frog is a product of the ILK Research Group (Tilburg University, The Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium).
If you do scientific research in NLP, Frog will likely be of use to you.
Andra paket besläktade med frog
|
|
|
-
- dep: libc6 (>= 2.14) [amd64]
- GNU C Library: Shared libraries
också ett virtuellt paket som tillhandahålls av libc6-udeb
- dep: libc6 (>= 2.4) [i386]
-
- dep: libfolia6
- Implementation of the FoLiA document format
-
- dep: libfrog1
- tagger and parser for Dutch language (library)
-
- dep: libgcc1 (>= 1:3.0)
- GCC support library
-
- dep: libicu60 (>= 60.1-1~)
- International Components for Unicode
-
- dep: libmbt1
- memory-based tagger-generator and tagger - runtime
-
- dep: libstdc++6 (>= 5.2)
- GNU Standard C++ Library v3
-
- dep: libticcutils2v5
- library for TiCC software - runtime files
-
- dep: libtimbl4
- Tilburg Memory Based Learner - runtime
-
- dep: libucto2
- Unicode Tokenizer - runtime
-
- rec: ucto
- Unicode Tokenizer
Hämta frog
Arkitektur | Paketstorlek | Installerad storlek | Filer |
---|---|---|---|
amd64 | 53,4 kbyte | 273 kbyte | [filförteckning] |
i386 | 54,7 kbyte | 265 kbyte | [filförteckning] |