Skip to content
Sections
>> Trisquel >> Paquets >> aramo >> gnu-r >> r-cran-tokenizers
aramo  ]
[ Paquet source : r-cran-tokenizers  ]

Paquet : r-cran-tokenizers (0.2.1-3)

GNU R fast, consistent tokenization of natural language text

Convert natural language text into tokens. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, tweets, Penn Treebank, regular expressions, as well as functions for counting characters, words, and sentences, and a function for splitting longer texts into separate documents, each with the same number of words. The tokenizers have a consistent interface, and the package is built on the 'stringi' and 'Rcpp' packages for fast yet correct tokenization in 'UTF-8'.

Autres paquets associés à r-cran-tokenizers

  • dépendances
  • recommandations
  • suggestions
  • dep: libc6 (>= 2.14) [amd64]
    GNU C Library: Shared libraries
    un paquet virtuel est également fourni par libc6-udeb
    dep: libc6 (>= 2.17) [arm64, ppc64el]
    dep: libc6 (>= 2.4) [armhf]
  • dep: libgcc-s1 (>= 3.3.1) [non armhf]
    GCC support library
    dep: libgcc-s1 (>= 3.5) [armhf]
  • dep: libstdc++6 (>= 11)
    GNU Standard C++ Library v3
  • dep: r-api-4.0
    paquet virtuel fourni par r-base-core
  • dep: r-base-core (>= 4.1.1-2)
    GNU R core of statistical computation and graphics system
  • dep: r-cran-rcpp (>= 0.12.3)
    GNU R package for Seamless R and C++ Integration
  • dep: r-cran-snowballc (>= 0.5.1)
    Snowball stemmers based on the C libstemmer UTF-8 library
  • dep: r-cran-stringi (>= 1.0.1)
    GNU R character string processing facilities
  • sug: r-cran-covr
    test coverage for GNU R packages
  • sug: r-cran-knitr
    GNU R package for dynamic report generation using Literate Programming
  • sug: r-cran-rmarkdown
    convert R markdown documents into a variety of formats

Télécharger r-cran-tokenizers

Télécharger pour toutes les architectures proposées
Architecture Taille du paquet Espace occupé une fois installé Fichiers
amd64 637,6 ko817 ko [liste des fichiers]
arm64 638,4 ko817 ko [liste des fichiers]
armhf 646,5 ko812 ko [liste des fichiers]
ppc64el 646,2 ko864 ko [liste des fichiers]