textrecipes: Extra 'Recipes' for Text Processing

Converting text to numerical features requires specifically created procedures, which are implemented as steps according to the 'recipes' package. These steps allows for tokenization, filtering, counting (tf and tfidf) and feature hashing.

Version: 0.4.1
Depends: R (≥ 2.10), recipes (≥ 0.1.15)
Imports: dplyr, generics (≥ 0.1.0), magrittr, Matrix, purrr, Rcpp, rlang, SnowballC, tibble, tidyr, tokenizers, vctrs
LinkingTo: Rcpp
Suggests: covr, janitor, knitr, modeldata, rmarkdown, spacyr, stopwords, testthat (≥ 2.1.0), text2vec, textfeatures (≥ 0.3.3), stringi, tokenizers.bpe, udpipe
Published: 2021-07-11
Author: Emil Hvitfeldt ORCID iD [aut, cre]
Maintainer: Emil Hvitfeldt <emilhhvitfeldt at gmail.com>
BugReports: https://github.com/tidymodels/textrecipes/issues
License: MIT + file LICENSE
URL: https://github.com/tidymodels/textrecipes, https://textrecipes.tidymodels.org
NeedsCompilation: yes
SystemRequirements: GNU make, C++11
Materials: README NEWS
CRAN checks: textrecipes results


Reference manual: textrecipes.pdf
Vignettes: Working with n-grams
Cookbook - Using more complex recipes involving text
Under the hood - tokenlist


Package source: textrecipes_0.4.1.tar.gz
Windows binaries: r-devel: textrecipes_0.4.1.zip, r-devel-UCRT: textrecipes_0.4.1.zip, r-release: textrecipes_0.4.1.zip, r-oldrel: textrecipes_0.4.1.zip
macOS binaries: r-release (arm64): textrecipes_0.4.1.tgz, r-release (x86_64): textrecipes_0.4.1.tgz, r-oldrel: textrecipes_0.4.1.tgz
Old sources: textrecipes archive


Please use the canonical form https://CRAN.R-project.org/package=textrecipes to link to this page.