A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and ngrams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.

Documentation

Manual: quanteda.pdf
Vignette: Getting Started Guide

Maintainer: Kenneth Benoit <kbenoit at lse.ac.uk>

Author(s): Kenneth Benoit*, Kohei Watanabe*, Paul Nulty*, Adam Obeng*, Haiyan Wang*, Benjamin Lauderdale*, Will Lowe*

Install package and any missing dependencies by running this line in your R console:

install.packages("quanteda")

Depends R (>= 3.4.0), methods
Imports utils, stats, Matrix(>=1.2), data.table(>=1.9.6), SnowballC, wordcloud, Rcpp(>=0.12.12), RcppParallel, RSpectra, stringi, fastmatch, ggplot2(>=2.2.0), XML, yaml, lubridate, magrittr
Suggests knitr, rmarkdown, lda, proxy, topicmodels, tm(>=0.6), slam, testthat, RColorBrewer, xtable, DT, ca, purrr
Enhances
Linking to Rcpp, RcppParallel, RcppArmadillo(>=0.7.600.1.0)
Reverse
depends
word.alignment
Reverse
imports
clustRcompaR, gofastr, preText, stm, textstem
Reverse
suggests
corpustools, phrasemachine, readtext, tidytext
Reverse
enhances
corpus
Reverse
linking to

Package quanteda
Materials
URL http://quanteda.io
Task Views NaturalLanguageProcessing
Version 0.99.12
Published 2017-10-06
License GPL-3
BugReports https://github.com/kbenoit/quanteda/issues
SystemRequirements C++11
NeedsCompilation yes
Citation
CRAN checks quanteda check results
Package source quanteda_0.99.12.tar.gz