tosca: Tools for Statistical Content Analysis

A framework for statistical content analysis. It provides a pipeline for preprocessing text corpora and passing them to the latent Dirichlet allocation (LDA) implementation in the 'lda' package, plots for the descriptive analysis of text corpora and topic models, and an implementation of Chang's intruder words and intruder topics.
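
The intruder words idea mentioned above can be illustrated with a short, language-neutral sketch. It is written in Python rather than R and does not use or mirror the tosca API: the function name build_intruder_task, its parameters, and the toy topic data are all hypothetical, chosen only for illustration. The assumed setup is the usual one for this kind of check: show a reader the top words of one topic plus one "intruder" word that is unlikely in that topic but prominent in another, and see whether the reader can spot it.

```python
import random

# Toy topic-word probabilities (hypothetical data, not produced by tosca or lda).
TOPICS = {
    "sports": {"goal": 0.40, "team": 0.30, "match": 0.20,
               "vote": 0.05, "party": 0.03, "law": 0.02},
    "politics": {"vote": 0.40, "party": 0.30, "law": 0.20,
                 "goal": 0.05, "team": 0.03, "match": 0.02},
}


def build_intruder_task(topic_word_probs, topic, n_top=5, seed=0):
    """Return (shown_words, intruder) for one topic.

    shown_words holds the n_top most probable words of `topic` plus one
    intruder, in shuffled order. The intruder is drawn from words that are
    among the top words of some other topic and fall in the lower half of
    the ranking for `topic`.
    """
    rng = random.Random(seed)
    probs = topic_word_probs[topic]

    # Rank the topic's words from most to least probable.
    ranked = sorted(probs, key=probs.get, reverse=True)
    top_words = ranked[:n_top]
    low_here = set(ranked[len(ranked) // 2:])

    # Collect candidate intruders from the top words of the other topics.
    # Words absent from this topic's dictionary are never candidates.
    candidates = set()
    for other, other_probs in topic_word_probs.items():
        if other == topic:
            continue
        other_top = sorted(other_probs, key=other_probs.get, reverse=True)[:n_top]
        candidates.update(
            w for w in other_top if w in low_here and w not in top_words
        )
    if not candidates:
        raise ValueError("no suitable intruder word found for topic %r" % topic)

    # Sort before choosing so the result depends only on the seed.
    intruder = rng.choice(sorted(candidates))
    shown = top_words + [intruder]
    rng.shuffle(shown)
    return shown, intruder


if __name__ == "__main__":
    shown, intruder = build_intruder_task(TOPICS, "sports", n_top=3, seed=1)
    print("Shown to the reader:", shown)
    print("Intruder:", intruder)
```

If readers reliably pick out the intruder, the topic's top words are coherent enough to be told apart from an outsider. The package's own functions for this task are documented in the reference manual and vignette listed below.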

Version: 0.2-0
Depends: R (≥ 3.5.0)
Imports: tm (≥ 0.7-5), lda (≥ 1.4.2), quanteda (≥ 1.4.0), lubridate (≥ 1.7.3), htmltools (≥ 0.3.6), RColorBrewer (≥ 1.1-2), stringr (≥ 1.3.1), WikipediR (≥ 1.5.0), data.table (≥ 1.11.4)
Suggests: testthat (≥ 2.0.0), knitr (≥ 1.20), devtools (≥ 1.13), rmarkdown (≥ 1.9)
Published: 2020-03-10
Author: Lars Koppers [aut, cre], Jonas Rieger [aut], Karin Boczek [ctb], Gerret von Nordheim [ctb]
Maintainer: Lars Koppers <koppers at>
License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
NeedsCompilation: no
Citation: tosca citation info
CRAN checks: tosca results


Reference manual: tosca.pdf
Vignettes: Vignette tosca
Package source: tosca_0.2-0.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release: tosca_0.2-0.tgz, r-oldrel: tosca_0.2-0.tgz
Old sources: tosca archive

Reverse dependencies:

Reverse suggests: ldaPrototype

