tosca: Tools for Statistical Content Analysis

A framework for statistical analysis in content analysis. In addition to a pipeline for preprocessing text corpora and linking to the latent Dirichlet allocation from the 'lda' package, plots are offered for the descriptive analysis of text corpora and topic models. In addition, an implementation of Chang's intruder words and intruder topics is provided. Sample data for the vignette is included in the toscaData package, which is available on gitHub: <>.

Version: 0.3-2
Depends: R (≥ 3.5.0)
Imports: tm (≥ 0.7-5), lda (≥ 1.4.2), quanteda (≥ 1.4.0), lubridate (≥ 1.7.3), htmltools (≥ 0.3.6), RColorBrewer (≥ 1.1-2), stringr (≥ 1.3.1), WikipediR (≥ 1.5.0), data.table (≥ 1.11.4)
Suggests: toscaData, testthat (≥ 2.0.0), knitr (≥ 1.20), devtools (≥ 1.13), rmarkdown (≥ 1.9)
Published: 2021-10-28
DOI: 10.32614/CRAN.package.tosca
Author: Lars Koppers ORCID iD [aut, cre], Jonas Rieger ORCID iD [aut], Karin Boczek ORCID iD [ctb], Gerret von Nordheim ORCID iD [ctb]
Maintainer: Lars Koppers <koppers at>
License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
NeedsCompilation: no
Citation: tosca citation info
CRAN checks: tosca results


Reference manual: tosca.pdf
Vignettes: Vignette tosca


Package source: tosca_0.3-2.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release (arm64): tosca_0.3-2.tgz, r-oldrel (arm64): tosca_0.3-2.tgz, r-release (x86_64): tosca_0.3-2.tgz, r-oldrel (x86_64): tosca_0.3-2.tgz
Old sources: tosca archive

Reverse dependencies:

Reverse imports: rollinglda
Reverse suggests: ldaPrototype


Please use the canonical form to link to this page.