A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Support for additional languages can be added on-the-fly or by plugin packages. Note: For full functionality a local installation of TreeTagger is recommended. 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full use of this feature, please install RKWard from (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help, report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing list ().

Documentation

Manual: koRpus.pdf
Vignette: Using the koRpus Package for Text Analysis

Maintainer: m.eik michalke <meik.michalke at hhu.de>

Author(s): m.eik michalke*, Earl Brown*, Alberto Mirisola*, Alexandre Brulet*, Laura Hauser*

Install package and any missing dependencies by running this line in your R console:

install.packages("koRpus")

Depends R (>= 2.10.0), methods, data.table
Imports
Suggests testthat, tm, SnowballC, shiny
Enhances rkward
Linking to
Reverse
depends
Reverse
imports
textmining, textstem
Reverse
suggests
pander, qdap
Reverse
enhances
Reverse
linking to

Package koRpus
Materials
URL https://reaktanz.de/?c=hacking&s=koRpus
Task Views NaturalLanguageProcessing
Version 0.10-1
Published 2017-03-02
License GPL (>= 3)
BugReports
SystemRequirements
NeedsCompilation no
Citation
CRAN checks koRpus check results
Package source koRpus_0.10-1.tar.gz