Provides fast, correct, consistent, portable, and convenient character string and text processing in every locale and any native encoding. Because it builds on the ICU library, the package gives R users platform-independent functions familiar to Java, Perl, Python, PHP, and Ruby programmers. Available features include pattern searching (e.g., with ICU Java-like regular expressions or the Unicode Collation Algorithm), random string generation, case mapping, string transliteration, concatenation, Unicode normalization, and date-time formatting and parsing.
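A few of the features listed above can be tried as follows. This is a minimal sketch, assuming stringi is already installed; the sample strings and variable names are made up for illustration and are not taken from the package documentation.

```r
library(stringi)

# Pattern searching with ICU regular expressions (vectorized over the input)
found <- stri_detect_regex(c("apple", "banana"), "^a")

# Case mapping: "\u00df" is the German sharp s, which upper-cases to "SS"
upper <- stri_trans_toupper("stra\u00dfe", locale = "de_DE")

# Transliteration with the ICU "Latin-ASCII" transform ("\u0105" is a with ogonek)
ascii <- stri_trans_general("G\u0105golewski", "Latin-ASCII")

# Unicode normalization: "a" + combining ogonek composes to one code point under NFC
n_chars <- stri_length(stri_trans_nfc("a\u0328"))

# Concatenation
joined <- stri_join("ICU", "4C", sep = "")

# Random string generation: two strings of 8 lowercase ASCII letters each
rand <- stri_rand_strings(2, 8, "[a-z]")

# Date-time formatting with an ICU pattern
stamp <- stri_datetime_format(as.POSIXct("2017-04-07", tz = "UTC"),
                              "yyyy-MM-dd", tz = "UTC")

print(list(found, upper, ascii, n_chars, joined, rand, stamp))
```

The escapes such as `"\u00df"` keep the script itself ASCII-only, so it behaves the same regardless of the native encoding of the session.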


Manual: stringi.pdf
Vignette: None available.

Maintainer: Marek Gagolewski <gagolews at>

Author(s): Marek Gagolewski*, Bartek Tartanus*, and other contributors (stringi source code); IBM and other contributors (ICU4C 55.1 source code); Unicode, Inc. (Unicode Character Database)

Install package and any missing dependencies by running this line in your R console:
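A minimal sketch, assuming installation from the default CRAN repository configured in your R session (the `requireNamespace` guard is an addition for illustration, so that the package is not reinstalled when it is already present):

```r
# Install stringi only if it is not already available.
# install.packages() also pulls in any missing dependencies by default.
if (!requireNamespace("stringi", quietly = TRUE)) {
  install.packages("stringi")
}

# Load the package to confirm the installation worked
library(stringi)
```

Because NeedsCompilation is yes, installing from source requires a working C/C++ toolchain; pre-built binaries, where available for your platform, avoid that step.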


Package: stringi
Task Views: NaturalLanguageProcessing
Version: 1.1.5
Published: 2017-04-07
License: file LICENSE
SystemRequirements: ICU4C (>= 52, optional)
NeedsCompilation: yes
CRAN checks: stringi check results
Package source: stringi_1.1.5.tar.gz