Implements an approximate string matching version of R's native 'match' function. It can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal string alignment), q-grams (q-gram, cosine, Jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors, taking proper care of encoding, or between integer vectors representing generic sequences.
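A minimal sketch of the features described above, assuming the package is installed and that the function names match the reference manual (stringdist, amatch, phonetic, seq_dist). The example strings are illustrative and not taken from the package documentation.

```r
# Assumes the stringdist package is already installed.
library(stringdist)

# Edit-based distance: Levenshtein counts insertions, deletions and substitutions.
d_lv <- stringdist("kitten", "sitting", method = "lv")   # 3

# Heuristic metric: Jaro-Winkler is requested with method "jw" and a prefix weight p > 0.
d_jw <- stringdist("martha", "marhta", method = "jw", p = 0.1)

# Approximate version of match(): index of the closest entry in the lookup
# table within maxDist, or NA if no entry is close enough.
idx <- amatch("leia", c("uhura", "leela"), maxDist = 5)  # 2

# Soundex codes: names that sound alike map to the same code.
codes <- phonetic(c("Robert", "Rupert"))                 # "R163" "R163"

# Distance between integer vectors representing generic sequences.
d_seq <- seq_dist(c(1L, 2L, 3L), c(1L, 2L, 4L), method = "hamming")
```

The maxDist argument is set explicitly here because amatch is strict by default and would otherwise return NA for this input.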

Documentation

Manual: stringdist.pdf
Vignette: None available.

Maintainer: Mark van der Loo <mark.vanderloo at gmail.com>

Author(s): Mark van der Loo*, Jan van der Laan*, R Core Team*, Nick Logan*

Install package and any missing dependencies by running this line in your R console:

install.packages("stringdist")

Depends R (>= 2.15.3)
Imports parallel
Suggests testthat
Enhances
Linking to
Reverse depends brewdata, vwr
Reverse imports bcRep, deductive, diffrprojects, fuzzyjoin, lingtypology, lintr, PGRdup, qdap, sjmisc, tcR, TSTr
Reverse suggests rlist, sprint
Reverse enhances
Reverse linking to

Package stringdist
Materials
URL https://github.com/markvanderloo/stringdist
Task Views OfficialStatistics
Version 0.9.4.4
Published 2016-12-16
License GPL-3
BugReports https://github.com/markvanderloo/stringdist/issues
SystemRequirements
NeedsCompilation yes
Citation
CRAN checks stringdist check results
Package source stringdist_0.9.4.4.tar.gz