nc: Named Capture to Data Tables

User-friendly functions for extracting a data table (row for each match, column for each group) from non-tabular text data using regular expressions, and for melting columns that match a regular expression. Patterns are defined using a readable syntax that makes it easy to build complex patterns in terms of simpler, re-usable sub-patterns. Named R arguments are translated to column names in the output; capture groups without names are used internally in order to provide a standard interface to three regular expression C libraries (PCRE, RE2, ICU). Output can also include numeric columns via user-specified type conversion functions. RE2 engine (re2r package) was removed from CRAN in Mar 2020 so must be installed from github.

Version: 2020.3.23
Depends: R (≥ 2.14)
Imports: data.table
Suggests: testthat, re2r, stringi, ggplot2, tidyr (≥ 1.0.0), cdata, dplyr, reshape2, knitr, R.utils
Published: 2020-03-25
Author: Toby Dylan Hocking
Maintainer: Toby Dylan Hocking <toby.hocking at>
License: GPL-3
NeedsCompilation: no
Materials: NEWS
CRAN checks: nc results


Reference manual: nc.pdf
Vignettes: vignette 1: capture first match
vignette 2: capture all matches
vignette 3: capture melt
vignette 4: comparisons with other packages
Package source: nc_2020.3.23.tar.gz
Windows binaries: r-prerelease:, r-release:, r-oldrel:
macOS binaries: r-prerelease: nc_2020.3.23.tgz, r-release: nc_2020.3.23.tgz, r-oldrel: nc_2020.2.27.tgz
Old sources: nc archive


Please use the canonical form to link to this page.