| Type: | Package | 
| Title: | Basic Pattern Analysis | 
| Version: | 0.1.1 | 
| Date: | 2016-04-03 | 
| Description: | Run basic pattern analyses on character sets, digits, or combined input containing both characters and numeric digits. Useful for data cleaning and for identifying columns containing multiple or nonstandard formats. | 
| Depends: | base | 
| Imports: | magrittr, plyr | 
| Suggests: | testthat, knitr, rmarkdown | 
| License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] | 
| URL: | https://github.com/bgreenwell/bpa | 
| BugReports: | https://github.com/bgreenwell/bpa/issues | 
| RoxygenNote: | 5.0.1 | 
| VignetteBuilder: | knitr | 
| NeedsCompilation: | no | 
| Packaged: | 2016-04-03 23:53:13 UTC; w108bmg | 
| Author: | Brandon Greenwell [aut, cre] | 
| Maintainer: | Brandon Greenwell <greenwell.brandon@gmail.com> | 
| Repository: | CRAN | 
| Date/Publication: | 2016-04-04 08:37:03 | 
Pipe operator
Description
See %>% for more details.
Usage
lhs %>% rhs
Basic Pattern Analysis
Description
Perform a basic pattern analysis
Usage
get_pattern(x, show_ws = TRUE, ws_char = "w")
basic_pattern_analysis(x, unique_only = FALSE, show_ws = TRUE,
  ws_char = "w", useNA = c("no", "ifany", "always"), ...)
## Default S3 method:
basic_pattern_analysis(x, unique_only = FALSE,
  show_ws = TRUE, ws_char = "w", useNA = c("no", "ifany", "always"), ...)
## S3 method for class 'data.frame'
basic_pattern_analysis(x, unique_only = FALSE,
  show_ws = TRUE, ws_char = "w", useNA = c("no", "ifany", "always"), ...)
bpa(x, ...)
Arguments
| x | A data frame or character vector. | 
| show_ws | Logical indicating whether or not to show whitespace
using a special character. Default is  | 
| ws_char | Character string to use to depict whitespace when 
 | 
| unique_only | Logical indicating whether or not to only show the unique
patterns. Default is  | 
| useNA | Logical indicating whether to include  | 
| ... | Additional optional arguments to be passed onto  | 
Examples
basic_pattern_analysis(iris)
basic_pattern_analysis(iris, unique_only = TRUE)
Pattern Matching
Description
Extract values from a vector that match a particular pattern.
Usage
match_pattern(x, pattern, unique_only = FALSE, ...)
Arguments
| x | A vector, typically of class  | 
| pattern | Character string specifying the particular pattern to match. | 
| unique_only | Logical indicating whether or not to only return unique
values. Default is  | 
| ... | Additional optional arguments to ba passed onto 
 | 
Details
The pattern specified by the required argument pattern must be a valid
pattern produced by the get_pattern function. That is, all digits
should be represented by a "9", lowercase/uppercase letters by a 
"a"/"A", etc.
Examples
phone <- c("123-456-7890", "456-7890", "123-4567", "456-7890")
match_pattern(phone, pattern = "999-9999")
match_pattern(phone, pattern = "999-9999", unique_only = TRUE)
Simulated Data
Description
Simulated (messy) data set to help illustrate some of the uses of basic pattern analysis.
Format
A data frame with 1000 rows and 3 variables
Details
-  GenderGender in various formats.
-  DateDates in various formats.
- Phone Phone numbers in various formats. 
Examples
data(messy)
bpa(messy, unique_only = TRUE, ws_char = " ")
Remove Leading/Trailing Whitespace
Description
Remove leading and/or trailing whitespace from character strings.
Usage
trim_ws(x, which = c("both", "left", "right"))
Arguments
| x | A data frame or vector. | 
| which | A character string specifying whether to remove both leading and
trailing whitespace (default), or only leading ( | 
Examples
# Toy example
d <- data.frame(x = c(" a ", "b ", "c"),
                y = c("   1 ", "2", " 3"),
                z = c(4, 5, 6))
print(d)  # print data as is
trim_ws(d)  # print data with whitespace trimmed off
sapply(trim_ws(d), class)  # check that column types are preserved