Basic Functions |
|
---|---|
recipes: A package for computing and preprocessing design matrices. |
|
Create a recipe for preprocessing data |
|
Create a formula from a prepared recipe |
|
Print a Recipe |
|
Summarize a recipe |
|
Estimate a preprocessing recipe |
|
Apply a trained preprocessing recipe |
|
Extract transformed training set |
|
Methods for selecting variables in step functions |
|
|
Role Selection |
Manually alter roles |
|
Update role specific requirements |
|
|
Helpers for steps with case weights |
Using case weights with recipes |
|
Step Functions - Imputation |
|
Impute via bagged trees |
|
Impute via k-nearest neighbors |
|
Impute numeric variables via a linear model |
|
Impute numeric data below the threshold of measurement |
|
Impute numeric data using the mean |
|
Impute numeric data using the median |
|
Impute nominal data using the most common value |
|
Impute numeric data using a rolling window statistic |
|
Assign missing categories to "unknown" |
|
Step Functions - Individual Transformations |
|
Box-Cox transformation for non-negative data |
|
B-spline basis functions |
|
Add sin and cos terms for harmonic analysis |
|
Hyperbolic transformations |
|
Inverse transformation |
|
Inverse logit transformation |
|
Logarithmic transformation |
|
Logit transformation |
|
Add new variables using dplyr |
|
Natural spline basis functions |
|
Orthogonal polynomial basis functions |
|
Generalized bernstein polynomial basis |
|
Apply (smoothed) rectified linear transformation |
|
Basis splines |
|
Convex splines |
|
Monotone splines |
|
Natural splines |
|
Non-negative splines |
|
Square root transformation |
|
Yeo-Johnson transformation |
|
Step Functions - Discretization |
|
Discretize Numeric Variables |
|
Discretize Numeric Variables |
|
Cut a numeric variable into a factor |
|
Step Functions - Dummy Variables and Encodings |
|
Create a factors from A dummy variable |
|
Create counts of patterns using regular expressions |
|
Create traditional dummy variables |
|
Extract patterns from nominal data |
|
Handle levels in multiple predictors together |
|
Convert factors to strings |
|
Create missing data column indicators |
|
Convert values to predefined integers |
|
Simple value assignments for novel factor levels |
|
Convert numbers to factors |
|
Convert ordinal factors to numeric scores |
|
Collapse infrequent categorical levels |
|
Percentile transformation |
|
Detect a regular expression |
|
Relevel factors to a desired level |
|
Convert strings to factors |
|
Assign missing categories to "unknown" |
|
Convert ordered factors to unordered factors |
|
Step Functions - Date and Datetime |
|
Date feature generator |
|
Time feature generator |
|
Holiday feature generator |
|
Step Functions - Interactions |
|
Create interaction variables |
|
Step Functions - Normalization |
|
Centering numeric data |
|
Center and scale numeric data |
|
Scaling numeric data to a specific range |
|
Scaling numeric data |
|
Step Functions - Multivariate Transformations |
|
Distances to class centroids |
|
Compute shrunken centroid distances for classification models |
|
Data depths |
|
Distance between two locations |
|
ICA signal extraction |
|
Isomap embedding |
|
Kernel PCA signal extraction |
|
Polynomial kernel PCA signal extraction |
|
Radial basis function kernel PCA signal extraction |
|
Mutate multiple columns using dplyr |
|
Non-negative matrix factorization signal extraction |
|
Non-negative matrix factorization signal extraction with lasso penalization |
|
PCA signal extraction |
|
Partial least squares feature extraction |
|
Ratio variable creation |
|
Spatial sign preprocessing |
|
Step Functions - Filters |
|
High correlation filter |
|
Missing value column filter |
|
Linear combination filter |
|
Near-zero variance filter |
|
General variable filter |
|
Select variables using dplyr |
|
Zero variance filter |
|
Step Functions - Row Operations |
|
Sort rows using dplyr |
|
Filter rows using dplyr |
|
Create a lagged predictor |
|
Remove observations with missing values |
|
Impute numeric data using a rolling window statistic |
|
Sample rows using dplyr |
|
Shuffle variables |
|
Filter rows by position using dplyr |
|
Step Functions - Others |
|
Add intercept (or constant) column |
|
Create a profiling version of a data set |
|
Rename variables by name using dplyr |
|
Rename multiple columns using dplyr |
|
Moving window functions |
|
Check Functions |
|
Check variable class |
|
Check if all columns are present |
|
Check for missing values |
|
Check for new values |
|
Check range consistency |
|
Developer Functions |
|
Developer functions for creating recipes steps |
|
Add a New Operation to the Current Recipe |
|
Detect if a particular step or check is used in a recipe |
|
Check to see if a recipe is trained/prepared |
|
Get types for use in recipes |
|
Naming Tools |
|
Wrapper function for preparing recipes within resampling |
|
Evaluate a selection with tidyselect semantics for arguments |
|
Evaluate a selection with tidyselect semantics specific to recipes |
|
Checks that steps have all S3 methods |
|
Prototype of recipe object |
|
Validate prototype of recipe object |
|
Role indicators |
|
Using sparse data with recipes |
|
Update a recipe step |
|
Tidy Methods |
|
|
Tidy the result of a recipe |