aggregate()
Aggregate data based on given grouping.
transform()
Transform observation variables.
normalize()
Normalize observation variables.
variable_select()
Select observation variables.
variable_importance()
Measure variable importance.
drop_na_rows()
Drop rows that are NA in all specified variables.
NA
mark_outlier_rows()
Mark outlier rows.
correlation_threshold()
Remove redundant variables.
drop_na_columns()
Remove variables with NA values.
variance_threshold()
Remove variables with near-zero variance.
covariance()
Compute covariance matrix and vectorize.
generalized_log()
Generalized log transform data.
husk()
Husk data.
sparse_random_projection()
Reduce the dimensionality of a population using sparse random projection.
spherize()
Spherize data.
replicate_correlation()
Measure replicate correlation of variables.
svd_entropy()
Feature importance based on data entropy.
extract_subpopulations()
Extract subpopulations.
stratify()
Stratify operations.
count_na_rows()
Count the number of NAs per variable.
generate_component_matrix()
A sparse matrix for sparse random projection.
find_significant_pcs()
Find significant PC's given the eigenvalues.