remove_redundancy_elements_through_correlation {tidybulk} | R Documentation |
Drop redundant elements (e.g., samples) for which feature (e.g., genes) aboundances are correlated
remove_redundancy_elements_through_correlation( .data, .element = NULL, .feature = NULL, .abundance = NULL, correlation_threshold = 0.9, top = Inf, of_samples = TRUE, log_transform = FALSE )
.data |
A tibble |
.element |
A column symbol. The column that is used to calculate distance (i.e., normally samples) |
.feature |
A column symbol. The column that is represents entities to cluster (i.e., normally genes) |
.abundance |
A column symbol with the value the clustering is based on (e.g., 'count') |
correlation_threshold |
A real number between 0 and 1 |
top |
An integer. How many top genes to select |
of_samples |
A boolean |
log_transform |
A boolean, whether the value should be log-transformed (e.g., TRUE for RNA sequencing data) |
A tibble with redundant elements removed