remove_redundancy_elements_through_correlation {tidybulk}R Documentation

Drop redundant elements (e.g., samples) for which feature (e.g., genes) aboundances are correlated

Description

Drop redundant elements (e.g., samples) for which feature (e.g., genes) aboundances are correlated

Usage

remove_redundancy_elements_through_correlation(
  .data,
  .element = NULL,
  .feature = NULL,
  .abundance = NULL,
  correlation_threshold = 0.9,
  top = Inf,
  of_samples = TRUE,
  log_transform = FALSE
)

Arguments

.data

A tibble

.element

A column symbol. The column that is used to calculate distance (i.e., normally samples)

.feature

A column symbol. The column that is represents entities to cluster (i.e., normally genes)

.abundance

A column symbol with the value the clustering is based on (e.g., 'count')

correlation_threshold

A real number between 0 and 1

top

An integer. How many top genes to select

of_samples

A boolean

log_transform

A boolean, whether the value should be log-transformed (e.g., TRUE for RNA sequencing data)

Value

A tibble with redundant elements removed


[Package tidybulk version 1.4.0 Index]