
Extract Absolute Copy-Number Features
ExtractCopyNumberFeatures.RdExtract genome-wide copy-number features data from either a list of dataframes or a QDNAseq S4 object for 1 or more samples. This function is intended to be run on absolute CN data.
Usage
ExtractCopyNumberFeatures(
CN_data,
genome,
cores = 1,
log_features = FALSE,
extra_features = FALSE
)Arguments
- CN_data
List of dataframes or S4 QDNAseq object. Segmented copy-number data for for 1 or more samples. If input is a list of dataframes, columns should be:
chromosome
start
end
segVal
- genome
Character string. The reference genome used for alignment.
Options: 'hg19', 'hg38'- cores
Integer. The number of cores to use for parallel processing.
- log_features
FALSE or char vector. If a vector of feature names is provided, take the log1p of these extracted CN-features.
- extra_features
Logical. If TRUE, extracts CN-feature data for two more features: nc50, and cdist.
Details
The extracted copy-number features are:
Breakpoint count per 10MB -
bp10MBCopy-number value of each segment -
copynumberCopy-number difference between adjacent segments -
changepointBreakpoint count per chromosome arm -
bpchrarmLengths of oscillating CN segment chains -
osCNSize of copy-number segments in base-pairs -
segsize
Extra features:
7. Minimum number of chromosomes (a count) needed to account for 50% of CN changes in a sample - nc50
8. Distance in base pairs of each breakpoint to the centromere - cdist