This section contains quality control plots and statistics for the methylation data.
Each box plot below shows the signal distribution of quality control probes across all samples. The control box plots are separated by control types. Detailed description of the control probes is given in the RnBeads vignette.
The plots below visualize the exact signal levels at each quality control probe. Note that the scale is not standardized. Background signal is usualy at the level of 1000 to 2000.
SNP-based box and bar plots that facilitate identification of the sample mixups.
SNP Heatmap allows for identification of sample mixups in particular for genetically matched designs.
If we inspect the samples in the space defined by the SNP probes only, samples appear close to each other are genetically similar. We calculated the Manhattan distance between all vectors of SNP probes. The figure below shows the relative distances.
Display samples as |
Scatter plot showing the samples in the first two principal components of the space of their SNP probes.
The full table of all pairwise distances is stored in a dedicated comma-separated file accompanying this report.
RnBeads predicted the gender of the samples in the dataset using a logistic regression model. The results are summarized in the table below.
Gender | Samples | |
1 | female | 2574 |
2 | male | 1460 |
3 | unknown | 0 |
Gender was predicted based on the increase (or decrease) of mean signal intensities in the sex chromosomes w.r.t. the corresponding value in autosomes. The figure below displays these characteristics of the samples.
Colors denote |
Gender prediction based on mean signal increase. The decision boundary between the two genders is visualized by a black line. Sample colors denote predicted male probability / gender.