Chapter 2 Covariates

This Covariates chapter provides flexibility to visualize the covariates in the current dataset, such as Age, Group, Genotype, etc. This dynamic table contains several columns, including the PC, Covariate, Type, Significance, p-value, and FDR for each covariate listed.

The PC column shows the principal component that the covariate belongs to. We will see the same covariate name appears in different PC, such as PC1 and PC2, with different Significance.

The Covariate column displays the name of each covariate, and the terms are self-explainable. For example, some covariates are the sample annotation of the dataset, such as Age, Group, Genotype. On the other hand, some covariates are related to the data analysis steps, such as the metrics from the genome mapping step. If the data is downloaded from GEO and incorporated into the pipeline using recount3, metrics from the recount3 pipeline will also show up in the Covariate column.

The Type column indicates the type of each covariate. Covariates can be Numeric or Categorical types.

The Significance column lists the ANOVA FDR information between each covariate v.s. PC. For Numeric covariates, an r value is associated with each ANOVA test. However, for Categorical covariates, only FDR values are displayed.

The last two columns, P-value and FDR, indicate the results of statistical tests (P-value) and adjusted P-values.

2.1 List of covariates

Table 2.1: Significant covariates
PC Covariate Type Significance P-value FDR
PC1 recount_qc_star_number_of_splices_at_ac Numeric recount_qc_star_number_of_splices_at_ac vs. PC1: r=-0.544; fdr=2.7e-07 0.0000000 0.0000003
PC2 recount_qc_star_percent_of_reads_unmapped_other Numeric recount_qc_star_percent_of_reads_unmapped_other vs. PC2: r=0.408; fdr=1.4e-03 0.0000104 0.0014013
PC1 recount_qc_intron_sum Numeric recount_qc_intron_sum vs. PC1: r=-0.367; fdr=3.8e-03 0.0000882 0.0038327
PC2 recount_qc_star_number_of_reads_unmapped_other Numeric recount_qc_star_number_of_reads_unmapped_other vs. PC2: r=0.364; fdr=3.8e-03 0.0000994 0.0038327
PC1 recount_qc_star_number_of_splices_gc_ag Numeric recount_qc_star_number_of_splices_gc_ag vs. PC1: r=-0.371; fdr=3.8e-03 0.0000725 0.0038327
PC1 recount_qc_junction_count Numeric recount_qc_junction_count vs. PC1: r=-0.365; fdr=3.8e-03 0.0000947 0.0038327
PC2 recount_qc_star_number_of_reads_unmapped_other_both Numeric recount_qc_star_number_of_reads_unmapped_other_both vs. PC2: r=0.364; fdr=3.8e-03 0.0000994 0.0038327
PC1 recount_qc_exon_fc_unique_percent Numeric recount_qc_exon_fc_unique_percent vs. PC1: r=0.355; fdr=4.5e-03 0.0001502 0.0045066
PC1 recount_qc_intron_sum_percent Numeric recount_qc_intron_sum_percent vs. PC1: r=-0.356; fdr=4.5e-03 0.0001457 0.0045066
PC2 recount_qc_bc_frag_mean_length Numeric recount_qc_bc_frag_mean_length vs. PC2: r=0.35; fdr=5.2e-03 0.0001911 0.0051596
PC1 recount_qc_star_percent_mapped_reads_both Numeric recount_qc_star_percent_mapped_reads_both vs. PC1: r=-0.346; fdr=5.7e-03 0.0002311 0.0056714
PC1 recount_qc_aligned_readspercent_chrx Numeric recount_qc_aligned_readspercent_chrx vs. PC1: r=-0.34; fdr=6.1e-03 0.0003010 0.0061227
PC1 recount_qc_star_deletion_average_length Numeric recount_qc_star_deletion_average_length vs. PC1: r=-0.339; fdr=6.1e-03 0.0003047 0.0061227
PC1 recount_seq_qc_percentg Numeric recount_seq_qc_percentg vs. PC1: r=0.339; fdr=6.1e-03 0.0003175 0.0061227
PC1 recount_qc_star_uniquely_mapped_reads_percent Numeric recount_qc_star_uniquely_mapped_reads_percent vs. PC1: r=-0.335; fdr=6.1e-03 0.0003631 0.0061281
PC1 recount_qc_star_uniquely_mapped_reads_percent_both Numeric recount_qc_star_uniquely_mapped_reads_percent_both vs. PC1: r=-0.337; fdr=6.1e-03 0.0003450 0.0061281
PC1 recount_qc_star_percent_reads_unmapped_too_short_both Numeric recount_qc_star_percent_reads_unmapped_too_short_both vs. PC1: r=0.334; fdr=6.2e-03 0.0003908 0.0062076
PC1 recount_qc_gene_fc_unique_percent Numeric recount_qc_gene_fc_unique_percent vs. PC1: r=0.332; fdr=6.3e-03 0.0004206 0.0063086
PC1 recount_qc_star_percent_of_reads_unmapped_too_short Numeric recount_qc_star_percent_of_reads_unmapped_too_short vs. PC1: r=0.325; fdr=7.9e-03 0.0005535 0.0078648
PC3 recount_qc_bc_auc_unique_percent Numeric recount_qc_bc_auc_unique_percent vs. PC3: r=0.306; fdr=1.7e-02 0.0012235 0.0165175
PC1 recount_qc_star_number_of_reads_unmapped_too_short Numeric recount_qc_star_number_of_reads_unmapped_too_short vs. PC1: r=0.301; fdr=1.8e-02 0.0014488 0.0177802
PC1 recount_qc_star_number_of_reads_unmapped_too_short_both Numeric recount_qc_star_number_of_reads_unmapped_too_short_both vs. PC1: r=0.301; fdr=1.8e-02 0.0014488 0.0177802
PC2 recount_qc_star_percent_of_reads_mapped_to_too_many_loci Numeric recount_qc_star_percent_of_reads_mapped_to_too_many_loci vs. PC2: r=0.3; fdr=1.8e-02 0.0015278 0.0179345
PC1 recount_seq_qc_percenta Numeric recount_seq_qc_percenta vs. PC1: r=-0.296; fdr=2.0e-02 0.0017673 0.0198821
PC2 recount_qc_star_percent_reads_mapped_to_too_many_loci_both Numeric recount_qc_star_percent_reads_mapped_to_too_many_loci_both vs. PC2: r=0.292; fdr=2.3e-02 0.0020858 0.0225270
PC2 recount_qc_star_percent_of_reads_mapped_to_multiple_loci Numeric recount_qc_star_percent_of_reads_mapped_to_multiple_loci vs. PC2: r=-0.288; fdr=2.4e-02 0.0024343 0.0237980
PC2 recount_qc_star_number_of_reads_mapped_to_multiple_loci Numeric recount_qc_star_number_of_reads_mapped_to_multiple_loci vs. PC2: r=-0.287; fdr=2.4e-02 0.0024692 0.0237980
PC2 recount_qc_star_number_of_reads_mapped_to_multiple_loci_both Numeric recount_qc_star_number_of_reads_mapped_to_multiple_loci_both vs. PC2: r=-0.287; fdr=2.4e-02 0.0024692 0.0237980
PC2 recount_qc_star_percent_reads_mapped_to_multiple_loci_both Numeric recount_qc_star_percent_reads_mapped_to_multiple_loci_both vs. PC2: r=-0.286; fdr=2.4e-02 0.0025561 0.0237980
PC1 recount_qc_bc_auc_unique_percent Numeric recount_qc_bc_auc_unique_percent vs. PC1: r=0.277; fdr=3.2e-02 0.0036048 0.0324429
PC1 recount_qc_star_percent_of_reads_unmapped_other Numeric recount_qc_star_percent_of_reads_unmapped_other vs. PC1: r=-0.274; fdr=3.4e-02 0.0038764 0.0337622
PC1 recount_qc_star_percent_reads_unmapped_other_both Numeric recount_qc_star_percent_reads_unmapped_other_both vs. PC1: r=-0.273; fdr=3.4e-02 0.0040591 0.0342487
PC2 recount_qc_aligned_readspercent_chrm Numeric recount_qc_aligned_readspercent_chrm vs. PC2: r=-0.267; fdr=4.1e-02 0.0050544 0.0413540
PC1 group Categorical group vs. PC1 ANOVA fdr: 4.9e-02 0.0121922 0.0487687
PC1 sra_sample_attributes Categorical sra_sample_attributes vs. PC1 ANOVA fdr: 4.9e-02 0.0121922 0.0487687
PC1 response Categorical response vs. PC1 ANOVA fdr: 4.9e-02 0.0121922 0.0487687
PC2 recount_qc_junction_count Numeric recount_qc_junction_count vs. PC2: r=0.256; fdr=5.7e-02 0.0072009 0.0571834
PC3 recount_qc_bc_auc_all_percent Numeric recount_qc_bc_auc_all_percent vs. PC3: r=0.255; fdr=5.8e-02 0.0075149 0.0578633
PC2 recount_seq_qc_percentc Numeric recount_seq_qc_percentc vs. PC2: r=0.254; fdr=5.8e-02 0.0077151 0.0578633
PC1 recount_qc_star_percent_reads_mapped_to_multiple_loci_both Numeric recount_qc_star_percent_reads_mapped_to_multiple_loci_both vs. PC1: r=0.25; fdr=6.1e-02 0.0086196 0.0612446
PC1 recount_seq_qc_percentt Numeric recount_seq_qc_percentt vs. PC1: r=-0.251; fdr=6.1e-02 0.0085142 0.0612446
PC1 recount_qc_star_percent_of_reads_mapped_to_multiple_loci Numeric recount_qc_star_percent_of_reads_mapped_to_multiple_loci vs. PC1: r=0.247; fdr=6.4e-02 0.0096109 0.0639623
PC2 recount_qc_star_number_of_reads_mapped_to_too_many_loci Numeric recount_qc_star_number_of_reads_mapped_to_too_many_loci vs. PC2: r=0.247; fdr=6.4e-02 0.0097128 0.0639623
PC2 recount_qc_star_number_of_reads_mapped_to_too_many_loci_both Numeric recount_qc_star_number_of_reads_mapped_to_too_many_loci_both vs. PC2: r=0.247; fdr=6.4e-02 0.0097128 0.0639623
PC1 recount_qc_star_number_of_reads_unmapped_other Numeric recount_qc_star_number_of_reads_unmapped_other vs. PC1: r=-0.244; fdr=6.7e-02 0.0106223 0.0666982
PC1 recount_qc_star_number_of_reads_unmapped_other_both Numeric recount_qc_star_number_of_reads_unmapped_other_both vs. PC1: r=-0.244; fdr=6.7e-02 0.0106223 0.0666982
PC1 recount_qc_bc_auc_all_percent Numeric recount_qc_bc_auc_all_percent vs. PC1: r=0.229; fdr=9.8e-02 0.0164923 0.0975494
PC1 recount_qc_gene_fc_all_percent Numeric recount_qc_gene_fc_all_percent vs. PC1: r=0.23; fdr=9.8e-02 0.0159684 0.0975494
PC2 recount_qc_gene_fc_unique_percent Numeric recount_qc_gene_fc_unique_percent vs. PC2: r=-0.229; fdr=9.8e-02 0.0166195 0.0975494

2.2 Plots of significant covariates

Loop through the significant covariates and product all plots.