Elimination of reference mapping bias reveals robust immune related allele-specific expression in cross-bred sheep
Salavati M., Bush S., Palma-Vera S., McCulloch MEB., Hume D., Clark E.
Abstract Pervasive allelic variation at both gene and single nucleotide level (SNV) between individuals is commonly associated with complex traits in humans and animals. Allele-specific expression (ASE) analysis, using RNA-Seq, can provide a detailed annotation of allelic imbalance and infer the existence of cis-acting transcriptional regulation. However, variant detection in RNA-Seq data is compromised by biased mapping of reads to the reference DNA sequence. In this manuscript we describe an unbiased standardised computational pipeline for allele-specific expression analysis using RNA-Seq data, which we have adapted and developed using tools available under open licence. The analysis pipeline we present is designed to minimise reference bias while providing accurate profiling of allele-specific expression across tissues and cell types. Using this methodology, we were able to profile pervasive allelic imbalance across tissues and cell types, at both the gene and SNV level, in Texel x Scottish Blackface sheep, using the sheep gene expression atlas dataset. ASE profiles were pervasive in each sheep and across all tissue types investigated. However, ASE profiles shared across tissues were limited and instead they tended to be highly tissue-specific. These tissue-specific ASE profiles may underlie the expression of economically important traits and could be utilized as weighted SNVs, for example, to improve the accuracy of genomic selection in breeding programmes for sheep. An additional benefit of the pipeline is that it does not require parental genotypes and can therefore be applied to other RNA-Seq datasets for livestock, including those available on the Functional Annotation of Animal Genomes (FAANG) data portal. This study is the first global characterisation of moderate to extreme ASE in tissues and cell types from sheep. We have applied a robust methodology for ASE profiling, to provide both a novel analysis of the multi-dimensional sheep gene expression atlas dataset, and a foundation for identifying the regulatory and expressed elements of the genome that are driving complex traits in livestock.