#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

A Statistical Framework for Joint eQTL Analysis in Multiple Tissues


Mapping expression Quantitative Trait Loci (eQTLs) represents a powerful and widely adopted approach to identifying putative regulatory variants and linking them to specific genes. Up to now eQTL studies have been conducted in a relatively narrow range of tissues or cell types. However, understanding the biology of organismal phenotypes will involve understanding regulation in multiple tissues, and ongoing studies are collecting eQTL data in dozens of cell types. Here we present a statistical framework for powerfully detecting eQTLs in multiple tissues or cell types (or, more generally, multiple subgroups). The framework explicitly models the potential for each eQTL to be active in some tissues and inactive in others. By modeling the sharing of active eQTLs among tissues, this framework increases power to detect eQTLs that are present in more than one tissue compared with “tissue-by-tissue” analyses that examine each tissue separately. Conversely, by modeling the inactivity of eQTLs in some tissues, the framework allows the proportion of eQTLs shared across different tissues to be formally estimated as parameters of a model, addressing the difficulties of accounting for incomplete power when comparing overlaps of eQTLs identified by tissue-by-tissue analyses. Applying our framework to re-analyze data from transformed B cells, T cells, and fibroblasts, we find that it substantially increases power compared with tissue-by-tissue analysis, identifying 63% more genes with eQTLs (at FDR = 0.05). Further, the results suggest that, in contrast to previous analyses of the same data, the majority of eQTLs detectable in these data are shared among all three tissues.


Vyšlo v časopise: A Statistical Framework for Joint eQTL Analysis in Multiple Tissues. PLoS Genet 9(5): e32767. doi:10.1371/journal.pgen.1003486
Kategorie: Research Article
prolekare.web.journal.doi_sk: https://doi.org/10.1371/journal.pgen.1003486

Souhrn

Mapping expression Quantitative Trait Loci (eQTLs) represents a powerful and widely adopted approach to identifying putative regulatory variants and linking them to specific genes. Up to now eQTL studies have been conducted in a relatively narrow range of tissues or cell types. However, understanding the biology of organismal phenotypes will involve understanding regulation in multiple tissues, and ongoing studies are collecting eQTL data in dozens of cell types. Here we present a statistical framework for powerfully detecting eQTLs in multiple tissues or cell types (or, more generally, multiple subgroups). The framework explicitly models the potential for each eQTL to be active in some tissues and inactive in others. By modeling the sharing of active eQTLs among tissues, this framework increases power to detect eQTLs that are present in more than one tissue compared with “tissue-by-tissue” analyses that examine each tissue separately. Conversely, by modeling the inactivity of eQTLs in some tissues, the framework allows the proportion of eQTLs shared across different tissues to be formally estimated as parameters of a model, addressing the difficulties of accounting for incomplete power when comparing overlaps of eQTLs identified by tissue-by-tissue analyses. Applying our framework to re-analyze data from transformed B cells, T cells, and fibroblasts, we find that it substantially increases power compared with tissue-by-tissue analysis, identifying 63% more genes with eQTLs (at FDR = 0.05). Further, the results suggest that, in contrast to previous analyses of the same data, the majority of eQTLs detectable in these data are shared among all three tissues.


Zdroje

1. FrazerKA, MurraySS, SchorkNJ, TopolEJ (2009) Human genetic variation and its contribution to complex traits. Nat Rev Genet 10: 241–251.

2. MontgomerySB, DermitzakisET (2011) From expression QTLs to personalized transcriptomics. Nature reviews Genetics 12: 277–282.

3. WrayGA (2007) The evolutionary significance of cis-regulatory mutations. Nature Reviews Genetics 8: 206–216.

4. CheungVG, ConlinLK, WeberTM, ArcaroM, JenKY, et al. (2003) Natural variation in human gene expression assessed in lymphoblastoid cells. Nature Genetics 33: 422–425.

5. StrangerBE, NicaAC, ForrestMS, DimasA, BirdCP, et al. (2007) Population genomics of human gene expression. Nature Genetics 39: 1217–1224.

6. GiladY, RifkinSA, PritchardJK (2008) Revealing the architecture of gene regulation: the promise of eQTL studies. Trends in genetics: TIG 24: 408–415.

7. NicaAC, MontgomerySB, DimasAS, StrangerBE, BeazleyC, et al. (2010) Candidate causal regulatory effects by integration of expression QTLs with complex trait genetic associations. PLoS Genet 6: e1000895 doi:10.1371/journal.pgen.1000895.

8. NicolaeDL, GamazonE, ZhangW, DuanS, DolanME, et al. (2010) Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet 6: e1000888 doi:10.1371/journal.pgen.1000888.

9. CooksonW, LiangL, AbecasisG, MoffattM, LathropM (2009) Mapping complex disease traits with global gene expression. Nat Rev Genet 10: 184–194.

10. GreenawaltDM, DobrinR, ChudinE, HatoumIJ, SuverC, et al. (2011) A survey of the genetics of stomach, liver, and adipose gene expression from a morbidly obese cohort. Genome Research 21: 1008–1016.

11. StegleO, PartsL, PiipariM, WinnJ, DurbinR (2012) Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat Protocols 7: 500–507.

12. DimasAS, DeutschS, StrangerBE, MontgomerySB, BorelC, et al. (2009) Common regulatory variation impacts gene expression in a cell type-dependent manner. Science 325: 1246–1250.

13. NicaAC, PartsL, GlassD, NisbetJ, BarrettA, et al. (2011) The architecture of gene regulatory variation across multiple human tissues: the MuTHER study. PLoS Genet 7: e1002003 doi:10.1371/journal.pgen.1002003.

14. DingJ, GudjonssonJE, LiangL, StuartPE, LiY, et al. (2010) Gene expression in skin and lymphoblastoid cells: Refined statistical method reveals extensive overlap in cis-eQTL signals. American Journal of Human Genetics 87: 779–789.

15. GerritsA, LiY, TessonBM, BystrykhLV, WeersingE, et al. (2009) Expression Quantitative Trait Loci Are Highly Sensitive to Cellular Differentiation State. PLoS Genet 5: e1000692 doi:10.1371/journal.pgen.1000692.

16. FuJ, WolfsMGM, DeelenP, WestraHJ, FehrmannRSN, et al. (2012) Unraveling the Regulatory Mechanisms Underlying Tissue-Dependent Genetic Variation of Gene Expression. PLoS Genet 8: e1002431 doi:10.1371/journal.pgen.1002431.

17. PetrettoE, BottoloL, LangleySR, HeinigM, McDermott-RoeC, et al. (2010) New insights into the genetic control of gene expression using a bayesian multi-tissue approach. PLoS Comput Biol 6: e1000737 doi:10.1371/journal.pcbi.1000737.

18. LiJ, TsengGC (2011) An adaptively weighted statistic for detecting differential gene expression when combining multiple transcriptomic studies. The Annals of Applied Statistics 5: 994–1019.

19. LebrecJJ, StijnenT, van HouwelingenHC (2010) Dealing with heterogeneity between cohorts in genomewide SNP association studies. Statistical Applications in Genetics and Molecular Biology 9.

20. HanB, EskinE (2011) Random-Effects Model Aimed at Discovering Associations in Meta-Analysis of Genome-wide Association Studies. The American Journal of Human Genetics 88: 586–598.

21. Wen X, Stephens M (2011). Bayesian Methods for Genetic Association Analysis with Heterogeneous Subgroups: from Meta-Analyses to Gene-Environment Interactions. URL http://arxiv.org/abs/1111.1210. 1111.1210.

22. BhattacharjeeS, RajaramanP, JacobsKB, WheelerWA, MelinBS, et al. (2012) A subset-based approach improves power and interpretation for the combined analysis of genetic association studies of heterogeneous traits. American journal of human genetics 90: 821–835.

23. HanB, EskinE (2012) Interpreting Meta-Analyses of Genome-Wide association studies. PLoS Genet 8: e1002555 doi:10.1371/journal.pgen.1002555.

24. VeyrierasJB, KudaravalliS, KimSY, DermitzakisET, GiladY, et al. (2008) High-resolution mapping of expression-QTLs yields insight into human gene regulation. PLoS Genet 4: e1000214 doi:10.1371/journal.pgen.1000214.

25. MaranvilleJC, LucaF, RichardsAL, WenX, WitonskyDB, et al. (2011) Interactions between glucocorticoid treatment and cis-regulatory polymorphisms contribute to cellular response phenotypes. PLoS Genet 7: e1002162 doi:10.1371/journal.pgen.1002162.

26. FergusonJP, ChoJH, ZhaoH (2012) A new approach for the joint analysis of multiple ChIP-seq libraries with application to histone modification. Statistical applications in genetics and molecular biology 11.

27. KassRE, RafteryAE (1995) Bayes Factors. Journal of the American Statistical Association 90: 773–795.

28. ServinB, StephensM (2007) Imputation-based analysis of association studies: candidate regions and quantitative traits. PLoS Genet 3: e114 doi:10.1371/journal.pgen.0030114.

29. GoodIJ (1992) The Bayes/Non-bayes compromise: A brief review. Journal of the American Statistical Association 87: 597–606.

30. Gelman A, Hill J (2006) Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press, 1 edition. URL http://www.worldcat.org/isbn/0521867061.

31. StoreyJD, TibshiraniR (2003) Statistical significance for genomewide studies. Proceedings of the National Academy of Sciences of the United States of America 100: 9440–9445.

32. PickrellJK, MarioniJC, PaiAA, DegnerJF, EngelhardtBE, et al. (2010) Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature 464: 768–772.

33. SunW (2012) A statistical framework for eQTL mapping using RNA-seq data. Biometrics 68: 1–11.

34. PriceAL, HelgasonA, ThorleifssonG, McCarrollSA, KongA, et al. (2011) Single-tissue and crosstissue heritability of gene expression via identity-by-descent in related or unrelated individuals. PLoS Genet 7: e1001317 doi:10.1371/journal.pgen.1001317.

35. PowellJE, HendersAK, McRaeAF, WrightMJ, MartinNG, et al. (2012) Genetic control of gene expression in whole blood and lymphoblastoid cell lines is largely independent. Genome Research 22: 456–466.

36. Wen X (2012). Bayesian Analysis of Multiway Tables in Association Studies: A Model Comparison Approach. URL http://arxiv.org/abs/1208.4621. 1208.4621.

37. LiH, DurbinR (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics (Oxford, England) 25: 1754–1760.

38. HinrichsAS, KarolchikD, BaertschR, BarberGP, BejeranoG, et al. (2006) The UCSC Genome Browser Database: update 2006. Nucleic acids research 34: D590–D598.

39. PriceAL, PattersonNJ, PlengeRM, WeinblattME, ShadickNA, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nature genetics 38: 904–909.

40. LeekJT, ScharpfRB, BravoHAC, SimchaD, LangmeadB, et al. (2010) Tackling the widespread and critical impact of batch effects in high-throughput data. Nature Reviews Genetics 11: 733–739.

Štítky
Genetika Reprodukčná medicína

Článok vyšiel v časopise

PLOS Genetics


2013 Číslo 5
Najčítanejšie tento týždeň
Najčítanejšie v tomto čísle
Kurzy

Zvýšte si kvalifikáciu online z pohodlia domova

Získaná hemofilie - Povědomí o nemoci a její diagnostika
nový kurz

Eozinofilní granulomatóza s polyangiitidou
Autori: doc. MUDr. Martina Doubková, Ph.D.

Všetky kurzy
Prihlásenie
Zabudnuté heslo

Zadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.

Prihlásenie

Nemáte účet?  Registrujte sa

#ADS_BOTTOM_SCRIPTS#