The EN-TEx resource of multi-tissue personal epigenomes & variant-impact models.
View/ Open
Date
2023-03-30Author
Rozowsky, J
Gao, J
Borsari, B
Yang, YT
Galeev, T
Gürsoy, G
Epstein, CB
Xiong, K
Xu, J
Li, T
Liu, J
Yu, K
Berthel, A
Chen, Z
Navarro, F
Sun, MS
Wright, J
Chang, J
Cameron, CJF
Shoresh, N
Gaskell, E
Drenkow, J
Adrian, J
Aganezov, S
Aguet, F
Balderrama-Gutierrez, G
Banskota, S
Corona, GB
Chee, S
Chhetri, SB
Cortez Martins, GC
Danyko, C
Davis, CA
Farid, D
Farrell, NP
Gabdank, I
Gofin, Y
Gorkin, DU
Gu, M
Hecht, V
Hitz, BC
Issner, R
Jiang, Y
Kirsche, M
Kong, X
Lam, BR
Li, S
Li, B
Li, X
Lin, KZ
Luo, R
Mackiewicz, M
Meng, R
Moore, JE
Mudge, J
Nelson, N
Nusbaum, C
Popov, I
Pratt, HE
Qiu, Y
Ramakrishnan, S
Raymond, J
Salichos, L
Scavelli, A
Schreiber, JM
Sedlazeck, FJ
See, LH
Sherman, RM
Shi, X
Shi, M
Sloan, CA
Strattan, JS
Tan, Z
Tanaka, FY
Vlasova, A
Wang, J
Werner, J
Williams, B
Xu, M
Yan, C
Yu, L
Zaleski, C
Zhang, J
Ardlie, K
Cherry, JM
Mendenhall, EM
Noble, WS
Weng, Z
Levine, ME
Dobin, A
Wold, B
Mortazavi, A
Ren, B
Gillis, J
Myers, RM
Snyder, MP
Choudhary, J
Milosavljevic, A
Schatz, MC
Bernstein, BE
Guigó, R
Gingeras, TR
Gerstein, M
Type
Journal Article
Metadata
Show full item recordAbstract
Understanding how genetic variants impact molecular phenotypes is a key goal of functional genomics, currently hindered by reliance on a single haploid reference genome. Here, we present the EN-TEx resource of 1,635 open-access datasets from four donors (∼30 tissues × ∼15 assays). The datasets are mapped to matched, diploid genomes with long-read phasing and structural variants, instantiating a catalog of >1 million allele-specific loci. These loci exhibit coordinated activity along haplotypes and are less conserved than corresponding, non-allele-specific ones. Surprisingly, a deep-learning transformer model can predict the allele-specific activity based only on local nucleotide-sequence context, highlighting the importance of transcription-factor-binding motifs particularly sensitive to variants. Furthermore, combining EN-TEx with existing genome annotations reveals strong associations between allele-specific and GWAS loci. It also enables models for transferring known eQTLs to difficult-to-profile tissues (e.g., from skin to heart). Overall, EN-TEx provides rich data and generalizable models for more accurate personal functional genomics.
Collections
Subject
ENCODE
GTEx
allele-specific activity
eQTLs
functional epigenomes
functional genomics
genome annotations
personal genome
predictive models
structural variants
tissue specificity
transformer model
Epigenome
Quantitative Trait Loci
Genome-Wide Association Study
Genomics
Phenotype
Polymorphism, Single Nucleotide
Research team
Functional Proteomics
Prote & Metabolomics Fac
Language
eng
Date accepted
2023-02-10
License start date
2023-03-30
Citation
Cell, 2023, 186 (7), pp. 1493 - 1511.e40
Publisher
Elsevier BV
Except where otherwise noted, this item's license is described
as
http://creativecommons.org/licenses/by/4.0/
Related items
Showing items related by title, author, creator and subject.
-
Association analyses of more than 140,000 men identify 63 new prostate cancer susceptibility loci.
Schumacher, FR; Al Olama, AA; Berndt, SI; Benlloch, S; Ahmed, M; et al. (2018-07)Genome-wide association studies (GWAS) and fine-mapping efforts to date have identified more than 100 prostate cancer (PrCa)-susceptibility loci. We meta-analyzed genotype data from a custom high-density array of 46,939 ... -
Pan-cancer analysis of whole genomes.
ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium (2020-02-05)Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale 1-3 . Here we report the integrative analysis of ... -
Perspectives on ENCODE.
ENCODE Project Consortium; Snyder, MP; Gingeras, TR; Moore, JE; Weng, Z; et al. (2020-07-29)The Encylopedia of DNA Elements (ENCODE) Project launched in 2003 with the long-term goal of developing a comprehensive map of functional elements in the human genome. These included genes, biochemical regions associated ...