GENCODE 2021.

Authors

Frankish, A
Diekhans, M
Jungreis, I
Lagarde, J
Loveland, JE
Mudge, JM
Sisu, C
Wright, JC
Armstrong, J
Barnes, I
Berry, A
Bignell, A
Boix, C
Carbonell Sala, S
Cunningham, F
Di Domenico, T
Donaldson, S
Fiddes, IT
García Girón, C
Gonzalez, JM
Grego, T
Hardy, M
Hourlier, T
Howe, KL
Hunt, T
Izuogu, OG
Johnson, R
Martin, FJ
Martínez, L
Mohanan, S
Muir, P
Navarro, FCP
Parker, A
Pei, B
Pozo, F
Riera, FC
Ruffier, M
Schmitt, BM
Stapleton, E
Suner, M-M
Sycheva, I
Uszczynska-Ratajczak, B
Wolf, MY
Xu, J
Yang, YT
Yates, A
Zerbino, D
Zhang, Y
Choudhary, JS
Gerstein, M
Guigó, R
Hubbard, TJP
Kellis, M
Paten, B
Tress, ML
Flicek, P

Document Type

Journal Article

Date

2021-01-08

Date Accepted

2020-10-24

Abstract

The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.

Citation

Nucleic Acids Research, 2021, 49 (D1), pp. D916-D923

Source Title

Nucleic Acids Research

Publisher

OXFORD UNIV PRESS

ISSN

0305-1048

eISSN

1362-4962

Research Team

Functional Proteomics
Prote & Metabolomics Fac
