Options
The Protein‐Coding Human Genome: Annotating High‐Hanging Fruits
ISSN
1521-1878
0265-9247
Date Issued
2019
Author(s)
DOI
10.1002/bies.201900066
Abstract
The major transcript variants of human protein-coding genes are annotated to a certain degree of accuracy combining manual curation, transcript data, and proteomics evidence. However, there is considerable disagreement on the annotation of about 2000 genes-they can be protein-coding, noncoding, or pseudogenes-and on the annotation of most of the predicted alternative transcripts. Pure transcriptome mapping approaches seem to be limited in discriminating functional expression from noise. These limitations have partially been overcome by dedicated algorithms to detect alternative spliced micro-exons and wobble splice variants. Recently, knowledge about splice mechanism and protein structure are incorporated into an algorithm to predict neighboring homologous exons, often spliced in a mutually exclusive manner. Predicted exons are evaluated by transcript data, structural compatibility, and evolutionary conservation, revealing hundreds of novel coding exons and splice mechanism re-assignments. The emerging human pan-genome is necessitating distinctive annotations incorporating differences between individuals and between populations.
Subjects
File(s)
No Thumbnail Available
Name
Hatje_et_al-2019-BioEssays.pdf
Size
464.19 KB
Checksum (MD5)
f0f8773ade64a71db6b14507cbc8ed32