Skip to main content

Table 1 Number and length of known human nuclear protein-coding genes and protein-coding transcripts (mRNAs)

From: Human protein-coding genes and gene feature statistics in 2019

  Protein-coding genesa mRNAsb
Number
 Total entries 19,116 49,632
 Median N/A N/A
 Mean Per chr: 797 N/A
 SD N/A N/A
 Min chrY: 47
chr21: 228
N/A
 Max chr1: 1952 N/A
Length
 Median 26,018 bp 2938 bp
 Mean 66,646 bp 3522 bp
 SD 131,781 bp 2557 bp
 Shortest 189 bp (KRTAP6-2, chr21) 186 bp (DEFB133, chr6)
 Longest 2,473,592 bp (RBFOX1, chr16) 109,224 bp (TTN, chr2)
 Total 1,274,002,474 bp 174,797,813 bp
  1. SD standard deviation, chr chromosome, min minimum, max maximum, bp base pair
  2. aValues of protein-coding genes have been calculated exploiting Excel functions in Genes.xlsx file containing data exported from GeneBase “Genes” and “Gene_Summary” tables (records retrieved searching for nuclear protein-coding gene type and REVIEWED or VALIDATED gene RefSeq status and REVIEWED or VALIDATED transcript RefSeq status, excluding records annotated as “not in current annotation release”). Min and max number of genes per chr were derived using filter function in the Excel Genes.xlsx file. Mean number per chr has been calculated dividing the total number of genes by 24 (22 autosomes, chrX and chrY)
  3. bValues were calculated exploiting Excel functions in Transcripts.xlsx file containing data exported from GeneBase “Transcripts” table (retrieved records with a VALIDATED or REVIEWED RefSeq status with an “NM_” type of corresponding RefSeq RNA accession number belonging to genes with a VALIDATED or REVIEWED RefSeq status, excluding “not in current annotation release” records). The gene locations have been retrieved manually from GeneBase “Gene_Summary” table. N/A: not applicable