Skip to main content

Table 1 Number and length of known human nuclear protein-coding genes and protein-coding transcripts (mRNAs)

From: Human protein-coding genes and gene feature statistics in 2019

 

Protein-coding genesa

mRNAsb

Number

 Total entries

19,116

49,632

 Median

N/A

N/A

 Mean

Per chr: 797

N/A

 SD

N/A

N/A

 Min

chrY: 47

chr21: 228

N/A

 Max

chr1: 1952

N/A

Length

 Median

26,018 bp

2938 bp

 Mean

66,646 bp

3522 bp

 SD

131,781 bp

2557 bp

 Shortest

189 bp (KRTAP6-2, chr21)

186 bp (DEFB133, chr6)

 Longest

2,473,592 bp (RBFOX1, chr16)

109,224 bp (TTN, chr2)

 Total

1,274,002,474 bp

174,797,813 bp

  1. SD standard deviation, chr chromosome, min minimum, max maximum, bp base pair
  2. aValues of protein-coding genes have been calculated exploiting Excel functions in Genes.xlsx file containing data exported from GeneBase “Genes” and “Gene_Summary” tables (records retrieved searching for nuclear protein-coding gene type and REVIEWED or VALIDATED gene RefSeq status and REVIEWED or VALIDATED transcript RefSeq status, excluding records annotated as “not in current annotation release”). Min and max number of genes per chr were derived using filter function in the Excel Genes.xlsx file. Mean number per chr has been calculated dividing the total number of genes by 24 (22 autosomes, chrX and chrY)
  3. bValues were calculated exploiting Excel functions in Transcripts.xlsx file containing data exported from GeneBase “Transcripts” table (retrieved records with a VALIDATED or REVIEWED RefSeq status with an “NM_” type of corresponding RefSeq RNA accession number belonging to genes with a VALIDATED or REVIEWED RefSeq status, excluding “not in current annotation release” records). The gene locations have been retrieved manually from GeneBase “Gene_Summary” table. N/A: not applicable