Table 2 Compact overview of the datasets

From: Next generation sequencing reads comparison with an alignment-free distance

                                            Datasets
                                            Yeast            E. coli      Human
Genome length                               12.1 Mb          4.6 Mb       3.2 Gb
Sequencing machine                          Illumina HiSeq   Roche 454    Illumina GA II
Database                                    NCBI SRA         CGU          NCBI SRA
Accession number                            ERX191563        -            SRX013970
Run id                                      ERR216898        -            SRR031057
Number of downloaded reads (N)              3,551,079        436,142      14,267,012
Avg. reads length ± st.dev                  100 ± 6          235 ± 4      75 ± 5
Total base pairs                            355.0 M          102.5 M      1.1 G
Random selection of aligned reads (rs)      54,860           100,000      183,672
Total number of selected reads (rtot)       109,720          200,000      367,344
Read pairs in each subset (rp)              1 M              200,000      1 M
Source chromosome                           chr1             -            chr1
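
Two rows of the table can be cross-checked with simple arithmetic: the total base pairs should be close to N times the average read length, and in all three columns rtot equals exactly twice rs. The sketch below transcribes the relevant values and checks both relationships. These relationships are read off the numbers and are an assumption here, not something the table states; the names `datasets`, `estimated_total_bp`, and `summarize` are hypothetical.

```python
# Values transcribed from Table 2. The relationships checked below are
# inferred from the numbers (an assumption), not stated in the table.
datasets = {
    "Yeast":   {"n_reads": 3_551_079,  "avg_len": 100, "rs": 54_860,  "rtot": 109_720},
    "E. coli": {"n_reads": 436_142,    "avg_len": 235, "rs": 100_000, "rtot": 200_000},
    "Human":   {"n_reads": 14_267_012, "avg_len": 75,  "rs": 183_672, "rtot": 367_344},
}


def estimated_total_bp(n_reads, avg_len):
    """Approximate total base pairs as read count times average read length."""
    return n_reads * avg_len


def summarize(table):
    """Return, per dataset, the estimated base pairs and whether rtot == 2 * rs."""
    out = {}
    for name, d in table.items():
        out[name] = {
            "estimated_bp": estimated_total_bp(d["n_reads"], d["avg_len"]),
            "rtot_is_twice_rs": d["rtot"] == 2 * d["rs"],
        }
    return out


summary = summarize(datasets)
for name, s in summary.items():
    print(f"{name}: ~{s['estimated_bp'] / 1e6:.1f} Mbp, "
          f"rtot == 2*rs: {s['rtot_is_twice_rs']}")
```

Because the average read lengths are rounded, the estimates agree with the "Total base pairs" row only approximately.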