
Table 2 Compact overview of the datasets

From: Next generation sequencing reads comparison with an alignment-free distance

 

| Dataset | Yeast | E. coli | Human |
|---|---|---|---|
| Genome length | 12.1 Mb | 4.6 Mb | 3.2 Gb |
| Sequencing machine | Illumina HiSeq | Roche 454 | Illumina GA II |
| Database | NCBI SRA | CGU | NCBI SRA |
| Accession number | ERX191563 | - | SRX013970 |
| Run id | ERR216898 | - | SRR031057 |
| Number of downloaded reads (N) | 3,551,079 | 436,142 | 14,267,012 |
| Avg. read length ± st. dev. (bp) | 100 ± 6 | 235 ± 4 | 75 ± 5 |
| Total base pairs | 355.0 M | 102.5 M | 1.1 G |
| Random selection of aligned reads (rs) | 54,860 | 100,000 | 183,672 |
| Total number of selected reads (rtot) | 109,720 | 200,000 | 367,344 |
| Read pairs in each subset (rp) | 1 M | 200,000 | 1 M |
| Source chromosome | chr1 | - | chr1 |
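
The figures in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not taken from the article: it assumes that the total base pairs are roughly N times the average read length, and it tests whether rtot equals twice rs, a pattern visible in the reported values whose rationale the table itself does not state. The dictionary keys, the helper names, and the 5% tolerance are hypothetical choices made for this example.

```python
# Values copied from the table; the relationships tested are assumptions,
# not definitions given in the source.
datasets = {
    "Yeast": {"n_reads": 3_551_079, "avg_len": 100, "total_bp": 355.0e6,
              "rs": 54_860, "rtot": 109_720},
    "E. coli": {"n_reads": 436_142, "avg_len": 235, "total_bp": 102.5e6,
                "rs": 100_000, "rtot": 200_000},
    "Human": {"n_reads": 14_267_012, "avg_len": 75, "total_bp": 1.1e9,
              "rs": 183_672, "rtot": 367_344},
}


def relative_error(estimate, reported):
    """Absolute difference between estimate and reported, relative to reported."""
    return abs(estimate - reported) / reported


def check(row, tolerance=0.05):
    """Compare N * average read length with the reported total base pairs,
    and test whether rtot is exactly twice rs."""
    estimated_bp = row["n_reads"] * row["avg_len"]
    return {
        "estimated_bp": estimated_bp,
        "bp_consistent": relative_error(estimated_bp, row["total_bp"]) <= tolerance,
        "rtot_is_twice_rs": row["rtot"] == 2 * row["rs"],
    }


results = {name: check(row) for name, row in datasets.items()}
for name, result in results.items():
    print(name, result)
```

Under these assumptions, all three datasets pass both checks. The tolerance is deliberately loose because the table rounds the total base pairs (for example, 1.1 G for the human dataset).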