Skip to main content

Table 2 Assembly data

From: Empirical assessment of sequencing errors for high throughput pyrosequencing data

Data id.

Assembler

Nb. of

Assembly

N50 / N80

  

contigs

size

 

MH

Newbler

102

981,106

27,323 / 10,189

 

Celera

282

1,011,888

10,963 / 4,287

SA

Newbler

66

2,971,290

143,335 / 56,013

 

Celera

691

3,162,318

52,376 / 17,400

SP

Newbler

178

2,223,061

30,556 / 18,166

 

Celera

671

2,331,216

15,133 / 6,442

  1. Genome assemblies data summary. ‘Assembly size’ corresponds to the overall sum of the lengths of the contigs. The N50 (respectively N80) value corresponds to the largest contig length L such that the contigs of length ≥L contain at least 50% (resp. 80%) of the bases in the assembly.