
Table 5 Base substitutions

From: Empirical assessment of sequencing errors for high throughput pyrosequencing data

Newbler

| From \ To | A | C | G | T |
|---|---|---|---|---|
| A | 0 (0%) | 74588 (11.2%) | 63823 (9.58%) | 92787 (13.93%) |
| C | 26505 (3.98%) | 0 (0%) | 49970 (7.5%) | 41064 (6.17%) |
| G | 23442 (3.52%) | 48670 (7.31%) | 0 (0%) | 51326 (7.71%) |
| T | 53587 (8.05%) | 72148 (10.83%) | 68059 (10.22%) | 0 (0%) |

Total substitutions: 665,969

Celera

| From \ To | A | C | G | T |
|---|---|---|---|---|
| A | 0 (0%) | 314368 (9.48%) | 298435 (9%) | 454647 (13.71%) |
| C | 215248 (6.49%) | 0 (0%) | 155494 (4.69%) | 220471 (6.65%) |
| G | 214762 (6.48%) | 155015 (4.67%) | 0 (0%) | 214063 (6.46%) |
| T | 449989 (13.57%) | 310634 (9.37%) | 312819 (9.43%) | 0 (0%) |

Total substitutions: 3,315,945

  1. The cell at row x and column y gives the number of occurrences of the alignment column (x, y) in the data set, that is, the number of times the base x was substituted by y. The value in parentheses next to each count is that count as a percentage of the total number of substitutions.
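
The percentage described in the note can be recomputed from the raw counts. The following is a minimal sketch in Python using the Newbler counts from Table 5; the names `newbler` and `substitution_percentages` are illustrative and do not come from the original study.

```python
BASES = "ACGT"

# Newbler counts from Table 5: newbler[x][y] is the number of times
# base x was substituted by base y (hypothetical variable name).
newbler = {
    "A": {"A": 0, "C": 74588, "G": 63823, "T": 92787},
    "C": {"A": 26505, "C": 0, "G": 49970, "T": 41064},
    "G": {"A": 23442, "C": 48670, "G": 0, "T": 51326},
    "T": {"A": 53587, "C": 72148, "G": 68059, "T": 0},
}


def substitution_percentages(counts):
    """Return the total substitution count and each cell as a percentage of it."""
    total = sum(counts[x][y] for x in BASES for y in BASES)
    pct = {
        x: {y: 100.0 * counts[x][y] / total for y in BASES}
        for x in BASES
    }
    return total, pct


total, pct = substitution_percentages(newbler)
print(total)                     # 665969
print(round(pct["A"]["T"], 2))   # 13.93
```

Summing the twelve off-diagonal cells reproduces the reported total of 665,969, and dividing any cell by that total gives the value shown in parentheses.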