Skip to main content

Table 5 Base substitutions

From: Empirical assessment of sequencing errors for high throughput pyrosequencing data

Newbler
  A C G T
A 0 (0%) 74588 (11.2%) 63823 (9.58%) 92787 (13.93%)
C 26505 (3.98%) 0 (0%) 49970 (7.5%) 41064 (6.17%)
G 23442 (3.52%) 48670 (7.31%) 0 (0%) 51326 (7.71%)
T 53587 (8.05%) 72148 (10.83%) 68059 (10.22%) 0 (%)
Total substitutions: 665,969
Celera
  A C G T
A 0 (0%) 314368 (9.48%) 298435 (9%) 454647 (13.71%)
C 215248 (6.49%) 0 (0%) 155494 (4.69%) 220471 (6.65%)
G 214762 (6.48%) 155015 (4.67%) 0 (0%) 214063 (6.46%)
T 449989 (13.57%) 310634 (9.37%) 312819 (9.43%) 0 (0%)
Total substitutions: 3,315,945
  1. The cell at row x and column y indicates the number of occurrences of the alignment column x y in the data set, that is, the number of times the base x was substituted by y. Close to this number, in parentheses, is the percentage of the value relative to the total number of substitutions.