BMC Research Notes

Table 5 Base substitutions

From: Empirical assessment of sequencing errors for high throughput pyrosequencing data

Newbler
	A	C	G	T
A	0 (0%)	74588 (11.2%)	63823 (9.58%)	92787 (13.93%)
C	26505 (3.98%)	0 (0%)	49970 (7.5%)	41064 (6.17%)
G	23442 (3.52%)	48670 (7.31%)	0 (0%)	51326 (7.71%)
T	53587 (8.05%)	72148 (10.83%)	68059 (10.22%)	0 (%)
Total substitutions: 665,969
Celera
	A	C	G	T
A	0 (0%)	314368 (9.48%)	298435 (9%)	454647 (13.71%)
C	215248 (6.49%)	0 (0%)	155494 (4.69%)	220471 (6.65%)
G	214762 (6.48%)	155015 (4.67%)	0 (0%)	214063 (6.46%)
T	449989 (13.57%)	310634 (9.37%)	312819 (9.43%)	0 (0%)
Total substitutions: 3,315,945

The cell at row x and column y indicates the number of occurrences of the alignment column $\binom{x}{y}$ in the data set, that is, the number of times the base x was substituted by y. Close to this number, in parentheses, is the percentage of the value relative to the total number of substitutions.

Back to article page

ISSN: 1756-0500

Contact us

Submission enquiries: bmcresearchnotes@biomedcentral.com
General enquiries: ORSupport@springernature.com