Table 1 Number of reads eliminated at different steps of the CANGS DB sequence processing pipeline

From: CANGS DB: a stand-alone web-based database tool for processing, managing and analyzing 454 data in biodiversity studies

| Order of steps | Steps | Total no. of sequences | No. of sequences considered | No. of sequences discarded |
|---|---|---|---|---|
| 1 | Removal of Adapter B | 447,909 | 373,116 | 74,793 |
| 2 | Filtering sequences with ambiguities | 373,116 | 357,926 | 15,190 |
| 3 | Removal of singletons | 357,926 | 311,425 | 46,501 |
| 4 | Grouping of sequences according to bar codes | 311,425 | 306,042 | 5,383 |
| 5 | Filtering sequences according to length threshold | 306,042 | 305,884 | 158 |
| 6 | Removal of PCR primers | 305,884 | 282,053 | 23,831 |
| 7 | Quality filtering | 282,053 | 281,003 | 1,050 |
|  | Total Sequences | 447,909 | 281,003 | 166,906 |
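
The counts in Table 1 are internally consistent: each step starts from the number of sequences kept by the previous step, and the per-step discards add up to the overall total. The following Python sketch cross-checks that arithmetic. The names `STEPS` and `check_table` are illustrative only and are not part of CANGS DB; the numbers are copied from the table.

```python
# Rows of Table 1 as (step, total sequences, sequences considered, sequences discarded).
STEPS = [
    ("Removal of Adapter B", 447_909, 373_116, 74_793),
    ("Filtering sequences with ambiguities", 373_116, 357_926, 15_190),
    ("Removal of singletons", 357_926, 311_425, 46_501),
    ("Grouping of sequences according to bar codes", 311_425, 306_042, 5_383),
    ("Filtering sequences according to length threshold", 306_042, 305_884, 158),
    ("Removal of PCR primers", 305_884, 282_053, 23_831),
    ("Quality filtering", 282_053, 281_003, 1_050),
]


def check_table(steps):
    """Verify each row and return (input reads, reads kept, reads discarded)."""
    for i, (name, total, kept, discarded) in enumerate(steps):
        # Within a row: total = kept + discarded.
        assert total - kept == discarded, f"row mismatch: {name}"
        # Between rows: each step starts from what the previous step kept.
        if i > 0:
            assert total == steps[i - 1][2], f"chain mismatch: {name}"
    input_reads = steps[0][1]
    final_kept = steps[-1][2]
    total_discarded = sum(row[3] for row in steps)
    # The summed discards must equal the overall difference.
    assert input_reads - final_kept == total_discarded
    return input_reads, final_kept, total_discarded


overall = check_table(STEPS)
print(overall)  # (447909, 281003, 166906)
print(f"{overall[1] / overall[0]:.1%} of reads retained")  # 62.7% of reads retained
```

The returned tuple matches the "Total Sequences" row of the table, so roughly 63% of the initial reads pass all seven steps.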