Workflow of the EST analysis pipeline. A. The cleaning step consists of base calling (Phred), vector masking (cross_match), contaminant trimming (SeqClean) and repeat masking (RepeatMasker). B. The cleaned EST sequences are passed to the next step, clustering and assembly (TGICL). C. The CsAEs sequences are aligned against various public databases to assign putative functions, and structural variation is predicted with AutoSNP and TRF.
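
As a reading aid, the stage order in this caption can be written down as a small data structure. The sketch below is illustrative only: `Step`, `WORKFLOW`, `steps_in_panel` and `tools_in_order` are hypothetical names, no tool is actually invoked, and the tool entry for the database alignment is left empty because the caption does not name one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    panel: str    # figure panel label: "A", "B" or "C"
    task: str     # what the step does, as worded in the caption
    tools: tuple  # tool names given in the caption (may be empty)


# Steps listed in the order the caption describes them.
WORKFLOW = [
    Step("A", "base calling", ("Phred",)),
    Step("A", "vector masking", ("cross_match",)),
    Step("A", "contaminant trimming", ("SeqClean",)),
    Step("A", "repeat masking", ("RepeatMasker",)),
    Step("B", "clustering and assembly", ("TGICL",)),
    # Assumption: the caption names no alignment tool, so none is listed.
    Step("C", "database alignment for putative function", ()),
    Step("C", "structural variation prediction", ("AutoSNP", "TRF")),
]


def steps_in_panel(panel):
    """Return the task names belonging to one figure panel, in order."""
    return [step.task for step in WORKFLOW if step.panel == panel]


def tools_in_order():
    """Return every named tool in the order it appears in the workflow."""
    ordered = []
    for step in WORKFLOW:
        ordered.extend(step.tools)
    return ordered


if __name__ == "__main__":
    for panel in ("A", "B", "C"):
        print(panel, steps_in_panel(panel))
    print(tools_in_order())
```

Keeping the panel label on each step makes it easy to check the listing against the figure, one panel at a time.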