Skip to main content

Table 2 Performance evaluation of the optimized SIMT and partitioned vectorized algorithms on GTX 295

From: CUDASW++2.0: enhanced Smith-Waterman protein database search on CUDA-enabled GPUs based on SIMT and virtualized SIMD abstractions

Query Sequences

Partitioned

SIMT

  

10-2 k

20-2 k

40-3 k

10-2 k

Query

Length

Time

GCUPS

Time

GCUPS

Time

GCUPS

Time

GCUPS

P02232

144

1.19

17.7

1.13

18.7

1.09

19.4

1.02

20.7

P05013

189

1.34

20.7

1.30

21.4

1.26

22.1

1.25

22.3

P14942

222

1.49

22.0

1.41

23.1

1.38

23.7

1.37

23.8

P07327

375

2.77

19.9

2.58

21.4

2.42

22.8

2.15

25.7

P01008

464

3.04

22.4

2.82

24.2

2.66

25.6

2.54

26.8

P03435

567

3.93

21.2

3.61

23.1

3.49

23.9

3.11

26.8

P42357

657

4.29

22.5

4.02

24.0

3.87

25.0

3.56

27.1

P21177

729

4.53

23.7

4.22

25.4

4.04

26.5

3.90

27.5

Q38941

850

5.03

24.9

4.66

26.8

4.63

27.0

4.53

27.6

P27895

1000

6.58

22.3

5.87

25.1

5.38

27.3

5.21

28.2

P07756

1500

9.86

22.4

9.19

24.0

8.58

25.7

7.72

28.6

P04775

2005

12.26

24.1

11.32

26.0

10.79

27.3

10.26

28.7

P19096

2504

14.32

25.7

13.34

27.6

12.99

28.4

12.79

28.8

P28167

3005

18.31

24.1

16.46

26.9

15.56

28.4

15.33

28.8

P0C6B8

3564

21.09

24.9

19.34

27.1

17.99

29.1

18.20

28.8

P20930

4061

26.75

22.3

23.35

25.6

20.76

28.8

20.77

28.8

P08519

4548

27.36

24.4

25.11

26.6

23.92

28.0

23.24

28.8

Q7TMA5

4743

25.86

27.0

23.57

29.6

23.51

29.7

24.24

28.8

P33450

5147

32.69

23.2

30.57

24.8

27.37

27.7

26.33

28.7

Q9UKN1

5478

36.61

22.0

32.40

24.9

28.88

27.9

28.05

28.7