Table 1 The content of the datasets and the query lists used for PRIDE testing

From: Progress in the PRIDE technique for rapidly comparing protein three-dimensional structures

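Columns E* α through Total give the number of domains in the query list.

| Dataset | Number of domains in the dataset | Number of histograms used for the domain structure representation | E* α | E* β | E* α/β | D** α | D** β | D** α/β | Total |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 29 098 | > 30 | 24 | 25 | 25 | 25 | 25 | 25 | 149 |
| 2 | 4 937 | 10 – 30 | 6 | 6 | 6 | 8 | 8 | 8 | 42 |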

  1. *E denotes the "easy" cases, in which the queries belong to highly populated groups of the investigated datasets, containing at least 50 domains at the homologous superfamily classification level of CATH.
  2. **D denotes the "difficult" cases, in which the queries belong to small groups with no more than 3 domains at the homologous superfamily classification level of CATH.
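
The two footnote criteria amount to a threshold rule on superfamily size, which can be sketched as follows. This is only an illustration, not the authors' selection procedure: the function name `classify_queries`, the input mapping, and the superfamily codes in the usage note are hypothetical, and the sketch assumes the group size counts every domain in the dataset assigned to the same homologous superfamily, including the query itself.

```python
from collections import Counter

EASY_MIN = 50      # footnote *: at least 50 domains in the superfamily
DIFFICULT_MAX = 3  # footnote **: no more than 3 domains in the superfamily


def classify_queries(superfamily_of, queries):
    """Label each query "E" (easy), "D" (difficult), or None (neither).

    superfamily_of: dict mapping a domain id to its homologous
                    superfamily code (hypothetical input format).
    queries:        iterable of domain ids present in superfamily_of.
    """
    # Count how many domains in the dataset fall into each superfamily.
    sizes = Counter(superfamily_of.values())
    labels = {}
    for q in queries:
        n = sizes[superfamily_of[q]]
        if n >= EASY_MIN:
            labels[q] = "E"
        elif n <= DIFFICULT_MAX:
            labels[q] = "D"
        else:
            labels[q] = None  # mid-sized group: matches neither footnote
    return labels
```

For example, with a toy dataset in which one superfamily holds 50 domains and another holds 2, `classify_queries` labels a query from the first as `"E"` and a query from the second as `"D"`; a query from a group of 10 domains gets `None`.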