Table 1 List of simulation data and clustering results from two algorithms

From: Mini-clusters with mean probabilities for identifying effective siRNAs

  Results
ID Sequences P1 P2 P3 P4 P5 Q(4) RMP-MiC K-means
Group X
a1 UAAUC 0.75 0.75 0.25 0.25 0.25 0.0088 1 1
a2 UACCG 0.75 0.75 0.25 0.25 0.25 0.0088 1 1
a3 UAGAA 0.75 0.75 0.25 0.25 0.25 0.0088 1 1
a4 UUCCG 0.75 0.25 0.25 0.25 0.25 0.0029 2 2
a5 UUGAA 0.75 0.25 0.25 0.25 0.25 0.0029 2 2
a6 UUUGU 0.75 0.25 0.25 0.25 0.25 0.0029 2 2
a7 CGAUC 0.25 1 0.25 0.25 0.25 0.0039 3 2
a8 CGCCG 0.25 1 0.25 0.25 0.25 0.0039 3 2
a9 CGGAA 0.25 1 0.25 0.25 0.25 0.0039 3 2
a10 CGUGU 0.25 1 0.25 0.25 0.25 0.0039 3 2
Group Y  
b1 AACGA 0 0 0.25 0.25 0.25 0 4 2
b2 AUGGA 0 0 0.25 0.25 0.25 0 4 2
b3 UCAGC 0.75 0 0.25 0.25 0.25 0 4 2
b4 UGUUC 0.75 0 0.25 0.25 0.25 0 4 2
b5 UCCUG 0.75 0 0.25 0.25 0.25 0 4 2
b6 CCAAA 0.25 0 0.25 0.25 0.25 0 4 2
b7 CCUAC 0.25 0 0.25 0.25 0.25 0 4 2
  1. P1 is the probability of the leftmost nucleotide. Pi (i = 2, 3, 4, 5) is the conditional probability at the i-th position. Q(4) is the relative mean probability of the sequence.
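
As a quick consistency check on the table, the tabulated Q(4) values agree, to the four decimal places shown, with the plain product P1 × P2 × P3 × P4 × P5 of each row. This is an observation about the printed numbers, not the paper's definition of the relative mean probability, which is not given in this table. The sketch below uses a handful of rows transcribed from Table 1; the helper name `chain_product` and the tolerance are assumptions made for illustration.

```python
from math import isclose, prod

# A few rows transcribed from Table 1: (id, [P1..P5], tabulated Q(4)).
ROWS = [
    ("a1", [0.75, 0.75, 0.25, 0.25, 0.25], 0.0088),
    ("a4", [0.75, 0.25, 0.25, 0.25, 0.25], 0.0029),
    ("a7", [0.25, 1.0, 0.25, 0.25, 0.25], 0.0039),
    ("b1", [0.0, 0.0, 0.25, 0.25, 0.25], 0.0),
    ("b3", [0.75, 0.0, 0.25, 0.25, 0.25], 0.0),
]


def chain_product(probs):
    """Multiply P1 by the conditional probabilities P2..P5 (hypothetical helper)."""
    return prod(probs)


# Collect rows whose product differs from the tabulated value by more than
# the rounding error implied by four decimal places (assumed tolerance).
mismatches = [
    row_id
    for row_id, probs, q in ROWS
    if not isclose(chain_product(probs), q, abs_tol=5e-5)
]

for row_id, probs, q in ROWS:
    print(row_id, f"{chain_product(probs):.4f}", q)
print("mismatches:", mismatches)
```

Any row with P2 = 0, as in all of Group Y, gives a product of exactly 0, which matches the Q(4) column for b1 to b7.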