Diff for "ClusteringAlgorithms"

Differences between revisions 1 and 3 (spanning 2 versions)


Cluster analysis is a statistical technique used to generate a category structure. The groups that are formed should have a high degree of association between members of the same group and a low degree of association between members of different groups.

Similarity Measures:

               2C
 S         = -------
  (Di,Dj)     A + B

   where C is the number of terms that Di and Dj have in common,
   and A and B are the numbers of terms in Di and Dj, respectively.
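
The measure above can be sketched in a few lines of Python. This is only an illustration, not part of the original page: the function name `dice_similarity` is made up, documents are assumed to be lists of term strings, "number of terms" is assumed to mean the number of distinct terms, and two empty documents are assumed to have similarity 0.

```python
def dice_similarity(doc_i, doc_j):
    """Return 2C / (A + B) for two documents given as iterables of terms."""
    # Assumption: each term counts once per document (distinct terms).
    terms_i, terms_j = set(doc_i), set(doc_j)
    a, b = len(terms_i), len(terms_j)   # A and B: terms in Di and Dj
    if a + b == 0:
        return 0.0                      # assumption: empty vs. empty scores 0
    c = len(terms_i & terms_j)          # C: terms the documents share
    return 2 * c / (a + b)


d1 = "cluster analysis groups similar documents".split()
d2 = "cluster analysis is a statistical technique".split()
print(dice_similarity(d1, d2))  # C = 2, A = 5, B = 6, so 4/11
```

The value is 1 when both documents contain exactly the same terms and 0 when they share none.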

The similarity matrix holds the similarity measure Sxy for every pair of documents x and y. Because Sxy = Syx, only the lower triangle needs to be stored:

  | S21                 |
  | S31  S32            |
  |  ...                |
  | SN1  SN2 ...SN(N-1) |
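
As a rough sketch, the lower-triangular layout above could be built as follows. The names `dice` and `similarity_matrix` are hypothetical, documents are assumed to be lists of term strings, and the similarity used is the 2C / (A + B) measure counted over distinct terms.

```python
def dice(doc_x, doc_y):
    """2C / (A + B) over distinct terms; 0.0 if both documents are empty."""
    tx, ty = set(doc_x), set(doc_y)
    total = len(tx) + len(ty)
    return 2 * len(tx & ty) / total if total else 0.0


def similarity_matrix(docs):
    """Return the lower triangle as a list of rows.

    Row k (0-based) corresponds to document k + 2 in the 1-based notation
    above and holds S(k+2, 1) ... S(k+2, k+1).
    """
    rows = []
    for x in range(1, len(docs)):
        # Compare document x only with the documents that come before it.
        rows.append([dice(docs[x], docs[y]) for y in range(x)])
    return rows


docs = [["a", "b"], ["b", "c"], ["a", "b", "c"]]
for row in similarity_matrix(docs):
    print(row)
```

For N documents this stores N(N-1)/2 values instead of N squared, since the diagonal and the upper triangle carry no extra information.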


-  Revision 1 as of 2004-04-09 00:11:22
  Size: 334
  Editor: yakko
  Comment:
+  Revision 3 as of 2004-04-09 15:20:02
  Size: 755
  Editor: yakko
  Comment:
 Line 5:
+Similarity Measures:

{{{
               2C
 S         = -------
  (Di,Dj)     A + B

   where C is the number of terms that Di and Dj have in common,
   and A and B are the numbers of terms in Di and Dj, respectively.
}}}

The similarity matrix holds the similarity measure Sxy for every pair of documents x and y. Because Sxy = Syx, only the lower triangle needs to be stored:


{{{
  | S21                 |
  | S31  S32            |
  |  ...                |
  | SN1  SN2 ...SN(N-1) |
}}}