source: distools/gofcl.m @ 125

Last change on this file since 125 was 10, checked in by bduin, 14 years ago
File size: 1.5 KB
RevLine 
[10]1%GOFCL  Goodness of clusters/classes separability vs compactness
2%
3%   J = GOFCL(D)
4%
5% INPUT
6%   D   NxN Dissimilarity dataset
7%
8% OUTPUT
9%   J   Criterion value
10%
11% DESCRIPTION
12% Computes a goodness of clusters/classes in an NxN dissimilarity dataset D.
13% D is labeled. The criterion provides a trade-off between the cluster/class
14% compactness and cluster/class separability. Consider K classes, with the total
15% numbr of objects N and N_i elements in the i-th class. Let A_ij be the average
16% dissimilarity between the i-th and j-th clusters. Then the criterion is computed
17% as:
18%     J = sum_i (n_i sum_{j neq i} N_i/(N-N_i) A_ij)  / (2 sum_i n_i A_ii )
19%
20% The larger value, the better the separability between the classes.
21%
22% EXAMPLE
23% Compare:
24% 1) rand('seed',37); randn('seed',37); a=gendats(40,2,1); d=sqrt(distm(a));
25%    scatterd(a); j1 = gofcl(d);
26% 2) rand('seed',37); randn('seed',37); a=gendats(40,2,7); d=sqrt(distm(a));
27%    scatterd(a); j2 = gofcl(d);
28
29
30% Copyright: Elzbieta Pekalska, ela.pekalska@googlemail.com
31% Faculty EWI, Delft University of Technology and
32% School of Computer Science, University of Manchester
33
34function J = gofcl(d);
35
36issym(d);
37lab = getnlab(d);
38cc  = max(lab);
39
40for i=1:cc
41  Z = find(lab == i);
42  dw(i) = mean(mean(d(Z,Z)));
43        for j=i+1:cc
44    ZZ = find(lab == j);
45    db(i,j) =  mean(mean(d(Z,ZZ)));
46    db(j,i) =  mean(mean(d(Z,ZZ)));
47        end
48end
49
50
51J = 0;
52for i=1:cc
53  for j=i+1:cc
54    J = J + db(i,j)/((dw(i)+dw(j)));
55  end
56end
57J = J /((cc-1)*cc);
Note: See TracBrowser for help on using the repository browser.