Data defines the model by dint of genetic programming, producing the best decile table.


SAS Code for K-Means Clustering
Bruce Ratner, PhD
Live chat by Boldchat
Live chat by Boldchat

/* Run proc FASTCLUS for k from 3 to 8 */

%macro doFASTCLUS;
%do k= 3 %to 8;

proc FASTCLUS
data= digits
out= fcOut
maxiter= 100
converge= 0 /* run to complete convergence */
radius= 100 /* look for initial centroids that are far apart */
maxclusters= &k
summary;
run;

%end;
%mend;
%doFASTCLUS

The summary output for each k includes four different statistics
for determining the compactness and separation of the clustering results.

For more information about this article, call Bruce Ratner at 516.791.3544 or 1 800 DM STAT-1; or e-mail at br@dmstat1.com.
Sign-up for a free GenIQ webcast: Click here.