gap {SAGx} | R Documentation |
Calculates a goodness of clustering measure based on the average dispersion compared to a reference distribution.
gap(data = swiss,class = g, B = 500, cluster.func = myclus)
data |
The data matrix, with samples (observations) in rows and genes (variables)in columns |
class |
a vector descibing the cluster memberships of the rows of data |
B |
the number of bootstrap samples |
cluster.func |
a function taking the arguments data and k (number of clusters) and outputs cluster assignments
as list elements cluster ( accessed by object$cluster ). |
The GAP statistic and the standard deviation
Per Broberg
Tishirani, R., Walther, G. and Hastie, T. (2000) Estimating the number of clusters in a dataset via the Gap statistic. Technical Report Stanford
library("MASS") data(swiss) cl <- myclus(data = swiss, k = 3) gap(swiss,cl$cluster)