Question

我在MATLAB中使用k-means。这是我的代码：

load cobat.txt;  % read the file

k=input('Enter a number: ');        % determine the number of cluster
isRand=0;   % 0 -> sequeantial initialization
            % 1 -> random initialization

[maxRow, maxCol]=size(cobat);
if maxRow<=k, 
    y=[m, 1:maxRow];
else
    % initial value of centroid
    if isRand,
        p = randperm(size(cobat,1));      % random initialization
        for i=1:k
            c(i,:)=cobat(p(i),:) ; 
        end
    else
        for i=1:k
           c(i,:)=cobat(i,:);        % sequential initialization
        end
    end

    temp=zeros(maxRow,1);   % initialize as zero vector
    u=0;
    while 1,
        d=DistMatrix3(cobat,c);   % calculate the distance 
        [z,g]=min(d,[],2);      % set the matrix g group

        if g==temp,             % if the iteration doesn't change anymore
            break;              % stop the iteration
        else
            temp=g;             % copy the matrix to the temporary variable
        end
        for i=1:k
            f=find(g==i);
            if f                % calculate the new centroid 
                c(i,:)=mean(cobat(find(g==i),:),1)
            end
        end
        c
        sort(c)
    end

    y=[cobat,g]

“cobat”是我的档案。在这里看起来：

“c”是每个群集的质心（群集的中心）的变量。 “g”是显示簇号的变量。问题是，我想根据质心（c）对簇号（从小到大）进行排序/拟合。所以，我尝试排序（c），但它不会影响簇号（g）。

当我尝试排序（g）时，它的排序与我想要的不一样。我希望群集号基于质心排序。例;当我运行k = 3的代码时，这是最终的质心

 73.0000   79.0000   70.6667 %C 1
 58.3333   73.3333   84.6667 %C 2
 36.0000   67.0000   66.0000 %C 3

当我对它进行排序时，数字簇也是“已排序”，

36.0000   67.0000   66.0000 %C 3
58.3333   73.3333   70.6667 %C 2
73.0000   79.0000   84.6667 %C 1

我希望数字群适合，就像这样。

36.0000   67.0000   66.0000 %C 1
58.3333   73.3333   70.6667 %C 2
73.0000   79.0000   84.6667 %C 3

它是合适的，没有排序，所以当这行'y = [cobat，g]'运行时，它也会改变。

这看似简单，但很棘手。我弄清楚了。有人有任何想法要解决它吗？

谢谢。

Answer 1

使用从sort或sortrow

返回的已排序索引

[B,index] = sortrows( c );  % sort the centroids
g = g(index(end:-1:1)); % arrange the labels based on centroids' order

基于质心拟合簇号

1 个答案: