Question

我在单元格中有一些字符串名称。我想比较其他单元格中的某个名称。

例如，第一个单元格的第一列，第二个单元格的所有列。

在以下示例单元格a和单元格b中。如您所见，第3列的许多元素匹配上述第3列的元素。我想逐列比较。

有没有办法不使用循环进行细胞比较？

 a=

'virutass'  'BrucePerens'   'RogerBrowne'   'keverets'
'armando'   'srivasta'      'riel'          'theo'
[]          'wichert'       'acme'          'rgb'
[]          'joey'          'aoliva'        'benc'
[]          'buffy'         'connolly'      'azz'
[]          'willow'        'itamar'        'dorward'
[]          'gnuchris'      'Kay'            'nuked'
[]          'edward'        'sab39'          'Yashy'
[]          'rcw'            'sad'           'idallen'
[]          'JHM'          'davem'            []
[]          'jgg'           'Blacky'          []



b=

 'sp'       'Erbo'         'connolly'     'draco'
'thomasd'   'miguel'       'joey'            []
'isenguard' 'mathieu'      'Kay'             []
'yole'      'fejj'         'acme'            []
'ewan'      'harinath'     'sad'             []

如果超过60％的元素在一列中，则结果是列号。在这个例子中，a的第3列是结果。因为b中的第3列与超过60％的元素相匹配。

Answer 1

使用anonymous functions和cell/arrayfun ismember功能可能有助于满足您的需求：

如果你真的不需要保留行和列：

% Remove empty cells to compare only strings
emptyCells = @(x) cellfun(@isempty,x);
a(emptyCells(a)) = [];
b(emptyCells(b)) = [];

% Get ids of element of b included in a
id = cellfun(@(elem) ismember(elem, a), b);

% Shows resutls
b(id)

如果你真的需要保持细胞行之间的比较：

nColumns = 4
emptyCells = @(x) cellfun(@isempty,x);
% get only the non-empty values of the cell
getValue = @(dataCell) dataCell(not(emptyCells(dataCell)));

% Compare for each columns, values of b(:,col) and a(:,col)
isBinA = arrayfun(@(col) ismember(getValue(b(:,col)), getValue(a(:,col))), 1:nColumns, 'UniformOutput', 0);

这里有一个具有逻辑值的单元格，例如：

isBinA{3}

ans =

     1
     0
     1
     1
     1

在单元格b的第3列中，您有4个名称，这些名称包含在单元格a的第3列中：

b(isBinA{3},3)

ans = 

    'connolly'
    'Kay'
    'acme'
    'sad'

Answer 2

您想要逐列比较单元格（cell1column1与cell2column1，cell1column2与cell2colun2 .....）并检查至少60％匹配？顺序是否重要（如果两列中的名称相同，但行不同，是否可以）？

if length(intersect(a(:,1),b(:,1)))>0.6*length(b(:,1))
disp('Column 1 is good')
else
disp('Column 1 is bad')
end
if length(intersect(a(:,2),b(:,2)))>0.6*length(b(:,2))
disp('Column 2 is good')
else
disp('Column 2 is bad')
end
if length(intersect(a(:,3),b(:,3)))>0.6*length(b(:,3))
disp('Column 3 is good')
else
disp('Column 3 is bad')
end
if length(intersect(a(:,4),b(:,4)))>0.6*length(b(:,4))
disp('Column 4 is good')
else
disp('Column 4 is bad')
end

Matlab，两个串细胞比较

2 个答案: