LINQ查询将多个与多个进行比较

时间:2012-02-23 04:35:10

标签: c# linq entity-framework

更新:道歉我的同事稍微更新了模型

更新:考虑到这个想法的背景,也许会有所帮助

我有Show的ShowData列表。每个ShowData都包含许多Prints。所以

1显示 - >许多ShowData - >许多版画

我有两个相同类型对象的数据集 - ShowData

指纹如下所示:

public class ShowData
{
    public ShowData() { }        
    public int time { get; set; }
    public List<Prints> prints { get; set; }
}
public class Prints
{
    public Prints() { }        
    public int value { get; set; }
    public string range { get; set; }
}

我获得了特定节目的所有ShowData:

var ShowData1 = (from showData1 in context.ShowDatas
                                where (showData1.Show.Id == 1)                                    
                                select new
                                {
                                    showData = showData1,
                                    prints = showData1.Prints
                                });                  

所以一个例子是:

DATASET A

time      prints
1         {1,low},{4,low},{8,low},{9,low},{10,low},{11,high},{15,high},{16,high},{18,high}
2         {4,low},{7,low},{8,low},{9,low},{10,low},{12,high},{15,high},{16,high},{19,high}
3         {1,low},{2,low},{3,low},{8,low},{9,low},{11,high},{12,high},{15,high},{16,high}
4         {1,low},{7,low},{8,low},{9,low},{10,low},{11,high},{12,high},{14,high},{15,high}
5         {1,low},{5,low},{6,low},{8,low},{9,low},{11,high},{14,high},{17,high},{19,high}

DATASET B

time      prints
1         {1,low},{2,low},{3,low},{4,low},{5,low},{11,high},{12,high},{13,high},{18,high}
2         {0,low},{3,low},{5,low},{6,low},{7,low},{11,high},{13,high},{19,high},{20,high}

第一个数据集(DATASET A)大约有4000个ShowData项目。我有另一个ShowData数据集,大约120项(DATASET B)。

我试图找到一种方法来比较两个列表,以显示所有时间点,其中DATASET B 中的打印件至少有2个匹配打印到DATASET A. 然而,低至少需要2场比赛,高位需要2场比赛

所以我的返回查询可能如下所示:

TimeInDataSetB         TimesInDataSetAForLows      TimeInDataSetAForHighs
1                              1,3,5                       3,4
2                                                           5

因此,数据集B中时间1处的打印(范围=低)与数据集A中1,3,5次打印的打印至少有2次匹配,数据集B中时间1打印(范围=高)在DatasetA中,至少有2次与印刷品匹配的位置。

DataSetB中时间2处的项目对于低点的DataSet中没有任何匹配,并且只有1个匹配高点

任何人都可以帮忙吗? (我在c#中寻找答案)

使用第一个答案中描述的方法,我尝试了以下方法:

var query3 = from a in recordingPoints
                             from b1 in ShowData1
                             let timeIntersects = a.Prints.Intersect(b1.prints, printsEqualityComparer)
                             where timeIntersects.GroupBy(x => x.Range)
                                                 .All(x => x.Count() > 2)
                             group b1 by a.Time into grouped
                             select new
                             {

                                 TimeInDataSetA = grouped.Key,
                                 TimeInDataSetB = grouped.ToArray()
                             };

其中recordingPoints是ShowData的列表

用于测试的数据集

List<ShowData> bigdataset = new List<Ent.ShowData>();
                List<ShowData> smalldataset = new List<Ent.ShowData>();

                List<int> ints = new List<int>(new int[]{1, 4, 8, 9, 10, 11, 15, 16, 18});
                ShowData od = new Ent.ShowData();
                od.Show.Id = 7;
                foreach (int it in ints)
                {
                    Prints pr = new Prints();
                    if (it < 11)
                        pr.Range = "low";
                    else
                        pr.Range = "high";
                    pr.Value = it.ToString();
                    od.Prints.Add(pr);
                }                    
                od.Time = 1;
                bigdataset.Add(od);

                ints = new List<int>(new int[] { 4, 7, 8, 9, 10, 12, 15, 16, 19 });
                od = new Ent.ShowData();
                od.Show.Id = 7;
                foreach (int it in ints)
                {
                    Prints pr = new Prints();
                    if (it < 11)
                        pr.Range = "low";
                    else
                        pr.Range = "high";
                    pr.Value = it.ToString();
                    od.Prints.Add(pr);
                }
                od.Time = 2;
                bigdataset.Add(od);

                ints = new List<int>(new int[] { 1, 2, 3, 8, 9, 11, 12, 15, 16 });
                od = new Ent.ShowData();
                od.Show.Id = 7;
                foreach (int it in ints)
                {
                    Prints pr = new Prints();
                    if (it < 11)
                        pr.Range = "low";
                    else
                        pr.Range = "high";
                    pr.Value = it.ToString();
                    od.Prints.Add(pr);
                }
                od.Time = 3;
                bigdataset.Add(od);

                ints = new List<int>(new int[] { 1, 7, 8, 9, 10, 11, 12, 14, 15 });
                od = new Ent.ShowData();
                od.Show.Id = 7;
                foreach (int it in ints)
                {
                    Prints pr = new Prints();
                    if (it < 11)
                        pr.Range = "low";
                    else
                        pr.Range = "high";
                    pr.Value = it.ToString();
                    od.Prints.Add(pr);
                }
                od.Time = 4;
                bigdataset.Add(od);

                ints = new List<int>(new int[] { 1, 5, 6, 8, 9, 11, 14, 17, 19 });
                od = new Ent.ShowData();
                od.Show.Id = 7;
                foreach (int it in ints)
                {
                    Prints pr = new Prints();
                    if (it < 11)
                        pr.Range = "low";
                    else
                        pr.Range = "high";
                    pr.Value = it.ToString();
                    od.Prints.Add(pr);
                }
                od.Time = 5;
                bigdataset.Add(od);

                ints = new List<int>(new int[] { 1, 2, 3, 4, 5, 11, 12, 13, 18 });
                od = new Ent.ShowData();
                foreach (int it in ints)
                {
                    Prints pr = new Prints();
                    if (it < 11)
                        pr.Range = "low";
                    else
                        pr.Range = "high";
                    pr.Value = it.ToString();
                    od.Prints.Add(pr);
                }
                od.Time = 1;
                smalldataset.Add(od);

                ints = new List<int>(new int[] { 0, 3, 5, 6, 7, 11, 13, 19, 20 });
                od = new Ent.ShowData();
                foreach (int it in ints)
                {
                    Prints pr = new Prints();
                    if (it < 11)
                        pr.Range = "low";
                    else
                        pr.Range = "high";
                    pr.Value = it.ToString();
                    od.Prints.Add(pr);
                }
                od.Time = 2;
                smalldataset.Add(od);

var printsEqualityComparer = new PrintsEqualityComparer();

                    var query4 = from a in smalldataset
                                 from b1 in bigdataset
                                 let timeIntersects = a.Prints.Intersect(b1.Prints, printsEqualityComparer)
                                 where timeIntersects.GroupBy(x => x.Range)
                                                     .All(x => x.Count() > 1)
                                 group b1 by a.Time into grouped
                                 select new
                                 {
                                     TimeInDataSetA = grouped.Key,
                                     TimeInDataSetB = grouped.ToArray()
                                 };

1 个答案:

答案 0 :(得分:3)

您可以对A中的每个项目执行相交,A过滤中的每个项目最小匹配为3,并按照A中设置的时间分组:

var query = from a in listA
            from b in listB
            where a.prints.Intersect(b.prints).Count() >= 3
            group b by a.time into grouped
            select new
            {
                TimeInDataSetA = grouped.Key,
                TimeInDataSetB = grouped.ToArray()
            };

编辑,根据您的新请求,您可以为intersect方法提供equalityComparer,以确定2个Prints实例的相等性。请注意,在下面的示例中,我提供了一个非常原始的实现。请阅读提供的链接。

// please see: http://blogs.msdn.com/b/ericlippert/archive/2011/02/28/guidelines-and-rules-for-gethashcode.aspx
class PrintsEqualityComparer : IEqualityComparer<Prints>
{
    public bool Equals(Prints x, Prints y)
    {
        return object.Equals(x, y) && object.Equals(x.value, y.value);
    }

    public int GetHashCode(Prints obj)
    {
        return obj.range.GetHashCode() ^ obj.value.GetHashCode();
    }
}
var printsEqualityComparer = new PrintsEqualityComparer();

var query = from a in listA
        from b in listB
        let timeIntersects = a.prints.Intersect(b.prints, printsEqualityComparer)
        where timeIntersects.GroupBy(x => x.range)
                            .All(x => x.Count() > 2)
        group b by a.time into grouped
        select new
        {
            TimeInDataSetA = grouped.Key,
            TimeInDataSetB = grouped.ToArray()
        };