Question

1

我有多个线程更新ConcurrentHashMap。每个线程根据键将一个整数列表附加到映射条目的值。任何线程都没有删除操作。

这里的要点是我希望尽可能减少锁定和同步的范围。

我看到computeIf ...（）方法的文档说＆＃34;其他线程的某些尝试更新操作可能会在计算正在进行时被阻止＆＃34;，这不是那么令人鼓舞。另一方面，当我查看其源代码时，我无法观察它在整个地图上的锁定/同步位置。

因此，我想知道使用computeIf ...（）和以下本土方法2＆＃39;的理论性能的比较。

2

另外，我觉得我在这里描述的问题可能是你可以在ConcurrentHashMap上执行的最简化的check-n-set（或者通常是＆＃39;复合＆＃39;）操作之一。

然而，我并不十分自信，并且对于如何在ConcurrentHashMap上进行这种简单的复合操作非常容易找到，没有锁定/同步在整个地图上。

因此，非常感谢任何一般的良好实践建议。

public void myConcurrentHashMapTest1() {

    ConcurrentHashMap<String, List<Integer>> myMap = new ConcurrentHashMap<String, List<Integer>>();

    // MAP KEY: a Word found by a thread on a page of a book 
    String myKey = "word1";

    // -- Method 1: 
    // Step 1.1 first, try to use computeIfPresent(). doc says it may lock the
    //      entire myMap. 
    myMap.computeIfPresent(myKey, (key,val) -> val.addAll(getMyVals()));
    // Step 1.2 then use computeIfAbsent(). Again, doc says it may lock the
    //      entire myMap. 
    myMap.computeIfAbsent(myKey, key -> getMyVals());    
}

public void myConcurrentHashMapTest2() {        
    // -- Method 2: home-grown lock splitting (kind of). Will it theoretically 
    //      perform better? 

    // Step 2.1: TRY to directly put an empty list for the key
    //      This may have no effect if the key is already present in the map
    List<Integer> myEmptyList = new ArrayList<Integer>();
    myMap.putIfAbsent(myKey, myEmptyList);

    // Step 2.2: By now, we should have the key present in the map
    //      ASSUMPTION: no thread does removal 
    List<Integer> listInMap = myMap.get(myKey);

    // Step 2.3: Synchronize on that list, append all the values 
    synchronized(listInMap){
        listInMap.addAll(getMyVals());
    }

}

public List<Integer> getMyVals(){
    // MAP VALUE: e.g. Page Indices where word is found (by a thread)
    List<Integer> myValList = new ArrayList<Integer>(); 
    myValList.add(1);
    myValList.add(2);

    return myValList;
}

Answer 1

基于对Javadoc的误解，你基于你的假设（使用ConcurrentHashMap按预期对你来说太慢了）。 Javadoc并未声明整个地图将被锁定。它也没有声明每个computeIfAbsent()操作执行悲观锁定。

实际上可以锁定的是bin（a.k.a. bucket），它对应于ConcurrentHashMap的内部数组后台中的单个元素。请注意，这不是包含多个存储桶的Java 7的地图段。当这样的bin被锁定时，可能被阻止的操作仅仅是对散列到同一bin的键的更新。

另一方面，您的解决方案并不意味着避免ConcurrentHashMap内的所有内部锁定 - computeIfAbsent()只是可以降级为使用synchronized的方法之一更新时阻止即使是您最初为某个键设置空列表的putIfAbsent()，如果它没有击中空箱，也会阻止。

但更糟糕的是，您的解决方案并不能保证synchronized批量更新的可见性。我们保证get() 在<{em> putIfAbsent()之前发生了它所观察到的值，但是之前没有发生在之前批量更新和随后的get()。

P.S。您可以在其OpenJDK实现中进一步了解ConcurrentHashMap中的锁定：http://hg.openjdk.java.net/jdk8/jdk8/jdk/file/687fd7c7986d/src/share/classes/java/util/concurrent/ConcurrentHashMap.java，第313-352行。

Answer 2

由于已经explained by Dimitar Dimitrov，compute…方法通常不会锁定整个地图。在最好的情况下，即不需要增加容量并且没有哈希冲突，只锁定单个密钥的映射。

然而，仍有一些事情你可以做得更好：

通常，避免执行多次查找。这适用于两种变体，使用computeIfPresent后跟computeIfAbsent，以及使用putIfAbsent后跟get
仍然建议最小化持有锁时执行的代码，即不要在持有锁时调用getMyVals()，因为它不依赖于地图的状态

将它们放在一起，更新应如下所示：

// compute without holding a lock
List<Integer> toAdd=getMyVals();
// update the map
myMap.compute(myKey, (key,val) -> {
    if(val==null) val=toAdd; else val.addAll(toAdd);
    return val;
});

或

// compute without holding a lock
List<Integer> toAdd=getMyVals();
// update the map
myMap.merge(myKey, toAdd, (a,b) -> { a.addAll(b); return a; });

可以简化为

myMap.merge(myKey, getMyVals(), (a,b) -> { a.addAll(b); return a; });

最小化JDK8 ConcurrentHashMap检查和设置操作的锁定范围

2 个答案: