Question

我仍在围绕Java中并发的工作方式。我理解（如果您订阅了OO Java 5并发模型），您实现Task或Callable分别使用run()或call()方法，并且你应该尽可能多地并行化这种实现方法。

但我仍然不理解Java中并发编程的固有内容：

如何为Task的{{1}}方法分配适当数量的并发工作？

作为一个具体的例子，如果我有一个I / O绑定的run()方法从本地系统上的文件中读取Herman Melville的 Moby Dick 的全部内容，该怎么办？。我只想说我想要这个readMobyDick()方法并发并由3个线程处理，其中：

线程＃1将本书的前1/3读入内存
线程＃2将本书的第二个1/3读入内存
线程＃3将书籍的最后1/3读入内存

我是否需要将Moby Dick分成三个文件并将它们分别传递给自己的任务，或者我只需从实现的readMobyDick()方法中调用readMobyDick()并以某种方式（以某种方式） run()知道如何在线程中打破工作。

我是一个非常直观的学习者，因此非常感谢任何正确方法的代码示例！谢谢！

Answer 1

你可能偶然选择了并行活动的绝对最糟糕的例子！

从单个机械磁盘并行读取实际上比使用单个线程读取要慢，因为实际上，当每个线程轮到运行时，您会将机械磁头弹回到磁盘的不同部分。最好留作单线程活动。

让我们再举一个例子，它类似于你的但实际上可以提供一些好处：假设我想在一个巨大的单词列表中搜索某个单词的出现（这个列表甚至可能来自一个磁盘文件，但就像我说的，由单个线程阅读）。假设我可以在你的例子中使用3个线程，每个线程搜索巨大单词列表的1/3，并保留一个本地计数器，显示搜索单词出现的次数。

在这种情况下，您需要将列表分为3个部分，将每个部分传递给其类型实现Runnable的不同对象，并使用run方法实现搜索。

运行时本身不知道如何进行分区或类似的东西，你必须自己指定它。还有许多其他分区策略，每个策略都有自己的优点和缺点，但我们现在可以坚持使用静态分区。

让我们看一些代码：

class SearchTask implements Runnable {
     private int localCounter = 0;
     private int start; // start index of search
     private int end;
     private List<String> words;
     private String token;

     public SearchTask(int start, int end, List<String> words, String token) {
         this.start = start;
         this.end = end;
         this.words = words;
         this.token = token;
     }

     public void run() {
         for(int i = start; i < end; i++) {
              if(words.get(i).equals(token)) localCounter++;
         }
     }

     public int getCounter() { return localCounter; }
}

// meanwhile in main :)

List<String> words = new ArrayList<String>();
// populate words 
// let's assume you have 30000 words

// create tasks
SearchTask task1 = new SearchTask(0, 10000, words, "John");
SearchTask task2 = new SearchTask(10000, 20000, words, "John");
SearchTask task3 = new SearchTask(20000, 30000, words, "John");

// create threads for each task
Thread t1 = new Thread(task1);
Thread t2 = new Thread(task2);
Thread t3 = new Thread(task3);

// start threads
t1.start();
t2.start();
t3.start();

// wait for threads to finish
t1.join();
t2.join();
t3.join();

// collect results
int counter = 0;
counter += task1.getCounter();
counter += task2.getCounter();
counter += task3.getCounter();

这应该很好用。请注意，在实际情况下，您将构建更通用的分区方案。如果您希望返回结果，也可以使用ExecutorService并实施Callable而不是Runnable。

这是使用更高级结构的另一个例子：

class SearchTask implements Callable<Integer> {
     private int localCounter = 0;
     private int start; // start index of search
     private int end;
     private List<String> words;
     private String token;

     public SearchTask(int start, int end, List<String> words, String token) {
         this.start = start;
         this.end = end;
         this.words = words;
         this.token = token;
     }

     public Integer call() {
         for(int i = start; i < end; i++) {
              if(words.get(i).equals(token)) localCounter++;
         }
         return localCounter;
     }        
}

// meanwhile in main :)

List<String> words = new ArrayList<String>();
// populate words 
// let's assume you have 30000 words

// create tasks
List<Callable> tasks = new ArrayList<Callable>();
tasks.add(new SearchTask(0, 10000, words, "John"));
tasks.add(new SearchTask(10000, 20000, words, "John"));
tasks.add(new SearchTask(20000, 30000, words, "John"));

// create thread pool and start tasks
ExecutorService exec = Executors.newFixedThreadPool(3);
List<Future> results = exec.invokeAll(tasks);

// wait for tasks to finish and collect results
int counter = 0;
for(Future f: results) {
    counter += f.get();
}

Answer 2

你选择了一个不好的例子，因为都铎是如此友善地指出。旋转磁盘硬件受移动盘片和磁头的物理限制，最有效的读取实现是按顺序读取每个块，这减少了移动磁头或等待磁盘对齐的需要。

也就是说，有些操作系统并不总是在磁盘上持续存储东西，对于那些记得的人来说，如果操作系统/文件系统没有为你完成工作，碎片整理可以提高磁盘性能。

如你所说，想要一个有益的程序，让我建议一个简单的矩阵加法。

假设您为每个核心创建了一个线程，您可以将任意两个矩阵划分为N（每个线程一个）行。添加矩阵（如果您还记得）可以这样工作：

A + B = C

或

[ a11, a12, a13 ]   [ b11, b12, b13]  =  [ (a11+b11), (a12+b12), (a13+c13) ]
[ a21, a22, a23 ] + [ b21, b22, b23]  =  [ (a21+b21), (a22+b22), (a23+c23) ]
[ a31, a32, a33 ]   [ b31, b32, b33]  =  [ (a31+b31), (a32+b32), (a33+c33) ]

因此，为了在N个线程中分配这个，我们只需要将行数和模数除以线程数来获得它将添加的“线程ID”。

matrix with 20 rows across 3 threads
row % 3 == 0 (for rows 0, 3, 6,  9, 12, 15, and 18)
row % 3 == 1 (for rows 1, 4, 7, 10, 13, 16, and 19)
row % 3 == 2 (for rows 2, 5, 8, 11, 14, and 17)
// row 20 doesn't exist, because we number rows from 0

现在每个线程“知道”它应该处理哪些行，并且“每行”的结果可以很简单地计算，因为结果不会进入其他线程的计算域。

现在需要的只是一个“结果”数据结构，它跟踪计算值的时间，并且当设置了最后一个值时，计算完成。在这个带有两个线程的矩阵加法结果的“假”示例中，用两个线程计算答案大约需要一半的时间。

// the following assumes that threads don't get rescheduled to different cores for 
// illustrative purposes only.  Real Threads are scheduled across cores due to
// availability and attempts to prevent unnecessary core migration of a running thread.
[ done, done, done ] // filled in at about the same time as row 2 (runs on core 3)
[ done, done, done ] // filled in at about the same time as row 1 (runs on core 1)
[ done, done, .... ] // filled in at about the same time as row 4 (runs on core 3)
[ done, ...., .... ] // filled in at about the same time as row 3 (runs on core 1)

多线程可以解决更复杂的问题，不同的技术可以解决不同的问题。我故意选择了一个最简单的例子。

Answer 3

使用run（）或call（）方法实现Task或Callable （分别），你应该尽可能多地并行化尽可能实施方法。

Task表示离散工作单元 将文件加载到内存中是一个独立的工作单元，因此可以将此活动委派给后台线程。即后台线程运行此加载文件的任务它是一个独立的工作单元，因为它没有其他依赖关系来完成它的工作（加载文件）并且具有不连续的边界。
你要问的是进一步将其划分为任务。即一个线程加载文件的1/3，而另一个线程加载2/3等如果您能够将任务划分为更多的子任务，那么根据定义，它根本不是一项任务。因此，加载文件本身就是一项任务。

举个例子：
假设您有一个GUI，您需要向用户显示来自5个不同文件的数据。为了呈现它们，您还需要准备一些数据结构来处理实际数据所有这些都是单独的任务例如。加载文件是5个不同的任务，因此可以通过5个不同的线程来完成数据结构的准备可以用不同的线程完成 GUI当然在另一个线程中运行所有这些可以同时发生

Answer 4

如果您的系统支持高吞吐量I / O，请按以下步骤操作：

How to read a file using multiple threads in Java when a high throughput(3GB/s) file system is available

以下是使用多个线程读取单个文件的解决方案。

将文件分成N个块，读取线程中的每个块，然后按顺序合并它们。注意跨越块边界的线。这是用户建议的基本思想 slaks

在单个20 GB文件的多线程实现之下的基准标记：

1个主题：50秒：400 MB / s

2个主题：30秒：666 MB / s

4个主题：20秒：1GB / s

8个主题：60秒：333 MB / s

等效Java7 readAllLines（）：400秒：50 MB / s

注意：这可能仅适用于支持高吞吐量I / O的系统，而不适用于通常的个人计算机

以下是代码的基本命中，有关完整的详细信息，请点击链接

public class FileRead implements Runnable
{

private FileChannel _channel;
private long _startLocation;
private int _size;
int _sequence_number;

public FileRead(long loc, int size, FileChannel chnl, int sequence)
{
    _startLocation = loc;
    _size = size;
    _channel = chnl;
    _sequence_number = sequence;
}

@Override
public void run()
{
        System.out.println("Reading the channel: " + _startLocation + ":" + _size);

        //allocate memory
        ByteBuffer buff = ByteBuffer.allocate(_size);

        //Read file chunk to RAM
        _channel.read(buff, _startLocation);

        //chunk to String
        String string_chunk = new String(buff.array(), Charset.forName("UTF-8"));

        System.out.println("Done Reading the channel: " + _startLocation + ":" + _size);

}

//args[0] is path to read file
//args[1] is the size of thread pool; Need to try different values to fing sweet spot
public static void main(String[] args) throws Exception
{
    FileInputStream fileInputStream = new FileInputStream(args[0]);
    FileChannel channel = fileInputStream.getChannel();
    long remaining_size = channel.size(); //get the total number of bytes in the file
    long chunk_size = remaining_size / Integer.parseInt(args[1]); //file_size/threads


    //thread pool
    ExecutorService executor = Executors.newFixedThreadPool(Integer.parseInt(args[1]));

    long start_loc = 0;//file pointer
    int i = 0; //loop counter
    while (remaining_size >= chunk_size)
    {
        //launches a new thread
        executor.execute(new FileRead(start_loc, toIntExact(chunk_size), channel, i));
        remaining_size = remaining_size - chunk_size;
        start_loc = start_loc + chunk_size;
        i++;
    }

    //load the last remaining piece
    executor.execute(new FileRead(start_loc, toIntExact(remaining_size), channel, i));

    //Tear Down

}

}

多线程读取大量文件

4 个答案: