Question

我正在开发一个haskell网络应用程序，我使用actor模式来管理多线程。我遇到的一件事是如何存储例如一组客户端套接字/句柄。当然，所有线程都必须可以访问，并且可以在客户端登录/注销时更改。

由于我来自势在必行的世界，我想到了某种锁机制，但当我注意到这是多么丑陋时，我想到了“纯粹的”可变性，实际上它有点纯粹：

import Control.Concurrent
import Control.Monad
import Network
import System.IO
import Data.List
import Data.Maybe
import System.Environment
import Control.Exception


newStorage :: (Eq a, Show a) => IO (Chan (String, Maybe (Chan [a]), Maybe a))
newStorage = do
  q <- newChan
  forkIO $ storage [] q
  return q


newHandleStorage :: IO (Chan (String, Maybe (Chan [Handle]), Maybe Handle))
newHandleStorage = newStorage


storage :: (Eq a, Show a) => [a] -> Chan (String, Maybe (Chan [a]), Maybe a) -> IO ()
storage s q = do
  let loop = (`storage` q)
  (req, reply, d) <- readChan q
  print ("processing " ++ show(d))
  case req of
    "add" -> loop ((fromJust d) : s)
    "remove" -> loop (delete (fromJust d) s)
    "get" -> do
      writeChan (fromJust reply) s
      loop s


store s d = writeChan s ("add", Nothing, Just d)
unstore s d = writeChan s ("remove", Nothing, Just d)
request s = do
  chan <- newChan
  writeChan s ("get", Just chan, Nothing)
  readChan chan

关键是线程（actor）正在管理项目列表并根据传入请求修改列表。由于线程非常便宜，我认为这可能是一个非常好的功能替代方案。

当然这只是一个原型（快速脏概念证明）。所以我的问题是：

这是管理共享可变变量的“好方法”（在演员世界中）吗？
这个模式已经有了一个库吗？（我已经搜索但我什么也没找到）

此致克里斯

Answer 1

以下是使用stm和pipes-network的快速而肮脏的示例。这将设置一个简单的服务器，允许客户端连接，递增或递减计数器。它将显示一个非常简单的状态栏，显示所有已连接客户端的当前结果，并在断开连接时从条形图中删除客户端标记。

首先我将从服务器开始，我慷慨地评论代码来解释它是如何工作的：

import Control.Concurrent.STM (STM, atomically)
import Control.Concurrent.STM.TVar
import qualified Data.HashMap.Strict as H
import Data.Foldable (forM_)

import Control.Concurrent (forkIO, threadDelay)
import Control.Monad (unless)
import Control.Monad.Trans.State.Strict
import qualified Data.ByteString.Char8 as B
import Control.Proxy
import Control.Proxy.TCP
import System.IO

main = do
    hSetBuffering stdout NoBuffering

    {- These are the internal data structures.  They should be an implementation
       detail and you should never expose these references to the
       "business logic" part of the application. -}
    -- I use nRef to keep track of creating fresh Ints (which identify users)
    nRef <- newTVarIO 0       :: IO (TVar Int)
    {- hMap associates every user (i.e. Int) with a counter

       Notice how I've "striped" the hash map by storing STM references to the
       values instead of storing the values directly.  This means that I only
       actually write the hashmap when adding or removing users, which reduces
       contention for the hash map.

       Since each user gets their own unique STM reference for their counter,
       modifying counters does not cause contention with other counters or
       contention with the hash map. -}
    hMap <- newTVarIO H.empty :: IO (TVar (H.HashMap Int (TVar Int)))

    {- The following code makes heavy use of Haskell's pure closures.  Each
       'let' binding closes over its current environment, which is safe since
        Haskell is pure. -}

    let {- 'getCounters' is the only server-facing command in our STM API.  The
           only permitted operation is retrieving the current set of user
           counters.

           'getCounters' closes over the 'hMap' reference currently in scope so
           that the server never needs to be aware about our internal
           implementation. -}
        getCounters :: STM [Int]
        getCounters = do
            refs <- fmap H.elems (readTVar hMap)
            mapM readTVar refs

        {- 'init' is the only client-facing command in our STM API.  It
            initializes the client's entry in the hash map and returns two
            commands: the first command is what the client calls to 'increment'
            their counter and the second command is what the client calls to log
            off and delete
            'delete' command.

            Notice that those two returned commands each close over the client's
            unique STM reference so the client never needs to be aware of how
            exactly 'init' is implemented under the hood. -}
        init :: STM (STM (), STM ())
        init = do
            n   <- readTVar nRef
            writeTVar nRef $! n + 1

            ref <- newTVar 0
            modifyTVar' hMap (H.insert n ref)

            let incrementRef :: STM ()
                incrementRef = do
                    mRef <- fmap (H.lookup n) (readTVar hMap)
                    forM_ mRef $ \ref -> modifyTVar' ref (+ 1)

                deleteRef :: STM ()
                deleteRef = modifyTVar' hMap (H.delete n)

            return (incrementRef, deleteRef)

    {- Now for the actual program logic.  Everything past this point only uses
       the approved STM API (i.e. 'getCounters' and 'init').  If I wanted I
       could factor the above approved STM API into a separate module to enforce
       the encapsulation boundary, but I am lazy. -}

    {- Fork a thread which polls the current state of the counters and displays
       it to the console.  There is a way to implement this without polling but
       this gets the job done for now.

       Most of what it is doing is just some simple tricks to reuse the same
       console line instead of outputting a stream of lines.  Otherwise it
       would be just:

       forkIO $ forever $ do
           ns <- atomically getCounters
           print ns
    -}
    forkIO $ (`evalStateT` 0) $ forever $ do
        del <- get
        lift $ do
            putStr (replicate del '\b')
            putStr (replicate del ' ' )
            putStr (replicate del '\b')
        ns <- lift $ atomically getCounters
        let str = show ns
        lift $ putStr str
        put $! length str
        lift $ threadDelay 10000

    {- Fork a thread for each incoming connection, which listens to the client's
       commands and translates them into 'STM' actions -}
    serve HostAny "8080" $ \(socket, _) -> do
        (increment, delete) <- atomically init

        {- Right now, just do the dumb thing and convert all keypresses into
           increment commands, with the exception of the 'q' key, which will
           quit -}
        let handler :: (Proxy p) => () -> Consumer p Char IO ()
            handler () = runIdentityP loop
              where
                loop = do
                    c <- request ()
                    unless (c == 'q') $ do
                        lift $ atomically increment
                        loop

        {- This uses my 'pipes' library.  It basically is a high-level way to
           say:

           * Read binary packets from the socket no bigger than 4096 bytes

           * Get the first character from each packet and discard the rest

           * Handle the character using the above 'handler' function -}
        runProxy $ socketReadS 4096 socket >-> mapD B.head >-> handler

        {- The above pipeline finishes either when the socket closes or
           'handler' stops looping because it received a 'q'.  Either case means
           that the client is done so we log them out using 'delete'. -}
        atomically delete

接下来是客户端，它只是打开一个连接并将所有按键转发为单个数据包：

import Control.Monad
import Control.Proxy
import Control.Proxy.Safe
import Control.Proxy.TCP.Safe
import Data.ByteString.Char8 (pack)
import System.IO

main = do
    hSetBuffering stdin NoBuffering
    hSetEcho      stdin False

    {- Again, this uses my 'pipes' library.  It basically says:

        * Read characters from the console using 'commands'

        * Pack them into a binary format

        * send them to a server running at 127.0.0.1:8080

        This finishes looping when the user types a 'q' or the connection is
        closed for whatever reason.
    -}
    runSafeIO $ runProxy $ runEitherK $
         try . commands
     >-> mapD (\c -> pack [c])
     >-> connectWriteD Nothing "127.0.0.1" "8080"

commands :: (Proxy p) => () -> Producer p Char IO ()
commands () = runIdentityP loop
  where
    loop = do
        c <- lift getChar
        respond c
        unless (c == 'q') loop

非常简单：commands生成Char s流，然后转换为ByteString，然后作为数据包发送到服务器。

如果您运行服务器和几个客户端并让它们分别键入几个键，则服务器显示屏将输出一个列表，显示每个客户端键入的键数：

[1,6,4]

...如果某些客户端断开连接，它们将从列表中删除：

[1,4]

请注意，这些示例中的pipes组件将在即将发布的pipes-4.0.0版本中大大简化，但当前的pipes生态系统仍可按原样完成工作。

Answer 2

首先，我绝对建议使用您自己的特定数据类型来表示命令。当使用(String, Maybe (Chan [a]), Maybe a)时，一个错误的客户端可以通过发送一个未知的命令或发送("add", Nothing, Nothing)等来使你的演员崩溃。我建议像

data Command a = Add a | Remove a | Get (Chan [a])

然后，您可以通过保存方式对storage中的命令进行模式匹配。

演员有自己的优势，但我觉得他们有一些弊端。例如，从演员那里获得答案需要向其发送命令然后等待答复。并且客户端无法完全确定它是否得到了回复，并且回复将是某种特定类型 - 您不能说我只想要这个特定命令的答案（以及它们中有多少个）。 / p>

举个例子，我将给出一个简单的STM解决方案。最好使用哈希表或（平衡树）集，但由于Handle既不实现Ord也不实现Hashable，我们不能使用这些数据结构，所以我将继续使用列表。

module ThreadSet (
    TSet, add, remove, get
) where

import Control.Monad
import Control.Monad.STM
import Control.Concurrent.STM.TVar
import Data.List (delete)

newtype TSet a = TSet (TVar [a])

add :: (Eq a) => a -> TSet a -> STM ()
add x (TSet v) = readTVar v >>= writeTVar v . (x :)

remove :: (Eq a) => a -> TSet a -> STM ()
remove x (TSet v) = readTVar v >>= writeTVar v . delete x

get :: (Eq a) => TSet a -> STM [a]
get (TSet v) = readTVar v

此模块实现基于STM的任意元素集。您可以拥有多个此类集，并在一次成功或失败的单个STM事务中一起使用它们。例如

-- | Ensures that there is exactly one element `x` in the set.
add1 :: (Eq a) => a -> TSet a -> STM ()
add1 x v = remove x v >> add x v

对于演员来说这很难，你必须将它作为演员的另一个命令添加，你不能把它组成现有的动作并且仍然具有原子性。

更新：有一个有趣的article解释了为什么Clojure设计师选择不使用演员。例如，使用actor，即使你有很多读取，只有非常少的写入可变结构，它们都被序列化，这可能会极大地影响性能。

Haskell - 基于演员的可变性

2 个答案: