Question

我可以使用一些伪代码，或者更好的Python。我正在尝试为Python IRC机器人实现速率限制队列，它部分工作，但如果有人触发的消息少于限制（例如，速率限制是每8秒5条消息，而此人只触发4条消息），并且下一个触发器超过8秒（例如，16秒后），机器人发送消息，但是队列变满并且机器人等待8秒，即使自8秒时间段已经过去也不需要它。

Answer 1

这里是simplest algorithm，如果您只想在邮件到达太快时丢弃它们（而不是排队，这是有道理的，因为队列可能会变得任意大）：

rate = 5.0; // unit: messages
per  = 8.0; // unit: seconds
allowance = rate; // unit: messages
last_check = now(); // floating-point, e.g. usec accuracy. Unit: seconds

when (message_received):
  current = now();
  time_passed = current - last_check;
  last_check = current;
  allowance += time_passed * (rate / per);
  if (allowance > rate):
    allowance = rate; // throttle
  if (allowance < 1.0):
    discard_message();
  else:
    forward_message();
    allowance -= 1.0;

此解决方案中没有数据结构，定时器等，它干净利落地工作:)为了看到这一点，'允许'最多以每秒5/8单位的速度增长，即每8秒最多5个单位。转发的每条消息都会扣除一个单位，因此每8秒钟不能发送超过5条消息。

请注意rate应该是一个整数，即没有非零小数部分，否则算法将无法正常工作（实际速率不会是rate/per）。例如。 rate=0.5; per=1.0;不起作用，因为allowance永远不会增长到1.0。但rate=1.0; per=2.0;工作正常。

Answer 2

在排队的函数之前使用此装饰器@RateLimited（ratepersec）。

基本上，这会检查自上次以来是否已经过1 /速率秒，如果没有，则等待剩余的时间，否则它不会等待。这实际上限制了你的速率/秒。装饰器可以应用于您想要限速的任何功能。

在您的情况下，如果您希望每8秒最多包含5条消息，请在sendToQueue函数之前使用@RateLimited（0.625）。

import time

def RateLimited(maxPerSecond):
    minInterval = 1.0 / float(maxPerSecond)
    def decorate(func):
        lastTimeCalled = [0.0]
        def rateLimitedFunction(*args,**kargs):
            elapsed = time.clock() - lastTimeCalled[0]
            leftToWait = minInterval - elapsed
            if leftToWait>0:
                time.sleep(leftToWait)
            ret = func(*args,**kargs)
            lastTimeCalled[0] = time.clock()
            return ret
        return rateLimitedFunction
    return decorate

@RateLimited(2)  # 2 per second at most
def PrintNumber(num):
    print num

if __name__ == "__main__":
    print "This should print 1,2,3... at about 2 per second."
    for i in range(1,100):
        PrintNumber(i)

Answer 3

Token Bucket实现起来相当简单。

从一个带有5个令牌的桶开始。

每5/8秒：如果存储桶少于5个令牌，请添加一个。

每次要发送消息时：如果存储桶有≥1个令牌，请取出一个令牌并发送消息。否则，等待/删除消息/等等。

（显然，在实际代码中，你使用整数计数器而不是真实的标记，你可以通过存储时间戳来优化每5/8步骤）

再次阅读问题，如果速率限制每8秒完全重置一次，那么这是一个修改：

在很久以前的某个时间（例如，在纪元）开始时间戳last_send。此外，从相同的5令牌桶开始。

每5/8秒执行一次规则。

每次发送消息时：首先，检查last_send是否≥8秒前。如果是这样，请填充桶（将其设置为5个令牌）。其次，如果存储桶中有令牌，则发送消息（否则，丢弃/等待/等）。第三，将last_send设置为现在。

这应该适用于那种情况。

我实际上是用这样的策略编写了一个IRC机器人（第一种方法）。它在Perl中，而不是Python，但这里有一些代码来说明：

这里的第一部分处理向桶添加令牌。您可以看到基于时间（第2行到最后一行）添加令牌的优化，然后最后一行将桶内容限制为最大值（MESSAGE_BURST）

    my $start_time = time;
    ...
    # Bucket handling
    my $bucket = $conn->{fujiko_limit_bucket};
    my $lasttx = $conn->{fujiko_limit_lasttx};
    $bucket += ($start_time-$lasttx)/MESSAGE_INTERVAL;
    ($bucket <= MESSAGE_BURST) or $bucket = MESSAGE_BURST;

$ conn是一种传递的数据结构。这是在一个常规运行的方法中（它计算下次有什么事情要做，并且长时间睡眠或直到它获得网络流量）。该方法的下一部分处理发送。它相当复杂，因为消息具有与之相关的优先级。

    # Queue handling. Start with the ultimate queue.
    my $queues = $conn->{fujiko_queues};
    foreach my $entry (@{$queues->[PRIORITY_ULTIMATE]}) {
            # Ultimate is special. We run ultimate no matter what. Even if
            # it sends the bucket negative.
            --$bucket;
            $entry->{code}(@{$entry->{args}});
    }
    $queues->[PRIORITY_ULTIMATE] = [];

这是第一个队列，无论如何都会运行。即使它让我们的连接因洪水而死亡。用于非常重要的事情，比如响应服务器的PING。接下来，其余的队列：

    # Continue to the other queues, in order of priority.
    QRUN: for (my $pri = PRIORITY_HIGH; $pri >= PRIORITY_JUNK; --$pri) {
            my $queue = $queues->[$pri];
            while (scalar(@$queue)) {
                    if ($bucket < 1) {
                            # continue later.
                            $need_more_time = 1;
                            last QRUN;
                    } else {
                            --$bucket;
                            my $entry = shift @$queue;
                            $entry->{code}(@{$entry->{args}});
                    }
            }
    }

最后，存储桶状态被保存回$ conn数据结构（实际上稍晚于该方法;它首先计算它将在多长时间内完成更多工作）

    # Save status.
    $conn->{fujiko_limit_bucket} = $bucket;
    $conn->{fujiko_limit_lasttx} = $start_time;

如您所见，实际的铲斗处理代码非常小 - 约四行。其余代码是优先级队列处理。僵尸程序具有优先级队列，例如，与之聊天的人不能阻止它执行其重要的启动/禁止职责。

Answer 4

阻止处理直到可以发送消息，从而排队进一步消息，antti的漂亮解决方案也可以这样修改：

rate = 5.0; // unit: messages
per  = 8.0; // unit: seconds
allowance = rate; // unit: messages
last_check = now(); // floating-point, e.g. usec accuracy. Unit: seconds

when (message_received):
  current = now();
  time_passed = current - last_check;
  last_check = current;
  allowance += time_passed * (rate / per);
  if (allowance > rate):
    allowance = rate; // throttle
  if (allowance < 1.0):
    time.sleep( (1-allowance) * (per/rate))
    forward_message();
    allowance = 0.0;
  else:
    forward_message();
    allowance -= 1.0;

它只是等到有足够的余量发送消息。为了不以两倍的速率开始，补贴也可以用0初始化。

Answer 5

一种解决方案是将时间戳附加到每个队列项，并在8秒过后丢弃该项。您可以在每次添加队列时执行此检查。

仅当您将队列大小限制为5并在队列已满时丢弃任何添加项时，此方法才有效。

Answer 6

保留最后五行的发送时间。保持排队的消息，直到最近的第五条消息（如果存在）过去至少8秒（将last_five作为一个数组）：

now = time.time()
if len(last_five) == 0 or (now - last_five[-1]) >= 8.0:
    last_five.insert(0, now)
    send_message(msg)
if len(last_five) > 5:
    last_five.pop()

Answer 7

如果有人仍然感兴趣，我会将这个简单的可调用类与定时LRU密钥值存储结合使用，以限制每个IP的请求率。使用双端队列，但可以改写为与列表一起使用。

from collections import deque
import time


class RateLimiter:
    def __init__(self, maxRate=5, timeUnit=1):
        self.timeUnit = timeUnit
        self.deque = deque(maxlen=maxRate)

    def __call__(self):
        if self.deque.maxlen == len(self.deque):
            cTime = time.time()
            if cTime - self.deque[0] > self.timeUnit:
                self.deque.append(cTime)
                return False
            else:
                return True
        self.deque.append(time.time())
        return False

r = RateLimiter()
for i in range(0,100):
    time.sleep(0.1)
    print(i, "block" if r() else "pass")

Answer 8

只是从接受的答案中执行代码的python。

Array
(
  [0] => value1
  [1] => value2
  [2] => value3        
  [3] => value4
  [4] => value5
  [5] => value6    
  [6] => value7
  [7] => value8
  [8] => value9           
)

Answer 9

这个怎么样：

long check_time = System.currentTimeMillis();
int msgs_sent_count = 0;

private boolean isRateLimited(int msgs_per_sec) {
    if (System.currentTimeMillis() - check_time > 1000) {
        check_time = System.currentTimeMillis();
        msgs_sent_count = 0;
    }

    if (msgs_sent_count > (msgs_per_sec - 1)) {
        return true;
    } else {
        msgs_sent_count++;
    }

    return false;
}

Answer 10

我需要Scala的变体。这是：

case class Limiter[-A, +B](callsPerSecond: (Double, Double), f: A ⇒ B) extends (A ⇒ B) {

  import Thread.sleep
  private def now = System.currentTimeMillis / 1000.0
  private val (calls, sec) = callsPerSecond
  private var allowance  = 1.0
  private var last = now

  def apply(a: A): B = {
    synchronized {
      val t = now
      val delta_t = t - last
      last = t
      allowance += delta_t * (calls / sec)
      if (allowance > calls)
        allowance = calls
      if (allowance < 1d) {
        sleep(((1 - allowance) * (sec / calls) * 1000d).toLong)
      }
      allowance -= 1
    }
    f(a)
  }

}

以下是它的使用方法：

val f = Limiter((5d, 8d), { 
  _: Unit ⇒ 
    println(System.currentTimeMillis) 
})
while(true){f(())}

什么是一个好的速率限制算法？

10 个答案: