我有一个关于奇怪异常的问题。
我在生产中遇到了一些RejectedExecutionException(每天大约200个)。 Reactor使用Schedulers.fromExecutorService()在自定义调度程序上运行。
因此,我首先检查了队列大小或ExcutorService
的任何值,但这很正常。没有完整的队列。没有关机。
这是引发异常的代码。
return reactionRepository
.getPage(context, scanQuery)
.buffer(100)
.concatMap(Flux::fromIterable)
.flatMapSequential(likeSn -> findOne(context, parentId, likeSn)
.transform(ReactiveHelpers.defaultIfNotFoundOrError(Optional.empty())))
.filter(Optional::isPresent)
.map(Optional::get)
.doOnError(e -> log.error("Failed to find likes", e));
getPage()返回Flux对象。以下代码是从Redis集群读取信息的主要代码。
...
return bucketList.publishOn(redisScheduler)
.filter(val -> val.getScore() >= 0)
.map(Value::getValue)
这是我在日志文件中得到的异常日志。此错误日志写在上面的.doOnError(e -> log.error("Failed to find likes", e));
行中。
reactor.core.Exceptions$ReactorRejectedExecutionException: Scheduler unavailable
at reactor.core.Exceptions.failWithRejected(Exceptions.java:249)
at reactor.core.publisher.Operators.onRejectedExecution(Operators.java:412)
at reactor.core.publisher.FluxPublishOn$PublishOnSubscriber.trySchedule(FluxPublishOn.java:293)
at reactor.core.publisher.FluxPublishOn$PublishOnSubscriber.request(FluxPublishOn.java:261)
at reactor.core.publisher.FluxBuffer$BufferExactSubscriber.request(FluxBuffer.java:111)
at reactor.core.publisher.FluxConcatMap$ConcatMapImmediate.onSubscribe(FluxConcatMap.java:227)
at reactor.core.publisher.FluxBuffer$BufferExactSubscriber.onSubscribe(FluxBuffer.java:125)
at reactor.core.publisher.FluxPublishOn$PublishOnSubscriber.onSubscribe(FluxPublishOn.java:209)
at reactor.core.publisher.FluxConcatArray.subscribe(FluxConcatArray.java:78)
at reactor.core.publisher.FluxPublishOn.subscribe(FluxPublishOn.java:108)
at reactor.core.publisher.FluxBuffer.subscribe(FluxBuffer.java:72)
at reactor.core.publisher.FluxConcatMap.subscribe(FluxConcatMap.java:121)
at reactor.core.publisher.Flux.subscribe(Flux.java:6877)
at reactor.core.publisher.FluxMergeSequential.subscribe(FluxMergeSequential.java:99)
at reactor.core.publisher.FluxFilter.subscribe(FluxFilter.java:52)
at reactor.core.publisher.FluxMap.subscribe(FluxMap.java:62)
at reactor.core.publisher.FluxPeek.subscribe(FluxPeek.java:83)
at reactor.core.publisher.MonoCollectList.subscribe(MonoCollectList.java:59)
at reactor.core.publisher.MonoMapFuseable.subscribe(MonoMapFuseable.java:59)
at reactor.core.publisher.MonoOnErrorResume.subscribe(MonoOnErrorResume.java:44)
at reactor.core.publisher.Mono.subscribe(Mono.java:3080)
at reactor.core.publisher.MonoZip.subscribe(MonoZip.java:128)
at reactor.core.publisher.MonoFlatMap.subscribe(MonoFlatMap.java:60)
at reactor.core.publisher.Mono.subscribe(Mono.java:3080)
at reactor.core.publisher.FluxFlatMap$FlatMapMain.onNext(FluxFlatMap.java:372)
at reactor.core.publisher.FluxGroupBy$UnicastGroupedFlux.drainRegular(FluxGroupBy.java:554)
at reactor.core.publisher.FluxGroupBy$UnicastGroupedFlux.drain(FluxGroupBy.java:630)
at reactor.core.publisher.FluxGroupBy$UnicastGroupedFlux.subscribe(FluxGroupBy.java:696)
at reactor.core.publisher.FluxFlatMap.subscribe(FluxFlatMap.java:97)
at reactor.core.publisher.Flux.subscribe(Flux.java:6877)
at reactor.core.publisher.FluxFlatMap$FlatMapMain.onNext(FluxFlatMap.java:372)
at reactor.core.publisher.FluxGroupBy$GroupByMain.drainLoop(FluxGroupBy.java:380)
at reactor.core.publisher.FluxGroupBy$GroupByMain.drain(FluxGroupBy.java:316)
at reactor.core.publisher.FluxGroupBy$GroupByMain.onNext(FluxGroupBy.java:201)
at reactor.core.publisher.FluxIterable$IterableSubscription.slowPath(FluxIterable.java:244)
at reactor.core.publisher.FluxIterable$IterableSubscription.request(FluxIterable.java:202)
at reactor.core.publisher.FluxGroupBy$GroupByMain.onSubscribe(FluxGroupBy.java:165)
at reactor.core.publisher.FluxIterable.subscribe(FluxIterable.java:140)
at reactor.core.publisher.FluxIterable.subscribe(FluxIterable.java:64)
at reactor.core.publisher.FluxGroupBy.subscribe(FluxGroupBy.java:82)
at reactor.core.publisher.FluxFlatMap.subscribe(FluxFlatMap.java:97)
at reactor.core.publisher.MonoCollect.subscribe(MonoCollect.java:66)
at reactor.core.publisher.MonoMapFuseable.subscribe(MonoMapFuseable.java:59)
at reactor.core.publisher.MonoOnAssembly.subscribe(MonoOnAssembly.java:76)
at reactor.core.publisher.Mono.subscribe(Mono.java:3080)
at reactor.core.publisher.MonoSubscribeOn$SubscribeOnSubscriber.run(MonoSubscribeOn.java:123)
at reactor.core.scheduler.WorkerTask.call(WorkerTask.java:84)
at reactor.core.scheduler.WorkerTask.call(WorkerTask.java:37)
at io.micrometer.core.instrument.AbstractTimer.recordCallable(AbstractTimer.java:143)
at io.micrometer.core.instrument.Timer.lambda$wrap$1(Timer.java:137)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.util.concurrent.RejectedExecutionException: Scheduler unavailable
at reactor.core.Exceptions.<clinit>(Exceptions.java:502)
at reactor.core.publisher.Operators.onOperatorError(Operators.java:345)
at reactor.core.publisher.Operators.onOperatorError(Operators.java:323)
at reactor.core.publisher.Operators.onOperatorError(Operators.java:305)
at reactor.core.publisher.MonoError.subscribe(MonoError.java:53)
at reactor.core.publisher.Mono.subscribe(Mono.java:3080)
at reactor.core.publisher.FluxSwitchIfEmpty$SwitchIfEmptySubscriber.onComplete(FluxSwitchIfEmpty.java:75)
at reactor.core.publisher.FluxSwitchIfEmpty$SwitchIfEmptySubscriber.onComplete(FluxSwitchIfEmpty.java:78)
at reactor.core.publisher.Operators.complete(Operators.java:128)
at reactor.core.publisher.MonoEmpty.subscribe(MonoEmpty.java:45)
at reactor.core.publisher.Mono.subscribe(Mono.java:3080)
我不知道为什么会引发此异常。
无论如何,在调试此问题时,我只看到一次抛出异常的案例。 (我不确定这是真正的原因。)
在ExecutorSchedulerWorker::schedule
中,!tasks.add(r)
语句的评估结果为true,因此引发了异常。
ExecutorTrackedRunnable r = new ExecutorTrackedRunnable(task, this, true);
if (!tasks.add(r)) {
throw Exceptions.failWithRejected();
}
这是我现在才知道的线索。
有人知道这个问题吗?任何建议都可以帮助我。
编辑1 。添加提到的代码。这是处理我的自定义异常的帮助程序代码
public static <T> Function<Mono<T>, Publisher<T>> defaultIfNotFoundOrError(T defaultValue) {
return source -> source.onErrorResume(ReactionStorageException.class,
e -> {
if (e.getErrorCode() == ReactionStorageErrorCode.NOT_FOUND) {
return Mono.just(defaultValue);
} else {
return Mono.error(e);
}
});
}
然后findOne()
public Mono<Optional<Like>> findOne(final RequesterContext context,
final String parentId,
final int sn,
final boolean handleFaulted) {
Preconditions.checkArgument(!Strings.isNullOrEmpty(parentId),
"parentId must not be an empty value");
Preconditions.checkArgument(sn >= StorageConstants.BASE_SN,
"Serial number must be greater than BASE_SN value");
final String likeInfoKey = RedisKeys.reactionInfo(reactionType, parentId, sn);
return cmds.hgetall(likeInfoKey)
.publishOn(redisScheduler)
.flatMap(m -> Mono.justOrEmpty(LikeRedisMapper.from(m)))
.switchIfEmpty(ReactiveHelpers.mapOrEmpty(handleFaulted,
requestFaultedLike(context, parentId, sn)))
.switchIfEmpty(ExceptionUtils.generate(ReactionStorageErrorCode.NOT_FOUND,
"Like(%s, %d) cannot be found",
parentId, sn))
.map(Optional::ofNullable)
.doOnError(e -> log.trace("Failed to find a like", e));
}
更新2 。经过深度调试后,由于取消了源,因此引发了异常。此取消是由Mono.zip(A,B,C ...)引起的。上方的来源是B。如果A是空来源,则应取消B。但偶尔在收到取消信号后B请求正在处理。