$gnatmake -o hello *.adb
gcc -c hello.adb
gnatbind -x hello.ali
gnatlink hello.ali -o hello
$hello
Starting Worker
Starting Worker
Starting Worker
Starting Worker
Starting Worker
Starting Worker
Starting Worker
Starting Worker
Starting Worker
Starting Worker
Worker Completed Work
Starting Worker
Worker Completed Work
Starting Worker
Worker Completed Work
Starting Worker
Worker Completed Work
Starting Worker
Worker Completed Work
Starting Worker
Worker Completed Work
Starting Worker
Worker Completed Work
Starting Worker
Worker Completed Work
Worker Completed Work
Starting Worker
Worker Completed Work
Starting Worker
Starting Worker
Received Stop Signal
Worker Completed Work
Worker Completed Work
Worker Completed Work
Worker Completed Work
Worker Completed Work
Worker Completed Work
Worker Completed Work
Worker Completed Work
Worker Completed Work
Worker Completed Work
Manager is Finished
Program is Done
更改为STATE
后,一个没有大量内存计算的简单Spark流媒体应用就消耗了 17GB 的内存。
集群设置:
YARN资源管理器显示:Mem Total-18GB,vCore Total-4
Spark流媒体应用程序源代码可以在这里找到,并且您看到它并没有太大作用:
Spark Submit命令(通过SSH而不是GCLOUD SDK):
RUNNING
为什么这么简单的应用程序会分配那么多内存?
我使用的是GCP Dataproc默认配置,是否应该修改任何YARN配置?