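Every few days - maybe once a week - mongodb crashes at random without logging anything meaningful in mongodb.log. It works fine, then simply stops responding.
The last few lines in the file look perfectly ordinary: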
2016-05-31T20:41:59.273+0000 I COMMAND [conn146] query prod.user query: { username: "anonymousUser" } planSummary: COLLSCAN ntoskip:0 nscanned:0 nscannedObjects:328 keyUpdates:0 writeConflicts:0 numYields:21 nreturned:0 reslen:20 locks:{ Global: { acquireCount: { r: 44 } }, MMAPV1Journal: { acquireCount: { r: 22 } }, Database: { acquireCount: { r: 22 } }, Collection: { acquireCount: { R: 22 } } } 7049ms
2016-05-31T20:42:00.665+0000 I COMMAND [conn124] command admin.$cmd command: buildInfo { buildinfo: 1 } keyUpdates:0 writeConflicts:0 numYields:0 reslen:813 locks:{} 400ms
2016-05-31T20:42:01.694+0000 I COMMAND [conn150] query prod.announcements planSummary: COLLSCAN ntoreturn:0 ntoskip:0 nscanned:0 nscannedObjects:1 keyUpdates:0 writeConflicts:0 numYields:3 nreturned:1 reslen:433 locks:{ Global: { acquireCount: { r: 8 } }, MMAPV1Journal: { acquireCount: { r: 4 } }, Database: { acquireCount: { r: 4 } }, Collection: { acquireCount: { R: 4 } } } 2470ms
I did not have kernel logging enabled - I have only just activated it.
In /var/log/messages I have this line at 20:42:04, i.e. 3 seconds after the last mongo query was logged:
May 31 20:42:04 ip-172-31-6-27 kernel: [213213.126793] traps: mongod[9632] trap int3 ip:1263bc5 sp:7fb5247a4250 error:0
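From what I have read, a trap is a kind of interrupt, but I can't really work out what this one means. Did the system just tell mongo to stop? Is there any way to find out what happened?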
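For anyone who wants to pick that line apart, here is a minimal sketch that splits it into its fields. It assumes the layout shown above (process name, pid, trap name, then hexadecimal `ip`, `sp` and `error` values); the name `parse_trap_line` is made up for illustration.

```python
import re

# Matches the kernel "traps:" line in the form seen in /var/log/messages above.
# Assumption: ip, sp and error are printed as bare hexadecimal numbers.
TRAP_RE = re.compile(
    r"traps: (?P<comm>\S+)\[(?P<pid>\d+)\] trap (?P<trap>.+?) "
    r"ip:(?P<ip>[0-9a-f]+) sp:(?P<sp>[0-9a-f]+) error:(?P<error>[0-9a-f]+)"
)


def parse_trap_line(line):
    """Return the fields of a kernel trap line as a dict, or None if no match."""
    m = TRAP_RE.search(line)
    if m is None:
        return None
    return {
        "comm": m.group("comm"),             # process name, e.g. mongod
        "pid": int(m.group("pid")),          # process id
        "trap": m.group("trap"),             # trap name, e.g. int3
        "ip": int(m.group("ip"), 16),        # instruction pointer
        "sp": int(m.group("sp"), 16),        # stack pointer
        "error": int(m.group("error"), 16),  # error code
    }
```

The `ip` value is the instruction pointer at the time of the trap. It might be possible to map it back to a function with a tool such as `addr2line`, assuming the mongod binary still has its symbols.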
EDIT: I forgot to add some information about my setup:
OS information:
NAME="Amazon Linux AMI"
VERSION="2015.03"
ID="amzn"
ID_LIKE="rhel fedora"
VERSION_ID="2015.03"
PRETTY_NAME="Amazon Linux AMI 2015.03"
ANSI_COLOR="0;33"
CPE_NAME="cpe:/o:amazon:linux:2015.03:ga"
HOME_URL="http://aws.amazon.com/amazon-linux-ami/"
Amazon Linux AMI release 2015.03
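The db size is about 60 MB.
EDIT 2: I ran db.serverStatus() (after restarting the instance, so I'm not sure it is of any real help in debugging the problem) and saved the result here
EDIT 3: I tried looking in dmesg for any memory problems: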
[ec2-user@ip-172-31-6-27 log]$ cat dmesg | grep memory
[ 0.000000] Base memory trampoline at [ffff880000098000] 98000 size 24576
[ 0.000000] init_memory_mapping: [mem 0x00000000-0x000fffff]
[ 0.000000] init_memory_mapping: [mem 0x3fe00000-0x3fffffff]
[ 0.000000] init_memory_mapping: [mem 0x3c000000-0x3fdfffff]
[ 0.000000] init_memory_mapping: [mem 0x00100000-0x3bffffff]
[ 0.000000] Early memory node ranges
[ 0.000000] please try 'cgroup_disable=memory' option if you don't want memory cgroups
[ 0.028161] Initializing cgroup subsys memory
[ 0.627295] Freeing initrd memory: 13968K (ffff88003724c000 - ffff880037ff0000)
[ 1.584023] Freeing unused kernel memory: 1088K (ffffffff81ad4000 - ffffffff81be4000)
[ 1.591756] Freeing unused kernel memory: 1416K (ffff88000149e000 - ffff880001600000)
[ 1.598026] Freeing unused kernel memory: 1928K (ffff88000181e000 - ffff880001a00000)
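The plain `grep memory` above only matches boot-time messages. A narrower filter could be sketched as follows, on the assumption that the kernel's OOM killer, had it fired, would have logged one of its usual phrases ("Out of memory", "oom-killer", "Killed process"). The helper name `find_oom_lines` is hypothetical.

```python
import re

# Phrases the kernel OOM killer typically writes; matched case-insensitively.
OOM_RE = re.compile(r"out of memory|oom-killer|killed process", re.IGNORECASE)


def find_oom_lines(lines):
    """Return the lines (without trailing newline) that look like OOM-killer output."""
    return [line.rstrip("\n") for line in lines if OOM_RE.search(line)]
```

It accepts any iterable of lines, so an open file handle for dmesg or /var/log/messages can be passed in directly. An empty result would only suggest that the OOM killer was not involved; it would not rule out other memory problems.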
Thanks for any help you can give here :)