Name | hadcm3n_o6m0_1980_40_008389009_0 |
Workunit | 8539868 |
Created | 3 Jun 2013, 15:19:11 UTC |
Sent | 5 Jun 2013, 12:08:11 UTC |
Report deadline | 4 Sep 2013, 19:35:22 UTC |
Received | 6 Jun 2013, 16:32:09 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 23 hours 53 min 38 sec |
CPU time | 23 hours 13 min 42 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 1.99 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 14:35:29 (16343): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:30:11 (16898): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:34:28 (17465): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:34:29 (17465): No heartbeat from core client for 30 sec - exiting 15:34:30 (17465): No heartbeat from core client for 30 sec - exiting 15:34:31 (17465): No heartbeat from core client for 30 sec - exiting 15:34:32 (17465): No heartbeat from core client for 30 sec - exiting 15:34:33 (17465): No heartbeat from core client for 30 sec - exiting 15:34:34 (17465): No heartbeat from core client for 30 sec - exiting 15:34:35 (17465): No heartbeat from core client for 30 sec - exiting 15:34:36 (17465): No heartbeat from core client for 30 sec - exiting 15:34:37 (17465): No heartbeat from core client for 30 sec - exiting 15:34:38 (17465): No heartbeat from core client for 30 sec - exiting 15:34:39 (17465): No heartbeat from core client for 30 sec - exiting 15:34:40 (17465): No heartbeat from core client for 30 sec - exiting 15:34:41 (17465): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 15:38:38 (17580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:16:13 (17734): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:24:24 (18641): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:28:43 (18828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:28:44 (18828): No heartbeat from core client for 30 sec - exiting 17:28:45 (18828): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 17:33:23 (18972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:32:47 (19086): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:28:18 (20181): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:33:06 (20744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:33:07 (20744): No heartbeat from core client for 30 sec - exiting 20:33:08 (20744): No heartbeat from core client for 30 sec - exiting 20:33:09 (20744): No heartbeat from core client for 30 sec - exiting 20:37:32 (20895): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:37:33 (20895): No heartbeat from core client for 30 sec - exiting 20:37:34 (20895): No heartbeat from core client for 30 sec - exiting 20:37:35 (20895): No heartbeat from core client for 30 sec - exiting 20:37:36 (20895): No heartbeat from core client for 30 sec - exiting 20:37:37 (20895): No heartbeat from core client for 30 sec - exiting 20:37:38 (20895): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 20:41:56 (21059): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:41:57 (21059): No heartbeat from core client for 30 sec - exiting 20:41:58 (21059): No heartbeat from core client for 30 sec - exiting 20:41:59 (21059): No heartbeat from core client for 30 sec - exiting 20:42:00 (21059): No heartbeat from core client for 30 sec - exiting 20:42:01 (21059): No heartbeat from core client for 30 sec - exiting 20:42:02 (21059): No heartbeat from core client for 30 sec - exiting 20:42:03 (21059): No heartbeat from core client for 30 sec - exiting 20:42:04 (21059): No heartbeat from core client for 30 sec - exiting 20:42:05 (21059): No heartbeat from core client for 30 sec - exiting 20:42:06 (21059): No heartbeat from core client for 30 sec - exiting 20:42:07 (21059): No heartbeat from core client for 30 sec - exiting 20:42:08 (21059): No heartbeat from core client for 30 sec - exiting 22:04:20 (21215): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:04:21 (21215): No heartbeat from core client for 30 sec - exiting 22:04:22 (21215): No heartbeat from core client for 30 sec - exiting 22:04:23 (21215): No heartbeat from core client for 30 sec - exiting 22:04:24 (21215): No heartbeat from core client for 30 sec - exiting 22:04:25 (21215): No heartbeat from core client for 30 sec - exiting 22:25:00 (21974): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:28:27 (22237): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:32:27 (22325): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:40:27 (22454): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:45:12 (23108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:44:34 (23245): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
03:26:29 (23846): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:30:06 (25280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:42:57 (28820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:47:15 (29543): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:51:14 (29664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:51:15 (29664): No heartbeat from core client for 30 sec - exiting 10:51:16 (29664): No heartbeat from core client for 30 sec - exiting 10:55:35 (29791): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:00:05 (29928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:49:43 (30077): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:12:02 (31082): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:12:03 (31082): No heartbeat from core client for 30 sec - exiting 13:12:04 (31082): No heartbeat from core client for 30 sec - exiting 13:12:05 (31082): No heartbeat from core client for 30 sec - exiting 14:49:54 (31362): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:54:47 (32320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
14:54:48 (32320): No heartbeat from core client for 30 sec - exiting 14:54:49 (32320): No heartbeat from core client for 30 sec - exiting 14:54:50 (32320): No heartbeat from core client for 30 sec - exiting 14:54:51 (32320): No heartbeat from core client for 30 sec - exiting 14:54:52 (32320): No heartbeat from core client for 30 sec - exiting 14:54:53 (32320): No heartbeat from core client for 30 sec - exiting 14:54:54 (32320): No heartbeat from core client for 30 sec - exiting 14:54:55 (32320): No heartbeat from core client for 30 sec - exiting 14:54:56 (32320): No heartbeat from core client for 30 sec - exiting 14:54:57 (32320): No heartbeat from core client for 30 sec - exiting 14:54:58 (32320): No heartbeat from core client for 30 sec - exiting 14:54:59 (32320): No heartbeat from core client for 30 sec - exiting 14:55:00 (32320): No heartbeat from core client for 30 sec - exiting 14:55:01 (32320): No heartbeat from core client for 30 sec - exiting 14:55:02 (32320): No heartbeat from core client for 30 sec - exiting 14:55:03 (32320): No heartbeat from core client for 30 sec - exiting 14:55:04 (32320): No heartbeat from core client for 30 sec - exiting 14:55:05 (32320): No heartbeat from core client for 30 sec - exiting 14:58:55 (32473): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
15:07:23 (32620): No heartbeat from core client for 30 sec - exiting 15:07:24 (32620): No heartbeat from core client for 30 sec - exiting 15:07:25 (32620): No heartbeat from core client for 30 sec - exiting 15:07:26 (32620): No heartbeat from core client for 30 sec - exiting 15:07:27 (32620): No heartbeat from core client for 30 sec - exiting 15:07:28 (32620): No heartbeat from core client for 30 sec - exiting 15:07:29 (32620): No heartbeat from core client for 30 sec - exiting 15:07:30 (32620): No heartbeat from core client for 30 sec - exiting 15:07:31 (32620): No heartbeat from core client for 30 sec - exiting 15:07:32 (32620): No heartbeat from core client for 30 sec - exiting 15:07:33 (32620): No heartbeat from core client for 30 sec - exiting 15:07:34 (32620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:12:05 (32795): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:51:44 (32929): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:59:59 (33340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:00:00 (33340): No heartbeat from core client for 30 sec - exiting 16:00:01 (33340): No heartbeat from core client for 30 sec - exiting 16:00:02 (33340): No heartbeat from core client for 30 sec - exiting 16:00:03 (33340): No heartbeat from core client for 30 sec - exiting 16:04:39 (33527): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
16:04:40 (33527): No heartbeat from core client for 30 sec - exiting 16:04:41 (33527): No heartbeat from core client for 30 sec - exiting 16:04:42 (33527): No heartbeat from core client for 30 sec - exiting 16:04:43 (33527): No heartbeat from core client for 30 sec - exiting 16:04:44 (33527): No heartbeat from core client for 30 sec - exiting 16:50:40 (33662): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:54:33 (34190): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:59:02 (34334): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:59:03 (34334): No heartbeat from core client for 30 sec - exiting 16:59:04 (34334): No heartbeat from core client for 30 sec - exiting 16:59:05 (34334): No heartbeat from core client for 30 sec - exiting 16:59:06 (34334): No heartbeat from core client for 30 sec - exiting 16:59:07 (34334): No heartbeat from core client for 30 sec - exiting 16:59:08 (34334): No heartbeat from core client for 30 sec - exiting 16:59:09 (34334): No heartbeat from core client for 30 sec - exiting 16:59:10 (34334): No heartbeat from core client for 30 sec - exiting 16:59:11 (34334): No heartbeat from core client for 30 sec - exiting 16:59:12 (34334): No heartbeat from core client for 30 sec - exiting 16:59:13 (34334): No heartbeat from core client for 30 sec - exiting 16:59:14 (34334): No heartbeat from core client for 30 sec - exiting 16:59:15 (34334): No heartbeat from core client for 30 sec - exiting 16:59:16 (34334): No heartbeat from core client for 30 sec - exiting 16:59:17 (34334): No heartbeat from core client for 30 sec - exiting 16:59:18 (34334): No heartbeat from core client for 30 sec - exiting 16:59:19 (34334): No heartbeat from core client for 30 sec - exiting 16:59:20 (34334): No heartbeat from core client for 30 sec - exiting 16:59:21 (34334): No heartbeat from core client for 30 sec - exiting 
16:59:22 (34334): No heartbeat from core client for 30 sec - exiting 16:59:23 (34334): No heartbeat from core client for 30 sec - exiting 16:59:24 (34334): No heartbeat from core client for 30 sec - exiting 16:59:25 (34334): No heartbeat from core client for 30 sec - exiting 16:59:26 (34334): No heartbeat from core client for 30 sec - exiting 16:59:27 (34334): No heartbeat from core client for 30 sec - exiting 16:59:28 (34334): No heartbeat from core client for 30 sec - exiting 17:04:09 (34486): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:04:10 (34486): No heartbeat from core client for 30 sec - exiting 17:04:11 (34486): No heartbeat from core client for 30 sec - exiting 17:04:12 (34486): No heartbeat from core client for 30 sec - exiting 17:04:13 (34486): No heartbeat from core client for 30 sec - exiting 17:04:14 (34486): No heartbeat from core client for 30 sec - exiting 17:08:25 (34630): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:08:26 (34630): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7737400] [0xf7737425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75541df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7557825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf753f4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34775, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf770c400] [0xf770c425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75291df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf752c825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75144d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34775, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf779e400] [0xf779e425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75bb1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75be825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75a64d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34775, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7714400] [0xf7714425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75311df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7534825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf751c4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34775, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77db400] [0xf77db425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75f81df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75fb825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75e34d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34775, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77a1400] [0xf77a1425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75be1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c1825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75a94d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34775, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
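The stderr header above reports the same exit code in three forms: 22, 0x16 and -234. The sketch below checks that they agree, assuming the third figure is the code minus 256. That assumption matches the numbers in this log, but the client's actual formatting logic is not shown here. `exit_code_forms` is a hypothetical helper, not part of BOINC.

```python
from typing import Tuple


def exit_code_forms(code: int) -> Tuple[str, int]:
    """Return the hex form and the (code - 256) form of an 8-bit exit code.

    Assumption: the negative figure in the log is the exit code offset by -256.
    """
    if not 0 <= code <= 255:
        raise ValueError("expected an exit code in the range 0..255")
    return hex(code), code - 256


# Exit status taken from the "Exit status" row and the <message> element.
hex_form, offset_form = exit_code_forms(22)
print(hex_form, offset_form)
```

For exit status 22 this gives the hex form `0x16` and the offset form `-234`, matching the `<message>` element.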
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
06 Jun 2013 09:26:35 | 1282401 | 15825307 | hadcm3n_o6m0_1980_40_008389009_0 | 25,920 | 63,316 | 2.4427 |
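
The Average (sec/TS) column appears to be the CPU time divided by the timestep count. The sketch below checks this against the one trickle row above. `average_sec_per_timestep` is a hypothetical helper name, and the rounding to four decimal places is inferred from the displayed value rather than documented by the project.

```python
def average_sec_per_timestep(cpu_time_sec: float, timesteps: int) -> float:
    """Return the mean CPU seconds spent per model timestep."""
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return cpu_time_sec / timesteps


# Values from the trickle row: Timestep = 25,920 and CPU Time = 63,316 sec.
avg = average_sec_per_timestep(63_316, 25_920)
print(f"{avg:.4f}")
```

For this row the result rounds to 2.4427 sec/TS, which agrees with the table.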
©2024 climateprediction.net