Name | hadcm3n_3889_1940_40_008265135_4 |
Workunit | 8420259 |
Created | 23 May 2013, 1:09:01 UTC |
Sent | 23 May 2013, 1:09:34 UTC |
Report deadline | 22 Aug 2013, 8:36:45 UTC |
Received | 30 May 2013, 22:19:47 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 7 days 12 hours 49 min 23 sec |
CPU time | 7 days 8 hours 58 min 44 sec |
Validate state | Invalid |
Credit | 2,488.32 |
Device peak FLOPS | 2.01 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 09:35:20 (7744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:23:50 (13143): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:56:34 (19682): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:54:29 (52383): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:28:03 (10408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:12:21 (14784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:12:22 (14784): No heartbeat from core client for 30 sec - exiting 06:40:35 (37913): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:45:34 (38946): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
06:45:35 (38946): No heartbeat from core client for 30 sec - exiting 06:45:36 (38946): No heartbeat from core client for 30 sec - exiting 06:45:37 (38946): No heartbeat from core client for 30 sec - exiting 06:45:38 (38946): No heartbeat from core client for 30 sec - exiting 06:45:39 (38946): No heartbeat from core client for 30 sec - exiting 06:45:40 (38946): No heartbeat from core client for 30 sec - exiting 06:45:41 (38946): No heartbeat from core client for 30 sec - exiting 06:45:42 (38946): No heartbeat from core client for 30 sec - exiting 06:45:43 (38946): No heartbeat from core client for 30 sec - exiting 06:45:44 (38946): No heartbeat from core client for 30 sec - exiting 06:45:45 (38946): No heartbeat from core client for 30 sec - exiting 06:45:46 (38946): No heartbeat from core client for 30 sec - exiting 06:45:47 (38946): No heartbeat from core client for 30 sec - exiting 06:45:48 (38946): No heartbeat from core client for 30 sec - exiting 06:45:49 (38946): No heartbeat from core client for 30 sec - exiting 09:04:31 (39121): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:36:02 (40763): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:28 (41744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:40:29 (41744): No heartbeat from core client for 30 sec - exiting 10:40:30 (41744): No heartbeat from core client for 30 sec - exiting 10:40:31 (41744): No heartbeat from core client for 30 sec - exiting 10:40:32 (41744): No heartbeat from core client for 30 sec - exiting 10:40:33 (41744): No heartbeat from core client for 30 sec - exiting 10:40:34 (41744): No heartbeat from core client for 30 sec - exiting 10:40:35 (41744): No heartbeat from core client for 30 sec - exiting 10:40:36 (41744): No heartbeat from core client for 30 sec - exiting 10:40:37 (41744): No heartbeat from core client for 30 sec - exiting 10:40:38 (41744): No heartbeat from core client for 30 sec - exiting 10:40:39 (41744): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 10:45:30 (41965): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:03:47 (42115): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:08:55 (44024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:13:25 (44181): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:13:26 (44181): No heartbeat from core client for 30 sec - exiting 14:13:27 (44181): No heartbeat from core client for 30 sec - exiting 14:13:28 (44181): No heartbeat from core client for 30 sec - exiting 14:13:29 (44181): No heartbeat from core client for 30 sec - exiting 14:51:27 (44407): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:51:28 (44407): No heartbeat from core client for 30 sec - exiting 15:01:03 (44917): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
15:01:04 (44917): No heartbeat from core client for 30 sec - exiting 15:01:05 (44917): No heartbeat from core client for 30 sec - exiting 15:01:06 (44917): No heartbeat from core client for 30 sec - exiting 15:05:46 (45124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:05:47 (45124): No heartbeat from core client for 30 sec - exiting 15:05:48 (45124): No heartbeat from core client for 30 sec - exiting 15:05:49 (45124): No heartbeat from core client for 30 sec - exiting 15:05:50 (45124): No heartbeat from core client for 30 sec - exiting 15:05:51 (45124): No heartbeat from core client for 30 sec - exiting 15:05:52 (45124): No heartbeat from core client for 30 sec - exiting 15:05:53 (45124): No heartbeat from core client for 30 sec - exiting 15:05:54 (45124): No heartbeat from core client for 30 sec - exiting 15:05:55 (45124): No heartbeat from core client for 30 sec - exiting 15:05:56 (45124): No heartbeat from core client for 30 sec - exiting 15:05:57 (45124): No heartbeat from core client for 30 sec - exiting 15:05:58 (45124): No heartbeat from core client for 30 sec - exiting 15:05:59 (45124): No heartbeat from core client for 30 sec - exiting 15:06:00 (45124): No heartbeat from core client for 30 sec - exiting 15:06:01 (45124): No heartbeat from core client for 30 sec - exiting 15:06:02 (45124): No heartbeat from core client for 30 sec - exiting 15:06:03 (45124): No heartbeat from core client for 30 sec - exiting 15:06:04 (45124): No heartbeat from core client for 30 sec - exiting 15:42:25 (45343): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:47:25 (45786): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:47:26 (45786): No heartbeat from core client for 30 sec - exiting 16:26:39 (45985): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
16:31:38 (46440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:31:39 (46440): No heartbeat from core client for 30 sec - exiting 16:31:40 (46440): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 16:43:01 (46656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:03:45 (46840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:29:13 (47104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:29:14 (47104): No heartbeat from core client for 30 sec - exiting 17:29:15 (47104): No heartbeat from core client for 30 sec - exiting 17:29:16 (47104): No heartbeat from core client for 30 sec - exiting 17:29:17 (47104): No heartbeat from core client for 30 sec - exiting 17:29:18 (47104): No heartbeat from core client for 30 sec - exiting 17:29:19 (47104): No heartbeat from core client for 30 sec - exiting 17:29:20 (47104): No heartbeat from core client for 30 sec - exiting 17:29:21 (47104): No heartbeat from core client for 30 sec - exiting 17:29:22 (47104): No heartbeat from core client for 30 sec - exiting 17:29:23 (47104): No heartbeat from core client for 30 sec - exiting 17:29:24 (47104): No heartbeat from core client for 30 sec - exiting 17:38:36 (47505): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:38:37 (47505): No heartbeat from core client for 30 sec - exiting 17:38:38 (47505): No heartbeat from core client for 30 sec - exiting 17:38:39 (47505): No heartbeat from core client for 30 sec - exiting 17:38:40 (47505): No heartbeat from core client for 30 sec - exiting 17:38:41 (47505): No heartbeat from core client for 30 sec - exiting 17:38:42 (47505): No heartbeat from core client for 30 sec - exiting 18:19:10 (47723): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:19:11 (47723): No heartbeat from core client for 30 sec - exiting 18:19:12 (47723): No heartbeat from core client for 30 sec - exiting 18:19:13 (47723): No heartbeat from core client for 30 sec - exiting 18:23:44 (48236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:28:22 (48445): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:28:23 (48445): No heartbeat from core client for 30 sec - exiting 18:28:24 (48445): No heartbeat from core client for 30 sec - exiting 18:41:08 (48637): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:41:09 (48637): No heartbeat from core client for 30 sec - exiting 18:41:10 (48637): No heartbeat from core client for 30 sec - exiting 19:51:15 (48849): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:56:00 (49621): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
19:56:01 (49621): No heartbeat from core client for 30 sec - exiting 19:56:02 (49621): No heartbeat from core client for 30 sec - exiting 19:56:03 (49621): No heartbeat from core client for 30 sec - exiting 19:56:04 (49621): No heartbeat from core client for 30 sec - exiting 19:56:05 (49621): No heartbeat from core client for 30 sec - exiting 19:56:06 (49621): No heartbeat from core client for 30 sec - exiting 19:56:07 (49621): No heartbeat from core client for 30 sec - exiting 19:56:08 (49621): No heartbeat from core client for 30 sec - exiting 19:56:09 (49621): No heartbeat from core client for 30 sec - exiting 19:56:10 (49621): No heartbeat from core client for 30 sec - exiting 19:56:11 (49621): No heartbeat from core client for 30 sec - exiting 19:56:12 (49621): No heartbeat from core client for 30 sec - exiting 19:56:13 (49621): No heartbeat from core client for 30 sec - exiting 19:56:14 (49621): No heartbeat from core client for 30 sec - exiting 19:56:15 (49621): No heartbeat from core client for 30 sec - exiting 19:56:16 (49621): No heartbeat from core client for 30 sec - exiting 19:56:17 (49621): No heartbeat from core client for 30 sec - exiting 19:56:18 (49621): No heartbeat from core client for 30 sec - exiting 19:56:19 (49621): No heartbeat from core client for 30 sec - exiting 19:56:20 (49621): No heartbeat from core client for 30 sec - exiting 19:56:21 (49621): No heartbeat from core client for 30 sec - exiting 19:56:22 (49621): No heartbeat from core client for 30 sec - exiting 19:56:23 (49621): No heartbeat from core client for 30 sec - exiting 19:56:24 (49621): No heartbeat from core client for 30 sec - exiting 19:56:25 (49621): No heartbeat from core client for 30 sec - exiting 20:00:34 (49807): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:01:05 (49807): No heartbeat from core client for 30 sec - exiting 21:12:14 (50003): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:17:03 (50770): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:17:04 (50770): No heartbeat from core client for 30 sec - exiting 21:17:05 (50770): No heartbeat from core client for 30 sec - exiting 21:17:06 (50770): No heartbeat from core client for 30 sec - exiting 21:17:07 (50770): No heartbeat from core client for 30 sec - exiting 21:17:08 (50770): No heartbeat from core client for 30 sec - exiting 21:17:09 (50770): No heartbeat from core client for 30 sec - exiting 21:17:10 (50770): No heartbeat from core client for 30 sec - exiting 21:17:11 (50770): No heartbeat from core client for 30 sec - exiting 21:17:12 (50770): No heartbeat from core client for 30 sec - exiting 21:17:13 (50770): No heartbeat from core client for 30 sec - exiting 21:17:14 (50770): No heartbeat from core client for 30 sec - exiting 21:17:15 (50770): No heartbeat from core client for 30 sec - exiting 21:17:16 (50770): No heartbeat from core client for 30 sec - exiting 21:17:17 (50770): No heartbeat from core client for 30 sec - exiting 21:17:18 (50770): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77c6400] [0xf77c6425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75e31df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75e6825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75ce4d3] 
Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50982, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf777e400] [0xf777e425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf759b1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf759e825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75864d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50982, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76e6400] [0xf76e6425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75031df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7506825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74ee4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50982, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7719400] [0xf7719425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75361df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7539825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75214d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50982, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7741400] [0xf7741425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf755e1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7561825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75494d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50982, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf770c400] [0xf770c425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75291df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf752c825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75144d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50982, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 May 2013 05:58:04 | 1282401 | 15793557 | hadcm3n_3889_1940_40_008265135_4 | 207,360 | 594,990 | 2.8694 |
29 May 2013 06:52:21 | 1282401 | 15793557 | hadcm3n_3889_1940_40_008265135_4 | 181,440 | 514,380 | 2.8350 |
28 May 2013 07:53:26 | 1282401 | 15793557 | hadcm3n_3889_1940_40_008265135_4 | 155,520 | 437,711 | 2.8145 |
27 May 2013 12:10:07 | 1282401 | 15793557 | hadcm3n_3889_1940_40_008265135_4 | 129,600 | 367,557 | 2.8361 |
26 May 2013 16:02:24 | 1282401 | 15793557 | hadcm3n_3889_1940_40_008265135_4 | 103,680 | 298,374 | 2.8778 |
25 May 2013 18:04:33 | 1282401 | 15793557 | hadcm3n_3889_1940_40_008265135_4 | 77,760 | 224,897 | 2.8922 |
24 May 2013 20:36:55 | 1282401 | 15793557 | hadcm3n_3889_1940_40_008265135_4 | 51,840 | 149,685 | 2.8874 |
23 May 2013 22:39:21 | 1282401 | 15793557 | hadcm3n_3889_1940_40_008265135_4 | 25,920 | 71,314 | 2.7513 |
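
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the eight rows above. It is only an illustration under that assumption, not the project's own calculation; the names `TRICKLES`, `sec_per_timestep`, and `max_abs_deviation` are hypothetical.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (207_360, 594_990, 2.8694),
    (181_440, 514_380, 2.8350),
    (155_520, 437_711, 2.8145),
    (129_600, 367_557, 2.8361),
    (103_680, 298_374, 2.8778),
    (77_760, 224_897, 2.8922),
    (51_840, 149_685, 2.8874),
    (25_920, 71_314, 2.7513),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition of the Average column: cumulative CPU seconds per timestep."""
    return cpu_time_sec / timestep


def max_abs_deviation(rows):
    """Largest gap between the recomputed average and the reported one."""
    return max(abs(sec_per_timestep(cpu, ts) - avg) for ts, cpu, avg in rows)


if __name__ == "__main__":
    for ts, cpu, avg in TRICKLES:
        print(f"{ts:>7}  {sec_per_timestep(cpu, ts):.4f}  (reported {avg:.4f})")
    print("max deviation:", max_abs_deviation(TRICKLES))
```

Each recomputed value agrees with the reported figure to within the rounding of four decimal places, which is consistent with the assumed definition. The rows also show the timestep counter advancing by 25,920 between consecutive trickles.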