Name | hadcm3n_o3ye_1940_40_008383445_0 |
Workunit | 8534304 |
Created | 1 Jun 2013, 8:16:50 UTC |
Sent | 1 Jun 2013, 12:13:35 UTC |
Report deadline | 31 Aug 2013, 19:40:46 UTC |
Received | 6 Jun 2013, 8:45:26 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1134931 |
Run time | 3 days 23 hours 23 min 50 sec |
CPU time | 3 days 18 hours 50 min 3 sec |
Validate state | Invalid |
Credit | 2,177.28 |
Device peak FLOPS | 3.11 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:14:40 (4178): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:49:05 (32138): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:01:05 (5425): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:13:56 (5705): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:14:29 (5705): No heartbeat from core client for 30 sec - exiting 14:14:30 (5705): No heartbeat from core client for 30 sec - exiting 14:14:31 (5705): No heartbeat from core client for 30 sec - exiting 14:14:32 (5705): No heartbeat from core client for 30 sec - exiting 14:14:33 (5705): No heartbeat from core client for 30 sec - exiting 14:14:34 (5705): No heartbeat from core client for 30 sec - exiting 14:14:35 (5705): No heartbeat from core client for 30 sec - exiting 14:14:36 (5705): No heartbeat from core client for 30 sec - exiting 14:14:37 (5705): No heartbeat from core client for 30 sec - exiting 14:14:38 (5705): No heartbeat from core client for 30 sec - exiting 14:14:39 (5705): No heartbeat from core client for 30 sec - exiting 14:14:40 (5705): No heartbeat from core client for 30 sec - exiting 14:14:41 (5705): No heartbeat from core client for 30 sec - exiting 14:14:42 (5705): No heartbeat from core client for 30 sec - exiting 14:14:43 (5705): No heartbeat from core client for 30 sec - exiting 14:14:44 (5705): No heartbeat from core client for 30 sec - exiting 14:14:45 (5705): No heartbeat from core client for 30 sec - exiting 14:14:46 (5705): No heartbeat from core client for 30 sec - exiting 14:14:47 (5705): No heartbeat from core client for 30 sec - exiting 14:14:48 (5705): No heartbeat from core client for 30 sec - exiting 14:14:49 (5705): No heartbeat from core client for 30 sec - exiting 14:14:50 (5705): No heartbeat from core client for 30 sec - exiting 14:14:51 (5705): No heartbeat from core client for 30 sec - exiting 14:14:52 (5705): No heartbeat from core client for 30 sec - exiting 14:14:53 (5705): No heartbeat from core client for 30 sec - exiting 14:14:54 (5705): No heartbeat from core client for 30 sec - exiting 14:14:55 (5705): No heartbeat from core client for 30 sec - exiting 14:14:57 (5705): No heartbeat from core client for 30 sec - exiting 14:14:58 (5705): No heartbeat from core client for 30 sec - exiting 14:14:59 (5705): No heartbeat from core client for 30 sec - exiting 14:15:00 (5705): No heartbeat from core client for 30 sec - exiting 14:15:02 (5705): No heartbeat from core client for 30 sec - exiting 14:15:03 (5705): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:43:11 (5846): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:43:34 (5846): No heartbeat from core client for 30 sec - exiting 15:43:35 (5846): No heartbeat from core client for 30 sec - exiting 15:43:36 (5846): No heartbeat from core client for 30 sec - exiting 15:43:37 (5846): No heartbeat from core client for 30 sec - exiting 15:43:38 (5846): No heartbeat from core client for 30 sec - exiting 15:43:39 (5846): No heartbeat from core client for 30 sec - exiting 15:43:40 (5846): No heartbeat from core client for 30 sec - exiting 15:43:41 (5846): No heartbeat from core client for 30 sec - exiting 15:43:42 (5846): No heartbeat from core client for 30 sec - exiting 15:43:43 (5846): No heartbeat from core client for 30 sec - exiting 15:43:44 (5846): No heartbeat from core client for 30 sec - exiting 15:43:45 (5846): No heartbeat from core client for 30 sec - exiting 15:43:46 (5846): No heartbeat from core client for 30 sec - exiting 15:43:47 (5846): No heartbeat from core client for 30 sec - exiting 15:43:48 (5846): No heartbeat from core client for 30 sec - exiting 15:43:49 (5846): No heartbeat from core client for 30 sec - exiting 15:43:50 (5846): No heartbeat from core client for 30 sec - exiting 15:43:51 (5846): No heartbeat from core client for 30 sec - exiting 15:43:52 (5846): No heartbeat from core client for 30 sec - exiting 15:43:53 (5846): No heartbeat from core client for 30 sec - exiting 15:43:54 (5846): No heartbeat from core client for 30 sec - exiting 15:43:55 (5846): No heartbeat from core client for 30 sec - exiting 15:43:56 (5846): No heartbeat from core client for 30 sec - exiting 15:43:57 (5846): No heartbeat from core client for 30 sec - exiting 15:43:58 (5846): No heartbeat from core client for 30 sec - exiting 15:44:38 (5846): No heartbeat from core client for 30 sec - exiting 15:45:20 (5846): No heartbeat from core client for 30 sec - exiting 15:45:21 (5846): No heartbeat from core client for 30 sec - exiting 15:45:22 (5846): No heartbeat from core client for 30 sec - exiting 15:45:23 (5846): No heartbeat from core client for 30 sec - exiting 15:45:24 (5846): No heartbeat from core client for 30 sec - exiting 15:45:25 (5846): No heartbeat from core client for 30 sec - exiting 15:45:26 (5846): No heartbeat from core client for 30 sec - exiting 15:45:27 (5846): No heartbeat from core client for 30 sec - exiting 15:45:28 (5846): No heartbeat from core client for 30 sec - exiting 15:45:29 (5846): No heartbeat from core client for 30 sec - exiting 15:45:30 (5846): No heartbeat from core client for 30 sec - exiting 15:45:31 (5846): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76ef400] [0xf76ef430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf74f91df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf74fc825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74e44d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6504, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf774a400] [0xf774a430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75541df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7557825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf753f4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6504, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76e3400] [0xf76e3430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf74ed1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf74f0825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74d84d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6504, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf772e400] [0xf772e430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75381df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf753b825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75234d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6504, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76fa400] [0xf76fa430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75041df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7507825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74ef4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6504, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76e0400] [0xf76e0430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf74ea1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf74ed825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74d54d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6504, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
05 Jun 2013 12:21:52 | 1134931 | 15815168 | hadcm3n_o3ye_1940_40_008383445_0 | 181,440 | 322,585 | 1.7779 |
04 Jun 2013 22:00:06 | 1134931 | 15815168 | hadcm3n_o3ye_1940_40_008383445_0 | 155,520 | 276,320 | 1.7767 |
04 Jun 2013 07:54:29 | 1134931 | 15815168 | hadcm3n_o3ye_1940_40_008383445_0 | 129,600 | 230,311 | 1.7771 |
03 Jun 2013 18:49:14 | 1134931 | 15815168 | hadcm3n_o3ye_1940_40_008383445_0 | 103,680 | 183,897 | 1.7737 |
03 Jun 2013 05:05:34 | 1134931 | 15815168 | hadcm3n_o3ye_1940_40_008383445_0 | 77,760 | 137,687 | 1.7707 |
02 Jun 2013 16:16:41 | 1134931 | 15815168 | hadcm3n_o3ye_1940_40_008383445_0 | 51,840 | 91,404 | 1.7632 |
02 Jun 2013 02:26:26 | 1134931 | 15815168 | hadcm3n_o3ye_1940_40_008383445_0 | 25,920 | 45,231 | 1.7450 |
©2024 cpdn.org