Name | hadcm3n_4dq8_1980_40_008325028_1 |
Workunit | 8476163 |
Created | 31 Mar 2013, 19:31:33 UTC |
Sent | 31 Mar 2013, 19:32:05 UTC |
Report deadline | 1 Jul 2013, 2:59:16 UTC |
Received | 7 Dec 2013, 15:05:01 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1260232 |
Run time | 10 days 0 hours 34 min 6 sec |
CPU time | 6 days 5 hours 1 min 49 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 1.50 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3312, iMonCtr=1 Model crash detected, will try to restart... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 22:45:34 (3200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:46:25 (4220): No heartbeat from core client for 30 sec - exiting 22:46:26 (4220): No heartbeat from core client for 30 sec - exiting 22:46:27 (4220): No heartbeat from core client for 30 sec - exiting 22:46:29 (4220): No heartbeat from core client for 30 sec - exiting 22:46:30 (4220): No heartbeat from core client for 30 sec - exiting 22:46:31 (4220): No heartbeat from core client for 30 sec - exiting 22:46:32 (4220): No heartbeat from core client for 30 sec - exiting 22:46:33 (4220): No heartbeat from core client for 30 sec - exiting 22:46:34 (4220): No heartbeat from core client for 30 sec - exiting 22:46:35 (4220): No heartbeat from core client for 30 sec - exiting 22:46:36 (4220): No heartbeat from core client for 30 sec - exiting 22:46:37 (4220): No heartbeat from core client for 30 sec - exiting 22:46:38 (4220): No heartbeat from core client for 30 sec - exiting 22:46:40 (4220): No heartbeat from core client for 30 sec - exiting 22:46:41 (4220): No heartbeat from core client for 30 sec - exiting 22:46:42 (4220): No heartbeat from core client for 30 sec - exiting 22:46:43 
(4220): No heartbeat from core client for 30 sec - exiting 22:46:44 (4220): No heartbeat from core client for 30 sec - exiting 22:46:45 (4220): No heartbeat from core client for 30 sec - exiting 22:46:46 (4220): No heartbeat from core client for 30 sec - exiting 22:46:47 (4220): No heartbeat from core client for 30 sec - exiting 22:46:48 (4220): No heartbeat from core client for 30 sec - exiting 22:46:49 (4220): No heartbeat from core client for 30 sec - exiting 22:46:50 (4220): No heartbeat from core client for 30 sec - exiting 22:46:52 (4220): No heartbeat from core client for 30 sec - exiting 22:46:53 (4220): No heartbeat from core client for 30 sec - exiting 22:46:54 (4220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:46:55 (4220): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4064, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... 04:12:13 (3492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:36:19 (3860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:57:00 (3820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... C20:39:46 (3588): No heartbeat from core client for 30 sec - exiting 20:39:47 (3588): No heartbeat from core client for 30 sec - exiting 20:39:48 (3588): No heartbeat from core client for 30 sec - exiting 20:39:49 (3588): No heartbeat from core client for 30 sec - exiting 20:39:50 (3588): No heartbeat from core client for 30 sec - exiting 20:39:51 (3588): No heartbeat from core client for 30 sec - exiting 20:39:52 (3588): No heartbeat from core client for 30 sec - exiting 20:39:53 (3588): No heartbeat from core client for 30 sec - exiting 20:39:54 (3588): No heartbeat from core client for 30 sec - exiting 20:39:56 (3588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:39:57 (3588): No heartbeat from core client for 30 sec - exiting 20:39:58 (3588): No heartbeat from core client for 30 sec - exiting 20:39:59 (3588): No heartbeat from core client for 30 sec - exiting 20:40:00 (3588): No heartbeat from core client for 30 sec - exiting 20:40:01 (3588): No heartbeat from core client for 30 sec - exiting 20:40:02 (3588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=1 Model crash detected, will try to restart... 09:48:42 (4820): No heartbeat from core client for 30 sec - exiting 09:48:43 (4820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:48:44 (4820): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3896, iMonCtr=1 Model crash detected, will try to restart... 11:21:46 (3656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 01:15:07 (4372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 03:45:06 (3548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:22:28 (3524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1 Model crash detected, will try to restart... 12:54:41 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
19:39:53 (3272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3276, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:35:19 (4532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1448, iMonCtr=1 Model crash detected, will try to restart... 10:23:32 (5040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:29:11 (2032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:58:46 (436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:46:20 (4048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4108, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
07 Dec 2013 14:08:38 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 259,200 | 536,507 | 2.0699 |
01 Dec 2013 14:09:23 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 233,280 | 488,659 | 2.0947 |
24 Nov 2013 21:05:19 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 207,360 | 442,085 | 2.1320 |
21 Nov 2013 05:12:15 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 181,440 | 394,578 | 2.1747 |
16 Nov 2013 16:59:33 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 155,520 | 348,044 | 2.2379 |
09 Nov 2013 16:53:55 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 129,600 | 302,193 | 2.3317 |
23 Oct 2013 13:44:32 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 103,680 | 254,810 | 2.4577 |
07 Oct 2013 10:11:13 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 77,760 | 192,796 | 2.4794 |
05 Oct 2013 09:03:58 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 51,840 | 131,877 | 2.5439 |
25 Apr 2013 10:57:51 | 1260232 | 15697722 | hadcm3n_4dq8_1980_40_008325028_1 | 25,920 | 67,493 | 2.6039 |
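
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The page does not state this; it is inferred from the figures in the table. The sketch below recomputes the value for a few rows copied from the table. The function name `sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds, reported average).
trickles = [
    (259_200, 536_507, 2.0699),
    (233_280, 488_659, 2.0947),
    (129_600, 302_193, 2.3317),
    (25_920, 67_493, 2.6039),
]


def sec_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Average CPU seconds per model timestep (assumed formula: CPU time / timestep)."""
    return round(cpu_seconds / timestep, 4)


# Compare the recomputed average with the value reported on the page.
for timestep, cpu_seconds, reported in trickles:
    computed = sec_per_timestep(cpu_seconds, timestep)
    print(f"{timestep:>7} timesteps: computed {computed:.4f}, reported {reported:.4f}")
```

Read this way, the falling average from 2.6039 to 2.0699 sec/TS indicates that later timesteps cost less CPU time than earlier ones. The last trickle's CPU time (536,507 sec) is also within a few seconds of the task's reported CPU time of 6 days 5 hours 1 min 49 sec (536,509 sec).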