Name | hadcm3n_o8ii_1900_40_008466157_1 |
Workunit | 8616996 |
Created | 8 Jan 2014, 0:15:25 UTC |
Sent | 8 Jan 2014, 0:15:33 UTC |
Report deadline | 9 Apr 2014, 7:42:44 UTC |
Received | 25 Apr 2014, 2:15:52 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1309837 |
Run time | 18 days 8 hours 7 min 25 sec |
CPU time | 15 days 5 hours 39 min 12 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 0.91 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5100, iMonCtr=1 Model crash detected, will try to restart... 14:19:04 (4764): No heartbeat from core client for 30 sec - exiting 14:19:05 (4764): No heartbeat from core client for 30 sec - exiting 14:19:06 (4764): No heartbeat from core client for 30 sec - exiting 14:19:08 (4764): No heartbeat from core client for 30 sec - exiting 14:19:09 (4764): No heartbeat from core client for 30 sec - exiting 14:19:10 (4764): No heartbeat from core client for 30 sec - exiting 14:19:11 (4764): No heartbeat from core client for 30 sec - exiting 14:19:12 (4764): No heartbeat from core client for 30 sec - exiting 14:19:13 (4764): No heartbeat from core client for 30 sec - exiting 14:19:14 (4764): No heartbeat from core client for 30 sec - exiting 14:19:15 (4764): No heartbeat from core client for 30 sec - exiting 14:19:16 (4764): No heartbeat from core client for 30 sec - exiting 14:19:17 (4764): No heartbeat from core client for 30 sec - exiting 14:19:18 (4764): No heartbeat from core client for 30 sec - exiting 14:19:20 (4764): No heartbeat from core client for 30 sec - exiting 14:19:21 (4764): No heartbeat from core client for 30 sec - exiting 14:19:22 (4764): No heartbeat from core client for 30 sec - exiting 14:19:23 (4764): No heartbeat from core client for 30 sec - exiting 14:19:24 (4764): No heartbeat from core client for 30 sec - exiting 14:19:25 (4764): No heartbeat from core client for 30 sec - exiting 14:19:26 (4764): No heartbeat from core client for 30 sec - exiting 14:19:27 (4764): No heartbeat from core client for 30 sec - exiting 14:19:28 (4764): No heartbeat from core client for 30 sec - exiting 14:19:29 (4764): No heartbeat from core client for 30 sec - exiting 14:19:30 (4764): No heartbeat from core client for 30 sec - exiting 14:19:32 (4764): No heartbeat from core client for 30 sec - exiting 14:19:33 (4764): No heartbeat from core client for 30 sec - exiting 14:19:34 (4764): No heartbeat from core client for 30 sec - exiting 14:19:35 (4764): No heartbeat from core client for 30 sec - exiting 14:19:36 (4764): No heartbeat from core client for 30 sec - exiting 14:19:37 (4764): No heartbeat from core client for 30 sec - exiting 14:19:38 (4764): No heartbeat from core client for 30 sec - exiting 14:19:39 (4764): No heartbeat from core client for 30 sec - exiting 14:19:40 (4764): No heartbeat from core client for 30 sec - exiting 14:19:41 (4764): No heartbeat from core client for 30 sec - exiting 14:19:42 (4764): No heartbeat from core client for 30 sec - exiting 14:19:44 (4764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:27:23 (4224): No heartbeat from core client for 30 sec - exiting 22:27:24 (4224): No heartbeat from core client for 30 sec - exiting 22:27:26 (4224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:23:36 (4108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:16:51 (4516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:00:41 (2696): No heartbeat from core client for 30 sec - exiting 10:00:42 (2696): No heartbeat from core client for 30 sec - exiting 10:00:44 (2696): No heartbeat from core client for 30 sec - exiting 10:00:45 (2696): No heartbeat from core client for 30 sec - exiting 10:00:46 (2696): No heartbeat from core client for 30 sec - exiting 10:00:47 (2696): No heartbeat from core client for 30 sec - exiting 10:00:48 (2696): No heartbeat from core client for 30 sec - exiting 10:00:49 (2696): No heartbeat from core client for 30 sec - exiting 10:00:50 (2696): No heartbeat from core client for 30 sec - exiting 10:00:51 (2696): No heartbeat from core client for 30 sec - exiting 10:00:52 (2696): No heartbeat from core client for 30 sec - exiting 10:00:53 (2696): No heartbeat from core client for 30 sec - exiting 10:00:55 (2696): No heartbeat from core client for 30 sec - exiting 10:00:56 (2696): No heartbeat from core client for 30 sec - exiting 10:00:57 (2696): No heartbeat from core client for 30 sec - exiting 10:00:58 (2696): No heartbeat from core client for 30 sec - exiting 10:00:59 (2696): No heartbeat from core client for 30 sec - exiting 10:01:00 (2696): No heartbeat from core client for 30 sec - exiting 10:01:01 (2696): No heartbeat from core client for 30 sec - exiting 10:01:02 (2696): No heartbeat from core client for 30 sec - exiting 10:01:03 (2696): No heartbeat from core client for 30 sec - exiting 10:01:04 (2696): No heartbeat from core client for 30 sec - exiting 10:01:05 (2696): No heartbeat from core client for 30 sec - exiting 10:01:07 (2696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Apr 2014 03:38:57 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 518,400 | 1,316,349 | 2.5393 |
14 Apr 2014 02:34:21 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 492,480 | 1,256,800 | 2.5520 |
10 Apr 2014 03:14:48 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 466,560 | 1,195,098 | 2.5615 |
06 Apr 2014 22:46:16 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 440,640 | 1,129,794 | 2.5640 |
02 Apr 2014 22:16:21 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 414,720 | 1,066,859 | 2.5725 |
28 Mar 2014 01:36:07 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 388,800 | 1,006,896 | 2.5898 |
23 Mar 2014 23:41:26 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 362,880 | 944,918 | 2.6039 |
17 Mar 2014 22:42:31 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 336,960 | 882,625 | 2.6194 |
14 Mar 2014 01:37:43 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 311,040 | 818,493 | 2.6315 |
02 Mar 2014 23:26:20 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 285,120 | 754,920 | 2.6477 |
23 Feb 2014 03:00:12 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 259,200 | 694,054 | 2.6777 |
16 Feb 2014 18:53:40 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 233,280 | 619,057 | 2.6537 |
13 Feb 2014 23:39:18 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 207,360 | 550,399 | 2.6543 |
05 Feb 2014 04:50:16 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 181,440 | 482,308 | 2.6582 |
01 Feb 2014 21:41:27 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 155,520 | 415,429 | 2.6712 |
26 Jan 2014 23:35:54 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 129,600 | 347,577 | 2.6819 |
23 Jan 2014 00:33:41 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 103,680 | 279,308 | 2.6939 |
21 Jan 2014 03:03:49 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 77,760 | 213,726 | 2.7485 |
18 Jan 2014 17:56:30 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 51,840 | 147,498 | 2.8453 |
12 Jan 2014 18:24:13 | 1309837 | 16203229 | hadcm3n_o8ii_1900_40_008466157_1 | 25,920 | 74,408 | 2.8707 |
©2024 cpdn.org