Name | hadcm3n_3dgl_1940_40_008258207_4 |
Workunit | 8413331 |
Created | 28 Feb 2013, 2:33:54 UTC |
Sent | 28 Feb 2013, 2:34:23 UTC |
Report deadline | 30 May 2013, 10:01:34 UTC |
Received | 8 Jun 2013, 5:25:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1166383 |
Run time | 97 days 21 hours 46 min 56 sec |
CPU time | 89 days 22 hours 50 min 51 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 0.76 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5756, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5008, iMonCtr=1 Model crash detected, will try to restart... 04:57:23 (6016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:29:16 (2420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Ocean Restart file copy failed on 3dglko.daf31c0 21:00:56 (4724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:31:49 (7908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:32:44 (7328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:00:34 (5340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:32:03 (4168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5968, iMonCtr=1 Model crash detected, will try to restart... 00:14:23 (5152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7352, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5704, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 21:52:01 (6708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7976, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 17:30:56 (2820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:55:42 (6384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:25:41 (5868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6848, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3848, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4936, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Jun 2013 04:26:51 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 777,600 | 7,771,831 | 9.9946 |
03 Jun 2013 23:39:14 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 751,680 | 7,449,633 | 9.9106 |
30 May 2013 20:08:43 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 725,760 | 7,131,107 | 9.8257 |
26 May 2013 18:40:30 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 699,840 | 6,812,188 | 9.7339 |
22 May 2013 09:03:28 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 673,920 | 6,493,242 | 9.6350 |
18 May 2013 05:22:22 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 648,000 | 6,167,537 | 9.5178 |
14 May 2013 00:35:36 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 622,080 | 5,853,323 | 9.4093 |
09 May 2013 21:17:25 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 596,160 | 5,553,077 | 9.3147 |
06 May 2013 01:11:13 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 570,240 | 5,249,069 | 9.2050 |
02 May 2013 04:14:35 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 544,320 | 4,942,897 | 9.0809 |
28 Apr 2013 04:34:46 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 518,400 | 4,646,658 | 8.9635 |
24 Apr 2013 14:50:33 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 492,480 | 4,364,618 | 8.8625 |
20 Apr 2013 21:28:26 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 466,560 | 4,089,822 | 8.7659 |
17 Apr 2013 13:30:00 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 440,640 | 3,823,838 | 8.6779 |
14 Apr 2013 06:55:58 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 414,720 | 3,561,635 | 8.5880 |
11 Apr 2013 02:11:59 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 388,800 | 3,302,722 | 8.4947 |
07 Apr 2013 18:09:20 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 362,880 | 3,057,662 | 8.4261 |
04 Apr 2013 19:32:48 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 336,960 | 2,821,527 | 8.3735 |
01 Apr 2013 20:12:49 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 311,040 | 2,584,116 | 8.3080 |
29 Mar 2013 20:28:22 | 1166383 | 15643173 | hadcm3n_3dgl_1940_40_008258207_4 | 285,120 | 2,346,326 | 8.2293 |
©2024 cpdn.org