Name | hadam3p_eu_2k6w_1987_1_007428459_0 |
Workunit | 7625962 |
Created | 28 Aug 2011, 20:15:01 UTC |
Sent | 28 Aug 2011, 20:15:09 UTC |
Report deadline | 10 Aug 2012, 1:35:09 UTC |
Received | 9 Sep 2011, 19:27:16 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 1 (0x00000001) Unknown error code |
Computer ID | 959923 |
Run time | 5 days 17 hours 58 min 26 sec |
CPU time | 4 days 13 hours 10 min 10 sec |
Validate state | Invalid |
Credit | 1,591.48 |
Device peak FLOPS | 2.50 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4256, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5640, iMonCtr=2 Leaving CPDN_Main::Monitor... 20:18:15 (4760): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:18:45 (4760): No heartbeat from core client for 30 sec - exiting 20:18:46 (4760): No heartbeat from core client for 30 sec - exiting 20:18:47 (4760): No heartbeat from core client for 30 sec - exiting 20:18:48 (4760): No heartbeat from core client for 30 sec - exiting 20:18:49 (4760): No heartbeat from core client for 30 sec - exiting 21:25:46 (2832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:17:39 (4824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:20:50 (5992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:20:51 (5992): No heartbeat from core client for 30 sec - exiting 00:14:43 (4492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:14:44 (4492): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2320, selfPID=2320, iMonCtr=2 00:14:45 (4492): No heartbeat from core client for 30 sec - exiting 00:14:46 (4492): No heartbeat from core client for 30 sec - exiting 00:18:06 (3056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:21:36 (5084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:24:25 (5404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:27:54 (4936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:32:56 (5396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:37:55 (3844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:37:56 (3844): No heartbeat from core client for 30 sec - exiting 01:26:05 (4028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5712, selfPID=5712, iMonCtr=2 01:28:55 (5920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:30:40 (4824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:30:41 (4824): No heartbeat from core client for 30 sec - exiting 01:34:32 (5032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:34:33 (5032): No heartbeat from core client for 30 sec - exiting 14:20:50 (5928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:21:28 (4244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5740, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1300, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=2728, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5884, selfPID=5884, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5884, selfPID=5328, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Sep 2011 15:28:03 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 92,256 | 384,236 | 4.1649 |
08 Sep 2011 17:18:03 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 80,736 | 337,794 | 4.1839 |
02 Sep 2011 20:24:53 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 69,217 | 291,032 | 4.2046 |
02 Sep 2011 19:23:55 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 69,216 | 290,430 | 4.1960 |
02 Sep 2011 03:37:15 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 57,696 | 243,176 | 4.2148 |
01 Sep 2011 19:17:42 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 46,176 | 195,352 | 4.2306 |
31 Aug 2011 17:40:11 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 34,656 | 147,511 | 4.2564 |
30 Aug 2011 07:50:47 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 23,136 | 99,460 | 4.2989 |
29 Aug 2011 15:23:14 | 959923 | 13308216 | hadam3p_eu_2k6w_1987_1_007428459_0 | 11,616 | 49,481 | 4.2597 |
©2024 cpdn.org