Name | hadam3p_anz_raxc_2012_1_008743774_2 |
Workunit | 8889752 |
Created | 11 May 2014, 17:38:20 UTC |
Sent | 11 May 2014, 18:07:09 UTC |
Report deadline | 23 Apr 2015, 23:27:09 UTC |
Received | 16 Jun 2014, 5:56:43 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 194 (0x000000C2) EXIT_ABORTED_BY_CLIENT |
Computer ID | 1316397 |
Run time | 12 days 15 hours 26 min 58 sec |
CPU time | 11 days 13 hours 19 min 33 sec |
Validate state | Invalid |
Credit | 5,974.74 |
Device peak FLOPS | 2.89 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=2 03:52:33 (4828): No heartbeat from core client for 30 sec - exiting 03:52:34 (4828): No heartbeat from core client for 30 sec - exiting 03:52:35 (4828): No heartbeat from core client for 30 sec - exiting 03:52:36 (4828): No heartbeat from core client for 30 sec - exiting 03:52:37 (4828): No heartbeat from core client for 30 sec - exiting 03:52:38 (4828): No heartbeat from core client for 30 sec - exiting 03:52:39 (4828): No heartbeat from core client for 30 sec - exiting 03:52:40 (4828): No heartbeat from core client for 30 sec - exiting 03:52:41 (4828): No heartbeat from core client for 30 sec - exiting 03:52:42 (4828): No heartbeat from core client for 30 sec - exiting 03:52:43 (4828): No heartbeat from core client for 30 sec - exiting 03:52:44 (4828): No heartbeat from core client for 30 sec - exiting 03:52:45 (4828): No heartbeat from core client for 30 sec - exiting 03:52:46 (4828): No heartbeat from core client for 30 sec - exiting 03:52:47 (4828): No heartbeat from core client for 30 sec - exiting 03:52:48 (4828): No heartbeat from core client for 30 sec - exiting 03:52:49 (4828): No heartbeat from core client for 30 sec - exiting 03:52:50 (4828): No heartbeat from core client for 30 sec - exiting 03:52:51 (4828): No heartbeat from core client for 30 sec - exiting 03:52:52 (4828): No heartbeat from core client for 30 sec - exiting 03:52:53 (4828): No heartbeat from core client for 30 sec - exiting 03:52:54 (4828): No heartbeat from core client for 30 sec - exiting 03:52:55 (4828): No heartbeat from core client for 30 sec - exiting 03:52:56 (4828): No heartbeat from core client for 30 sec - exiting 03:52:57 (4828): No heartbeat from core client for 30 sec - exiting 03:52:58 (4828): No heartbeat from core client for 30 sec - exiting 03:52:59 (4828): No heartbeat from core client 
for 30 sec - exiting 03:53:00 (4828): No heartbeat from core client for 30 sec - exiting 03:53:01 (4828): No heartbeat from core client for 30 sec - exiting 03:53:02 (4828): No heartbeat from core client for 30 sec - exiting 03:53:03 (4828): No heartbeat from core client for 30 sec - exiting 03:53:04 (4828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=156, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5544, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> finish file present too long </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
16 Jun 2014 05:58:07 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 138,539 | 997,848 | 7.2027 |
02 Jun 2014 18:48:48 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 127,019 | 919,665 | 7.2404 |
01 Jun 2014 18:59:05 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 115,499 | 837,678 | 7.2527 |
31 May 2014 16:48:48 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 103,979 | 753,806 | 7.2496 |
30 May 2014 13:49:04 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 92,459 | 669,118 | 7.2369 |
28 May 2014 12:56:47 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 80,939 | 584,977 | 7.2274 |
27 May 2014 12:16:46 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 69,419 | 501,027 | 7.2174 |
26 May 2014 11:48:34 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 57,899 | 417,266 | 7.2068 |
25 May 2014 12:12:08 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 46,379 | 333,869 | 7.1987 |
24 May 2014 11:11:59 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 34,859 | 250,497 | 7.1860 |
13 May 2014 22:18:56 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 23,339 | 167,078 | 7.1587 |
12 May 2014 19:13:08 | 1316397 | 16635514 | hadam3p_anz_raxc_2012_1_008743774_2 | 11,819 | 83,752 | 7.0862 |
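
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The page does not state this formula, so the sketch below is an inference checked against three rows copied from the table. The function name `seconds_per_timestep` is a hypothetical helper, not part of BOINC or CPDN.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
rows = [
    (138_539, 997_848, 7.2027),
    (127_019, 919_665, 7.2404),
    (11_819, 83_752, 7.0862),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per timestep, rounded to 4 decimals.

    Assumption: this is how the page derives its "Average (sec/TS)" column.
    """
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in rows:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Print computed and reported values side by side for comparison.
    print(f"{timestep:>7} {computed:.4f} {reported:.4f}")
```

Because the average is cumulative, it moves only slightly between trickles, which is consistent with the values staying close to 7.2 sec/TS throughout the run.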