Name | hadam3p_eu_96ma_1970_1_007869471_0 |
Workunit | 8024583 |
Created | 13 Apr 2012, 3:01:32 UTC |
Sent | 13 Apr 2012, 3:01:56 UTC |
Report deadline | 26 Mar 2013, 8:21:56 UTC |
Received | 17 Apr 2012, 18:43:57 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 962831 |
Run time | 2 days 20 hours 4 min 9 sec |
CPU time | 2 days 18 hours 43 min 21 sec |
Validate state | Invalid |
Credit | 1,988.94 |
Device peak FLOPS | 3.10 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:00:02 (85572): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 06:00:03 (85572): No heartbeat from core client for 30 sec - exiting 06:00:04 (85572): No heartbeat from core client for 30 sec - exiting 06:00:05 (85572): No heartbeat from core client for 30 sec - exiting 06:00:06 (85572): No heartbeat from core client for 30 sec - exiting 06:00:07 (85572): No heartbeat from core client for 30 sec - exiting 06:00:08 (85572): No heartbeat from core client for 30 sec - exiting 06:00:09 (85572): No heartbeat from core client for 30 sec - exiting 06:00:10 (85572): No heartbeat from core client for 30 sec - exiting 06:00:11 (85572): No heartbeat from core client for 30 sec - exiting 06:00:12 (85572): No heartbeat from core client for 30 sec - exiting 06:00:13 (85572): No heartbeat from core client for 30 sec - exiting 06:01:07 (89648): No heartbeat from core client for 30 sec - exiting 06:01:08 (89648): No heartbeat from core client for 30 sec - exiting 06:01:09 (89648): No heartbeat from core client for 30 sec - exiting 06:01:10 (89648): No heartbeat from core client for 30 sec - exiting 06:01:11 (89648): No heartbeat from core client for 30 sec - exiting 06:01:12 (89648): No heartbeat from core client for 30 sec - exiting 06:01:13 (89648): No heartbeat from core client for 30 sec - exiting 06:01:14 (89648): No heartbeat from core client for 30 sec - exiting 06:01:15 (89648): No heartbeat from core client for 30 sec - exiting 06:01:16 (89648): No heartbeat from core client for 30 sec - exiting 06:01:17 
(89648): No heartbeat from core client for 30 sec - exiting 06:01:18 (89648): No heartbeat from core client for 30 sec - exiting 06:01:19 (89648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:00:03 (84480): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 
06:00:04 (84480): No heartbeat from core client for 30 sec - exiting 06:00:05 (84480): No heartbeat from core client for 30 sec - exiting 06:00:06 (84480): No heartbeat from core client for 30 sec - exiting 06:00:07 (84480): No heartbeat from core client for 30 sec - exiting 06:00:08 (84480): No heartbeat from core client for 30 sec - exiting 06:00:09 (84480): No heartbeat from core client for 30 sec - exiting 06:00:10 (84480): No heartbeat from core client for 30 sec - exiting 06:00:11 (84480): No heartbeat from core client for 30 sec - exiting 06:00:12 (84480): No heartbeat from core client for 30 sec - exiting 06:00:13 (84480): No heartbeat from core client for 30 sec - exiting 06:02:04 (89648): No heartbeat from core client for 30 sec - exiting 06:02:05 (89648): No heartbeat from core client for 30 sec - exiting 06:02:06 (89648): No heartbeat from core client for 30 sec - exiting 06:02:07 (89648): No heartbeat from core client for 30 sec - exiting 06:02:08 (89648): No heartbeat from core client for 30 sec - exiting 06:02:09 (89648): No heartbeat from core client for 30 sec - exiting 06:02:10 (89648): No heartbeat from core client for 30 sec - exiting 06:02:11 (89648): No heartbeat from core client for 30 sec - exiting 06:02:12 (89648): No heartbeat from core client for 30 sec - exiting 06:02:13 (89648): No heartbeat from core client for 30 sec - exiting 06:02:14 (89648): No heartbeat from core client for 30 sec - exiting 06:02:15 (89648): No heartbeat from core client for 30 sec - exiting 06:02:16 (89648): No heartbeat from core client for 30 sec - exiting 06:02:17 (89648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:04:50 (94052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=93888, selfPID=93888, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 
06:10:18 (85484): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 06:10:19 (85484): No heartbeat from core client for 30 sec - exiting 06:10:20 (85484): No heartbeat from core client for 30 sec - exiting 06:10:21 (85484): No heartbeat from core client for 30 sec - exiting 06:10:22 (85484): No heartbeat from core client for 30 sec - exiting 06:10:23 (85484): No heartbeat from core client for 30 sec - exiting 06:10:24 (85484): No heartbeat from core client for 30 sec - exiting 06:10:25 (85484): No heartbeat from core client for 30 sec - exiting 06:10:26 (85484): No heartbeat from core client for 30 sec - exiting 06:10:27 (85484): No heartbeat from core client for 30 sec - exiting 06:10:28 (85484): No heartbeat from core client for 30 sec - exiting 06:10:29 (85484): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 05:59:56 (85028): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:10:29 (59224): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 06:10:36 (92376): Can't acquire lockfile (32) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:00:02 (92376): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 06:00:03 (92376): No heartbeat from core client for 30 sec - exiting 06:00:04 (92376): No heartbeat from core client for 30 sec - exiting 06:00:05 (92376): No heartbeat from core client for 30 sec - exiting 06:00:06 (92376): No heartbeat from core client for 30 sec - exiting 06:00:07 (92376): No heartbeat from core client for 30 sec - exiting 06:00:08 (92376): No heartbeat from core client for 30 sec - exiting 06:00:09 (92376): No heartbeat from core client for 30 sec - exiting 06:00:10 (92376): No heartbeat from core client for 30 sec - exiting 06:00:11 (92376): No heartbeat from core client for 30 sec - exiting 06:00:12 (92376): No heartbeat from core client for 30 sec - exiting 06:00:13 (92376): No heartbeat from core client for 30 sec - exiting 06:00:21 (85576): Can't acquire lockfile (32) - waiting 35s 06:01:27 (85576): No heartbeat from core client for 30 sec - exiting 06:01:28 (85576): No heartbeat from core client for 30 sec - exiting 06:01:29 (85576): No heartbeat from core client for 30 sec - exiting 06:01:30 (85576): No heartbeat from core client for 30 sec - exiting 06:01:31 (85576): No heartbeat from core client for 30 sec - exiting 06:01:32 (85576): No heartbeat from core client for 30 sec - exiting 06:01:33 (85576): No heartbeat from core client for 30 sec - exiting 06:01:34 (85576): No heartbeat from core client for 30 sec - exiting 06:01:35 (85576): No heartbeat from core client for 30 sec - 
exiting 06:01:36 (85576): No heartbeat from core client for 30 sec - exiting 06:01:37 (85576): No heartbeat from core client for 30 sec - exiting 06:01:38 (85576): No heartbeat from core client for 30 sec - exiting 06:01:39 (85576): No heartbeat from core client for 30 sec - exiting 06:01:40 (85576): No heartbeat from core client for 30 sec - exiting 06:01:41 (85576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:01:42 (85576): No heartbeat from core client for 30 sec - exiting 06:02:34 (65108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=92760, iMonCtr=2 Model crash detected, will try to restart... 
21:02:34 (6532): No heartbeat from core client for 30 sec - exiting 21:02:35 (6532): No heartbeat from core client for 30 sec - exiting 21:02:36 (6532): No heartbeat from core client for 30 sec - exiting 21:02:37 (6532): No heartbeat from core client for 30 sec - exiting 21:02:38 (6532): No heartbeat from core client for 30 sec - exiting 21:02:39 (6532): No heartbeat from core client for 30 sec - exiting 21:02:40 (6532): No heartbeat from core client for 30 sec - exiting 21:02:41 (6532): No heartbeat from core client for 30 sec - exiting 21:02:43 (6532): No heartbeat from core client for 30 sec - exiting 21:02:44 (6532): No heartbeat from core client for 30 sec - exiting 21:02:45 (6532): No heartbeat from core client for 30 sec - exiting 21:02:46 (6532): No heartbeat from core client for 30 sec - exiting 21:02:47 (6532): No heartbeat from core client for 30 sec - exiting 21:02:48 (6532): No heartbeat from core client for 30 sec - exiting 21:02:49 (6532): No heartbeat from core client for 30 sec - exiting 21:02:50 (6532): No heartbeat from core client for 30 sec - exiting 21:02:51 (6532): No heartbeat from core client for 30 sec - exiting 21:02:52 (6532): No heartbeat from core client for 30 sec - exiting 21:02:53 (6532): No heartbeat from core client for 30 sec - exiting 21:02:55 (6532): No heartbeat from core client for 30 sec - exiting 21:02:56 (6532): No heartbeat from core client for 30 sec - exiting 21:02:57 (6532): No heartbeat from core client for 30 sec - exiting 21:03:31 (6532): No heartbeat from core client for 30 sec - exiting 21:03:32 (6532): No heartbeat from core client for 30 sec - exiting 21:03:33 (6532): No heartbeat from core client for 30 sec - exiting 21:03:34 (6532): No heartbeat from core client for 30 sec - exiting 21:03:35 (6532): No heartbeat from core client for 30 sec - exiting 21:03:36 (6532): No heartbeat from core client for 30 sec - exiting 21:03:37 (6532): No heartbeat from core client for 30 sec - exiting 21:03:38 (6532): No 
heartbeat from core client for 30 sec - exiting 21:03:39 (6532): No heartbeat from core client for 30 sec - exiting 21:03:40 (6532): No heartbeat from core client for 30 sec - exiting 21:03:42 (6532): No heartbeat from core client for 30 sec - exiting 21:03:43 (6532): No heartbeat from core client for 30 sec - exiting 21:03:44 (6532): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=248, iMonCtr=1 21:03:45 (6532): No heartbeat from core client for 30 sec - exiting 21:03:46 (6532): No heartbeat from core client for 30 sec - exiting 21:03:47 (6532): No heartbeat from core client for 30 sec - exiting 21:03:48 (6532): No heartbeat from core client for 30 sec - exiting 21:03:49 (6532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=3308, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=512, selfPID=4348, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_96ma_1970_1_007869471_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_96ma_1970_1_007869471_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
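The Stderr field above consists mostly of repeated "No heartbeat from core client" lines. As a reading aid, here is a minimal sketch that tallies those lines by the number in parentheses, which is assumed here to be the reporting process ID. The function name `heartbeat_counts` and the abbreviated `sample` string are illustrative only; the sample reuses a few messages from the log above with the intervening text omitted.

```python
import re
from collections import Counter

# Matches e.g. "06:10:18 (85484): No heartbeat from core client for 30 sec - exiting".
# \s+ is used between words because the log text may be wrapped across lines.
HEARTBEAT = re.compile(
    r"(\d{2}:\d{2}:\d{2})\s+\((\d+)\):\s+No\s+heartbeat\s+from\s+core\s+client"
    r"\s+for\s+30\s+sec\s+-\s+exiting"
)

def heartbeat_counts(stderr_text):
    """Count 'No heartbeat' messages per parenthesised ID (assumed to be a PID)."""
    return Counter(match.group(2) for match in HEARTBEAT.finditer(stderr_text))

# Abbreviated excerpt of messages taken from the Stderr field above.
sample = (
    "06:04:50 (94052): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "06:10:18 (85484): No heartbeat from core client for 30 sec - exiting "
    "Suspended CPDN Monitor - No 'heartbeat' from BOINC... "
    "06:10:19 (85484): No heartbeat from core client for 30 sec - exiting"
)

for pid, count in heartbeat_counts(sample).most_common():
    print(pid, count)
```

Running the same function over the full Stderr text would show which process IDs logged the most missed heartbeats.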
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
17 Apr 2012 16:48:20 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 115,296 | 236,556 | 2.0517 |
17 Apr 2012 08:03:59 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 103,776 | 212,963 | 2.0521 |
16 Apr 2012 16:24:08 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 92,256 | 189,556 | 2.0547 |
16 Apr 2012 09:14:07 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 80,736 | 166,168 | 2.0582 |
15 Apr 2012 18:40:16 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 69,216 | 142,634 | 2.0607 |
15 Apr 2012 10:51:26 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 57,696 | 118,975 | 2.0621 |
14 Apr 2012 19:21:55 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 46,176 | 95,344 | 2.0648 |
14 Apr 2012 11:03:44 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 34,656 | 71,602 | 2.0661 |
13 Apr 2012 18:42:25 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 23,136 | 47,691 | 2.0613 |
13 Apr 2012 10:08:23 | 962831 | 14397676 | hadam3p_eu_96ma_1970_1_007869471_0 | 11,616 | 24,014 | 2.0673 |
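The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the figures in the table; the helper name `sec_per_timestep` is illustrative, and the formula is inferred from the numbers rather than taken from any CPDN documentation.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table above.
trickles = [
    (115_296, 236_556, 2.0517),
    (103_776, 212_963, 2.0521),
    (92_256, 189_556, 2.0547),
    (80_736, 166_168, 2.0582),
    (69_216, 142_634, 2.0607),
    (57_696, 118_975, 2.0621),
    (46_176, 95_344, 2.0648),
    (34_656, 71_602, 2.0661),
    (23_136, 47_691, 2.0613),
    (11_616, 24_014, 2.0673),
]

def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep (assumed definition of the column)."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value shown on the page.
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

On this reading, the average falls slightly over the run, from about 2.067 to about 2.052 seconds per timestep.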
©2024 cpdn.org