Name | hadam3p_eu_aiqi_1984_1_008068264_0 |
Workunit | 8223378 |
Created | 19 Jul 2012, 17:21:53 UTC |
Sent | 19 Jul 2012, 17:32:45 UTC |
Report deadline | 1 Jul 2013, 22:52:45 UTC |
Received | 17 Aug 2012, 9:49:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1187786 |
Run time | 3 days 4 hours 40 min 28 sec |
CPU time | 3 days 1 hours 21 min 22 sec |
Validate state | Invalid |
Credit | 1,790.21 |
Device peak FLOPS | 3.06 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:33:21 (3756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3924, selfPID=2852, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4640, selfPID=2800, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3284, selfPID=3332, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No Process Handle Regional CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:47:17 (3888): No heartbeat from core client for 30 sec - exiting 17:47:18 (3888): No heartbeat from core client for 30 sec - exiting 17:47:19 (3888): No heartbeat from core client for 30 sec - exiting 17:47:21 (3888): No heartbeat from core client for 30 sec - exiting 17:47:22 (3888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4512, selfPID=5496, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3828, selfPID=3836, iMonCtr=1 Model crash detected, will try to restart... 17:39:56 (3492): No heartbeat from core client for 30 sec - exiting 17:39:57 (3492): No heartbeat from core client for 30 sec - exiting 17:39:58 (3492): No heartbeat from core client for 30 sec - exiting 17:40:00 (3492): No heartbeat from core client for 30 sec - exiting 17:40:01 (3492): No heartbeat from core client for 30 sec - exiting 17:40:02 (3492): No heartbeat from core client for 30 sec - exiting 17:40:03 (3492): No heartbeat from core client for 30 sec - exiting 17:40:04 (3492): No heartbeat from core client for 30 sec - exiting 17:40:05 (3492): No heartbeat from core client for 30 sec - exiting 17:40:06 (3492): No heartbeat from core client for 30 sec - exiting 17:40:07 (3492): No heartbeat from core client for 30 sec - exiting 17:40:08 (3492): No heartbeat from core client for 30 sec - exiting 17:40:09 (3492): No heartbeat from core client for 30 sec - exiting 17:40:10 (3492): No heartbeat from core client for 30 sec - exiting 17:40:12 (3492): No heartbeat from core client for 30 sec - exiting 17:40:13 (3492): No heartbeat from core client for 30 sec - exiting 17:40:14 (3492): No heartbeat from core client for 30 sec - exiting 17:40:15 (3492): No heartbeat from core client for 30 sec - exiting 17:40:16 (3492): No heartbeat from core client for 30 sec - exiting 17:40:17 (3492): No heartbeat from core client for 30 sec - exiting 17:40:18 (3492): No heartbeat from core client for 30 sec - exiting 17:40:19 (3492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
11:52:54 (3516): No heartbeat from core client for 30 sec - exiting 11:52:55 (3516): No heartbeat from core client for 30 sec - exiting 11:52:56 (3516): No heartbeat from core client for 30 sec - exiting 11:52:57 (3516): No heartbeat from core client for 30 sec - exiting 11:52:58 (3516): No heartbeat from core client for 30 sec - exiting 11:52:59 (3516): No heartbeat from core client for 30 sec - exiting 11:53:00 (3516): No heartbeat from core client for 30 sec - exiting 11:53:01 (3516): No heartbeat from core client for 30 sec - exiting 11:53:02 (3516): No heartbeat from core client for 30 sec - exiting 11:53:04 (3516): No heartbeat from core client for 30 sec - exiting 11:53:05 (3516): No heartbeat from core client for 30 sec - exiting 11:53:06 (3516): No heartbeat from core client for 30 sec - exiting 11:53:07 (3516): No heartbeat from core client for 30 sec - exiting 11:53:08 (3516): No heartbeat from core client for 30 sec - exiting 11:53:09 (3516): No heartbeat from core client for 30 sec - exiting 11:53:10 (3516): No heartbeat from core client for 30 sec - exiting 11:53:11 (3516): No heartbeat from core client for 30 sec - exiting 11:53:12 (3516): No heartbeat from core client for 30 sec - exiting 11:53:13 (3516): No heartbeat from core client for 30 sec - exiting 11:53:14 (3516): No heartbeat from core client for 30 sec - exiting 11:53:16 (3516): No heartbeat from core client for 30 sec - exiting 11:53:17 (3516): No heartbeat from core client for 30 sec - exiting 11:53:18 (3516): No heartbeat from core client for 30 sec - exiting 11:53:19 (3516): No heartbeat from core client for 30 sec - exiting 11:53:20 (3516): No heartbeat from core client for 30 sec - exiting 11:53:21 (3516): No heartbeat from core client for 30 sec - exiting 11:53:22 (3516): No heartbeat from core client for 30 sec - exiting 11:53:23 (3516): No heartbeat from core client for 30 sec - exiting 11:53:24 (3516): No heartbeat from core client for 30 sec - exiting 11:53:25 (3516): No 
heartbeat from core client for 30 sec - exiting 11:53:26 (3516): No heartbeat from core client for 30 sec - exiting 11:53:28 (3516): No heartbeat from core client for 30 sec - exiting 11:53:29 (3516): No heartbeat from core client for 30 sec - exiting 11:53:30 (3516): No heartbeat from core client for 30 sec - exiting 11:53:31 (3516): No heartbeat from core client for 30 sec - exiting 11:53:32 (3516): No heartbeat from core client for 30 sec - exiting 11:53:33 (3516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:53:34 (3516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1656, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1896, selfPID=3520, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3992, selfPID=2800, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3820, selfPID=3820, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3632, selfPID=3632, iMonCtr=2 11:18:01 (5076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_aiqi_1984_1_008068264_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_aiqi_1984_1_008068264_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_aiqi_1984_1_008068264_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
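
The stderr block above ends with three `<file_xfer_error>` entries, all carrying error code -161. To my knowledge BOINC's error list defines that value as `ERR_NOT_FOUND`, but treat that mapping as an assumption rather than something this page states. A minimal sketch for pulling the file names and codes out of such a message follows; the `MESSAGE` constant, the `PATTERN` regex, and the `upload_failures` helper are hypothetical names introduced for illustration, not part of BOINC or CPDN.

```python
import re

# The <message> fragment copied from the stderr field above.
MESSAGE = (
    "upload failure: "
    "<file_xfer_error> <file_name>hadam3p_eu_aiqi_1984_1_008068264_0_10.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error> "
    "<file_xfer_error> <file_name>hadam3p_eu_aiqi_1984_1_008068264_0_11.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error> "
    "<file_xfer_error> <file_name>hadam3p_eu_aiqi_1984_1_008068264_0_12.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error>"
)

# Match each file name and the error code that immediately follows it.
PATTERN = re.compile(
    r"<file_name>(.*?)</file_name>\s*<error_code>(-?\d+)</error_code>"
)


def upload_failures(message):
    """Return a list of (file_name, error_code) pairs found in the message."""
    return [(name, int(code)) for name, code in PATTERN.findall(message)]


if __name__ == "__main__":
    for name, code in upload_failures(MESSAGE):
        print(name, code)
```

A regex is used instead of an XML parser because the fragment has no single root element and is preceded by free text.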
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
11 Aug 2012 20:17:03 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 103,776 | 249,069 | 2.4001 |
11 Aug 2012 10:17:09 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 92,256 | 221,664 | 2.4027 |
08 Aug 2012 12:11:56 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 80,737 | 194,862 | 2.4135 |
07 Aug 2012 18:07:06 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 80,736 | 194,557 | 2.4098 |
06 Aug 2012 05:48:21 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 69,216 | 167,540 | 2.4205 |
03 Aug 2012 16:02:37 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 57,696 | 139,805 | 2.4231 |
30 Jul 2012 07:36:27 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 46,176 | 112,287 | 2.4317 |
24 Jul 2012 19:23:40 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 34,656 | 84,965 | 2.4517 |
21 Jul 2012 20:54:17 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 23,136 | 57,585 | 2.4890 |
20 Jul 2012 17:19:59 | 1187786 | 14947565 | hadam3p_eu_aiqi_1984_1_008068264_0 | 11,616 | 28,962 | 2.4933 |
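
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep reached, rounded to four decimals. The page does not state this formula, so it is an inference from the numbers. A minimal sketch that checks it against the rows above follows; `TRICKLES` and `sec_per_timestep` are hypothetical names introduced for illustration.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (103776, 249069, 2.4001),
    (92256, 221664, 2.4027),
    (80737, 194862, 2.4135),
    (80736, 194557, 2.4098),
    (69216, 167540, 2.4205),
    (57696, 139805, 2.4231),
    (46176, 112287, 2.4317),
    (34656, 84965, 2.4517),
    (23136, 57585, 2.4890),
    (11616, 28962, 2.4933),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds per model timestep."""
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = sec_per_timestep(cpu_time_sec, timestep)
        print(timestep, computed, reported)
```

Under this assumption every computed value matches the reported one. The average falls steadily from the first trickle to the last, so later timesteps ran slightly faster per step than the earliest ones.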