Name | hadam3p_saf_1c7x_2004_1_006934069_1 |
Workunit | 7137385 |
Created | 12 Mar 2011, 11:44:20 UTC |
Sent | 12 Mar 2011, 15:29:53 UTC |
Report deadline | 22 Feb 2012, 20:49:53 UTC |
Received | 6 Apr 2011, 3:56:57 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1088181 |
Run time | 3 days 8 hours 17 min 28 sec |
CPU time | 2 days 19 hours 2 min 43 sec |
Validate state | Invalid |
Credit | 749.07 |
Device peak FLOPS | 1.66 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6792, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:42:13 (9416): start_timer_thread(): CreateThread() failed, errno 0 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9416, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9964, selfPID=6224, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4588, selfPID=4588, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7588, selfPID=7588, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9304, iMonCtr=2 Model crash detected, will try to restart... 00:54:10 (2352): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... 15:27:23 (1476): start_timer_thread(): CreateThread() failed, errno 0 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3828, iMonCtr=2 Model crash detected, will try to restart... 20:21:44 (5936): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... 04:58:33 (2096): No heartbeat from core client for 30 sec - exiting 04:58:34 (2096): No heartbeat from core client for 30 sec - exiting 04:58:36 (2096): No heartbeat from core client for 30 sec - exiting 04:58:37 (2096): No heartbeat from core client for 30 sec - exiting 04:58:38 (2096): No heartbeat from core client for 30 sec - exiting 04:58:39 (2096): No heartbeat from core client for 30 sec - exiting 04:58:40 (2096): No heartbeat from core client for 30 sec - exiting 04:58:41 (2096): No heartbeat from core client for 30 sec - exiting 04:58:42 (2096): No heartbeat from core client for 30 sec - exiting 04:58:44 (2096): No heartbeat from core client for 30 sec - exiting 04:58:45 (2096): No heartbeat from core client for 30 sec - exiting 04:58:46 (2096): No heartbeat from core client for 30 sec - exiting 04:58:47 (2096): No heartbeat from core client for 30 sec - exiting 04:58:48 (2096): No heartbeat from core client for 30 sec - exiting 04:58:49 (2096): No heartbeat from core client for 30 sec - exiting 04:58:50 (2096): No heartbeat from core client for 30 sec - exiting 04:58:51 (2096): No heartbeat from core client for 30 sec - exiting 04:58:52 (2096): No heartbeat from core client for 30 sec - exiting 04:58:53 (2096): No heartbeat from core client for 30 sec - exiting 04:58:54 (2096): No heartbeat from core client for 30 sec - exiting 04:58:56 (2096): No heartbeat from core client for 30 sec - exiting 04:58:57 (2096): No heartbeat from core client for 30 sec - exiting 04:58:58 (2096): No heartbeat from core client for 30 sec - exiting 04:58:59 (2096): No heartbeat from core client for 30 sec - exiting 04:59:00 (2096): No heartbeat from core client for 30 sec - exiting 04:59:01 (2096): No heartbeat from core client for 30 sec - exiting 04:59:02 (2096): No heartbeat from core client for 30 sec - exiting 04:59:03 (2096): No heartbeat from core client for 30 sec - exiting 04:59:05 (2096): No heartbeat from core client for 30 sec - exiting 04:59:06 (2096): No heartbeat from core client for 30 sec - exiting 04:59:07 (2096): No heartbeat from core client for 30 sec - exiting 04:59:08 (2096): No heartbeat from core client for 30 sec - exiting 04:59:09 (2096): No heartbeat from core client for 30 sec - exiting 04:59:15 (2096): No heartbeat from core client for 30 sec - exiting 04:59:17 (2096): No heartbeat from core client for 30 sec - exiting 04:59:18 (2096): No heartbeat from core client for 30 sec - exiting 04:59:19 (2096): No heartbeat from core client for 30 sec - exiting 04:59:20 (2096): No heartbeat from core client for 30 sec - exiting 04:59:21 (2096): No heartbeat from core client for 30 sec - exiting 04:59:22 (2096): No heartbeat from core client for 30 sec - exiting 04:59:23 (2096): No heartbeat from core client for 30 sec - exiting 04:59:24 (2096): No heartbeat from core client for 30 sec - exiting 04:59:25 (2096): No heartbeat from core client for 30 sec - exiting 04:59:26 (2096): No heartbeat from core client for 30 sec - exiting 04:59:27 (2096): No heartbeat from core client for 30 sec - exiting 04:59:29 (2096): No heartbeat from core client for 30 sec - exiting 04:59:30 (2096): No heartbeat from core client for 30 sec - exiting 04:59:31 (2096): No heartbeat from core client for 30 sec - exiting 04:59:32 (2096): No heartbeat from core client for 30 sec - exiting 04:59:33 (2096): No heartbeat from core client for 30 sec - exiting 04:59:34 (2096): No heartbeat from core client for 30 sec - exiting 04:59:35 (2096): No heartbeat from core client for 30 sec - exiting 04:59:36 (2096): No heartbeat from core client for 30 sec - exiting 04:59:37 (2096): No heartbeat from core client for 30 sec - exiting 04:59:38 (2096): No heartbeat from core client for 30 sec - exiting 04:59:40 (2096): No heartbeat from core client for 30 sec - exiting 04:59:41 (2096): No heartbeat from core client for 30 sec - exiting 04:59:42 (2096): No heartbeat from core client for 30 sec - exiting 04:59:43 (2096): No heartbeat from core client for 30 sec - exiting 04:59:44 (2096): No heartbeat from core client for 30 sec - exiting 04:59:45 (2096): No heartbeat from core client for 30 sec - exiting 04:59:46 (2096): No heartbeat from core client for 30 sec - exiting 04:59:47 (2096): No heartbeat from core client for 30 sec - exiting 04:59:49 (2096): No heartbeat from core client for 30 sec - exiting 04:59:57 (5080): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - No 'heartbeat' from BOINC... 05:00:04 (5392): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... 04:57:42 (4540): start_timer_thread(): CreateThread() failed, errno 0 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4540, selfPID=3912, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 05:10:09 (1172): start_timer_thread(): CreateThread() failed, errno 0 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2192, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 21:57:31 (1444): start_timer_thread(): CreateThread() failed, errno 0 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5256, selfPID=5256, iMonCtr=2 16:26:40 (4188): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... 09:27:10 (2320): start_timer_thread(): CreateThread() failed, errno 0 09:27:11 (1696): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:48:06 (10056): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... C04:55:24 (2964): start_timer_thread(): CreateThread() failed, errno 0 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=2964, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3664, selfPID=3664, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3664, selfPID=4024, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 04:56:39 (4024): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_1c7x_2004_1_006934069_1_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c7x_2004_1_006934069_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c7x_2004_1_006934069_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c7x_2004_1_006934069_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c7x_2004_1_006934069_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c7x_2004_1_006934069_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c7x_2004_1_006934069_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1c7x_2004_1_006934069_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Apr 2011 08:50:12 | 1088181 | 12662922 | hadam3p_saf_1c7x_2004_1_006934069_1 | 46,176 | 231,321 | 5.0096 |
28 Mar 2011 00:05:32 | 1088181 | 12662922 | hadam3p_saf_1c7x_2004_1_006934069_1 | 34,656 | 170,716 | 4.9260 |
27 Mar 2011 02:14:57 | 1088181 | 12662922 | hadam3p_saf_1c7x_2004_1_006934069_1 | 23,136 | 114,217 | 4.9368 |
26 Mar 2011 03:57:04 | 1088181 | 12662922 | hadam3p_saf_1c7x_2004_1_006934069_1 | 11,616 | 56,231 | 4.8408 |
©2024 cpdn.org