Name | hadam3p_anz_n8su_2012_1_008599022_0 |
Workunit | 8745534 |
Created | 26 Mar 2014, 19:02:52 UTC |
Sent | 27 Mar 2014, 18:37:51 UTC |
Report deadline | 9 Mar 2015, 23:57:51 UTC |
Received | 2 Jun 2014, 16:25:28 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1232555 |
Run time | 4 days 1 hours 49 min 16 sec |
CPU time | 2 days 14 hours 1 min 27 sec |
Validate state | Invalid |
Credit | 1,503.36 |
Device peak FLOPS | 2.86 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:08:54 (6932): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... 18:50:56 (8684): start_timer_thread(): CreateThread() failed, errno 0 18:50:58 (8732): start_timer_thread(): CreateThread() failed, errno 0 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:05:59 (8572): start_timer_thread(): CreateThread() failed, errno 0 19:05:59 (4272): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18620, selfPID=19560, iMonCtr=1 Model crash detected, will try to restart... 19:46:29 (11108): start_timer_thread(): CreateThread() failed, errno 0 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:59:44 (7240): No heartbeat from core client for 30 sec - exiting 18:59:45 (7240): No heartbeat from core client for 30 sec - exiting 18:59:46 (7240): No heartbeat from core client for 30 sec - exiting 18:59:47 (7240): No heartbeat from core client for 30 sec - exiting 18:59:48 (7240): No heartbeat from core client for 30 sec - exiting 18:59:49 (7240): No heartbeat from core client for 30 sec - exiting 18:59:50 (7240): No heartbeat from core client for 30 sec - exiting 18:59:51 (7240): No heartbeat from core client for 30 sec - exiting 18:59:52 (7240): No heartbeat from core client for 30 sec - exiting 18:59:53 (7240): No heartbeat from core client for 30 sec - exiting 18:59:54 (7240): No heartbeat from core client for 30 sec - exiting 18:59:55 (7240): No heartbeat from core client for 30 sec - exiting 18:59:56 (7240): No heartbeat from core client for 30 sec - exiting 18:59:57 (7240): No heartbeat from core client for 30 sec - exiting 18:59:58 (7240): No heartbeat from core client for 30 sec - exiting 18:59:59 (7240): No heartbeat from core client for 30 sec - exiting 19:00:00 (7240): No heartbeat from core client for 30 sec - exiting 19:00:01 (7240): No heartbeat from core client for 30 sec - exiting 19:00:02 (7240): No heartbeat from core client for 30 sec - exiting 19:00:03 (7240): No heartbeat from core client for 30 sec - exiting 19:00:04 (7240): No heartbeat from core client for 30 sec - exiting 19:00:05 (7240): No heartbeat from core client for 30 sec - exiting 19:00:06 (7240): No heartbeat from core client for 30 sec - exiting 19:00:07 (7240): No heartbeat from core client for 30 sec - exiting 19:00:08 (7240): No heartbeat from core client for 30 sec - exiting 19:00:09 (7240): No heartbeat from core client for 30 sec - exiting 19:00:10 (7240): No heartbeat from core client for 30 sec - exiting 19:00:11 (7240): No heartbeat from core client for 30 sec - exiting 19:00:12 (7240): No heartbeat from core client for 30 sec - exiting 19:00:13 (7240): No heartbeat from core client for 30 sec - exiting 19:00:14 (7240): No heartbeat from core client for 30 sec - exiting 19:00:15 (7240): No heartbeat from core client for 30 sec - exiting 19:00:16 (7240): No heartbeat from core client for 30 sec - exiting 19:00:17 (7240): No heartbeat from core client for 30 sec - exiting 19:00:18 (7240): No heartbeat from core client for 30 sec - exiting 19:00:19 (7240): No heartbeat from core client for 30 sec - exiting 19:00:20 (7240): No heartbeat from core client for 30 sec - exiting 19:00:21 (7240): No heartbeat from core client for 30 sec - exiting 19:00:22 (7240): No heartbeat from core client for 30 sec - exiting 19:00:23 (7240): No heartbeat from core client for 30 sec - exiting 19:00:24 (7240): No heartbeat from core client for 30 sec - exiting 19:00:25 (7240): No heartbeat from core client for 30 sec - exiting 19:00:26 (7240): No heartbeat from core client for 30 sec - exiting 19:00:27 (7240): No heartbeat from core client for 30 sec - exiting 19:00:28 (7240): No heartbeat from core client for 30 sec - exiting 19:00:29 (7240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4820, selfPID=4820, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15660, selfPID=8244, iMonCtr=1 Model crash detected, will try to restart... 19:31:43 (6948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:07:36 (10564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:25:15 (7044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6260, selfPID=6260, iMonCtr=2 18:25:16 (7044): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=12912, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13012, selfPID=9916, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... 19:46:08 (6880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:46:09 (6880): No heartbeat from core client for 30 sec - exiting 19:46:10 (6880): No heartbeat from core client for 30 sec - exiting 19:46:11 (6880): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 20:10:14 (8864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:10:15 (8864): No heartbeat from core client for 30 sec - exiting 20:10:16 (8864): No heartbeat from core client for 30 sec - exiting 20:10:17 (8864): No heartbeat from core client for 30 sec - exiting 20:10:18 (8864): No heartbeat from core client for 30 sec - exiting 20:10:19 (8864): No heartbeat from core client for 30 sec - exiting 20:10:20 (8864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 18:16:07 (6796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:17:03 (5476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:13:22 (7148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:13:23 (7148): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 21:22:28 (10812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7252, selfPID=7008, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:34:38 (7104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:30:10 (9448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8508, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9144, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 22:16:44 (5800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 18:40:52 (8452): No heartbeat from core client for 30 sec - exiting 18:40:53 (8452): No heartbeat from core client for 30 sec - exiting 18:40:54 (8452): No heartbeat from core client for 30 sec - exiting 18:40:55 (8452): No heartbeat from core client for 30 sec - exiting 18:40:56 (8452): No heartbeat from core client for 30 sec - exiting 18:40:57 (8452): No heartbeat from core client for 30 sec - exiting 18:40:58 (8452): No heartbeat from core client for 30 sec - exiting 18:40:59 (8452): No heartbeat from core client for 30 sec - exiting 18:41:00 (8452): No heartbeat from core client for 30 sec - exiting 18:41:01 (8452): No heartbeat from core client for 30 sec - exiting 18:41:02 (8452): No heartbeat from core client for 30 sec - exiting 18:41:03 (8452): No heartbeat from core client for 30 sec - exiting 18:41:04 (8452): No heartbeat from core client for 30 sec - exiting 18:41:05 (8452): No heartbeat from core client for 30 sec - exiting 18:41:06 (8452): No heartbeat from core client for 30 sec - exiting 18:41:07 (8452): No heartbeat from core client for 30 sec - exiting 18:41:08 (8452): No heartbeat from core client for 30 sec - exiting 18:41:09 (8452): No heartbeat from core client for 30 sec - exiting 18:41:10 (8452): No heartbeat from core client for 30 sec - exiting 18:41:11 (8452): No heartbeat from core client for 30 sec - exiting 18:41:12 (8452): No heartbeat from core client for 30 sec - exiting 18:41:13 (8452): No heartbeat from core client for 30 sec - exiting 18:41:14 (8452): No heartbeat from core client for 30 sec - exiting 18:41:15 (8452): No heartbeat from core client for 30 sec - exiting 18:41:16 (8452): No heartbeat from core client for 30 sec - exiting 18:41:17 (8452): No heartbeat from core client for 30 sec - exiting 18:41:18 (8452): No heartbeat from core client for 30 sec - exiting 18:41:19 (8452): No heartbeat from core client for 30 sec - exiting 18:41:20 (8452): No heartbeat from core client for 30 sec - exiting 18:41:21 (8452): No heartbeat from core client for 30 sec - exiting 18:41:22 (8452): No heartbeat from core client for 30 sec - exiting 18:41:23 (8452): No heartbeat from core client for 30 sec - exiting 18:41:24 (8452): No heartbeat from core client for 30 sec - exiting 18:41:25 (8452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:20:02 (7092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:20:03 (7092): No heartbeat from core client for 30 sec - exiting 22:20:04 (7092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:35:12 (4848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:04:34 (5600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8296, selfPID=8292, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:53:11 (10220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3864, selfPID=3864, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:46:04 (8688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:46:05 (8688): No heartbeat from core client for 30 sec - exiting 19:46:06 (8688): No heartbeat from core client for 30 sec - exiting 19:46:07 (8688): No heartbeat from core client for 30 sec - exiting 19:46:08 (8688): No heartbeat from core client for 30 sec - exiting 19:46:09 (8688): No heartbeat from core client for 30 sec - exiting 19:46:10 (8688): No heartbeat from core client for 30 sec - exiting 19:46:11 (8688): No heartbeat from core client for 30 sec - exiting 19:46:12 (8688): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 20:19:07 (10260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=12584, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11332, selfPID=15680, iMonCtr=1 Model crash detected, will try to restart... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11332, selfPID=11332, iMonCtr=2 </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_4.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n8su_2012_1_008599022_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 May 2014 19:56:15 | 1232555 | 16416314 | hadam3p_anz_n8su_2012_1_008599022_0 | 34,859 | 176,127 | 5.0526 |
29 Apr 2014 19:23:44 | 1232555 | 16416314 | hadam3p_anz_n8su_2012_1_008599022_0 | 23,339 | 117,515 | 5.0351 |
14 Apr 2014 18:49:37 | 1232555 | 16416314 | hadam3p_anz_n8su_2012_1_008599022_0 | 11,819 | 59,278 | 5.0155 |
©2024 cpdn.org