Name | hadam3p_saf_1s91_1996_1_007002845_2 |
Workunit | 7206161 |
Created | 20 Feb 2011, 22:04:38 UTC |
Sent | 20 Feb 2011, 22:57:25 UTC |
Report deadline | 3 Feb 2012, 4:17:25 UTC |
Received | 14 Mar 2011, 19:56:41 UTC |
Server state | Over |
Outcome | Didn't need |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1135560 |
Run time | 15 hours 2 min 1 sec |
CPU time | 1 hours 53 min 53 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 0.74 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4440, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4924, selfPID=4960, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 19:07:41 (3864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4296, selfPID=3584, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=2 Model crash detected, will try to restart... 11:12:36 (5068): No heartbeat from core client for 30 sec - exiting 11:12:37 (5068): No heartbeat from core client for 30 sec - exiting 11:12:38 (5068): No heartbeat from core client for 30 sec - exiting 11:12:39 (5068): No heartbeat from core client for 30 sec - exiting 11:12:40 (5068): No heartbeat from core client for 30 sec - exiting 11:12:41 (5068): No heartbeat from core client for 30 sec - exiting 11:12:42 (5068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:35:05 (2940): No heartbeat from core client for 30 sec - exiting 20:35:06 (2940): No heartbeat from core client for 30 sec - exiting 20:35:07 (2940): No heartbeat from core client for 30 sec - exiting 20:35:08 (2940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4248, iMonCtr=2 Model crash detected, will try to restart... 22:28:21 (4004): No heartbeat from core client for 30 sec - exiting 22:28:22 (4004): No heartbeat from core client for 30 sec - exiting 22:28:23 (4004): No heartbeat from core client for 30 sec - exiting 22:28:24 (4004): No heartbeat from core client for 30 sec - exiting 22:28:25 (4004): No heartbeat from core client for 30 sec - exiting 22:28:26 (4004): No heartbeat from core client for 30 sec - exiting 22:28:27 (4004): No heartbeat from core client for 30 sec - exiting 22:28:28 (4004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5972, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5544, selfPID=1072, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 21:46:34 (4324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5548, iMonCtr=2 Model crash detected, will try to restart... 21:54:51 (4552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:30:43 (1248): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4400, selfPID=4400, iMonCtr=2 22:00:12 (2756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5976, selfPID=5524, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4364, selfPID=4604, iMonCtr=1 Model crash detected, will try to restart... 10:07:46 (4468): No heartbeat from core client for 30 sec - exiting 10:07:47 (4468): No heartbeat from core client for 30 sec - exiting 10:07:48 (4468): No heartbeat from core client for 30 sec - exiting 10:07:49 (4468): No heartbeat from core client for 30 sec - exiting 10:07:50 (4468): No heartbeat from core client for 30 sec - exiting 10:07:51 (4468): No heartbeat from core client for 30 sec - exiting 10:07:52 (4468): No heartbeat from core client for 30 sec - exiting 10:07:53 (4468): No heartbeat from core client for 30 sec - exiting 10:07:54 (4468): No heartbeat from core client for 30 sec - exiting 10:07:55 (4468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5224, selfPID=3728, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 19:29:24 (3728): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_1.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_2.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1s91_1996_1_007002845_2_13.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
No trickles! |
---|
©2024 cpdn.org