Name | hadam3p_eu_64de_2001_1_007529061_1 |
Workunit | 7726293 |
Created | 6 Nov 2011, 12:44:21 UTC |
Sent | 6 Nov 2011, 13:07:01 UTC |
Report deadline | 18 Oct 2012, 18:27:01 UTC |
Received | 9 Dec 2011, 15:55:39 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1019661 |
Run time | 3 days 9 hours 49 min 51 sec |
CPU time | 2 days 22 hours 28 min 34 sec |
Validate state | Invalid |
Credit | 1,392.75 |
Device peak FLOPS | 2.32 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.6.38</core_client_version> <![CDATA[ <stderr_txt> 18:34:48 (5020): No heartbeat from core client for 30 sec - exiting 18:34:49 (5020): No heartbeat from core client for 30 sec - exiting 18:34:50 (5020): No heartbeat from core client for 30 sec - exiting 18:34:51 (5020): No heartbeat from core client for 30 sec - exiting 18:34:52 (5020): No heartbeat from core client for 30 sec - exiting 18:34:53 (5020): No heartbeat from core client for 30 sec - exiting 18:34:54 (5020): No heartbeat from core client for 30 sec - exiting 18:34:55 (5020): No heartbeat from core client for 30 sec - exiting 18:34:56 (5020): No heartbeat from core client for 30 sec - exiting 18:34:57 (5020): No heartbeat from core client for 30 sec - exiting 18:34:59 (5020): No heartbeat from core client for 30 sec - exiting 18:35:00 (5020): No heartbeat from core client for 30 sec - exiting 18:35:01 (5020): No heartbeat from core client for 30 sec - exiting 18:35:02 (5020): No heartbeat from core client for 30 sec - exiting 18:35:03 (5020): No heartbeat from core client for 30 sec - exiting 19:49:10 (4264): No heartbeat from core client for 30 sec - exiting 19:49:11 (4264): No heartbeat from core client for 30 sec - exiting 19:49:12 (4264): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4048, iMonCtr=2 Model crash detected, will try to restart... 19:52:05 (3604): No heartbeat from core client for 30 sec - exiting 19:52:06 (3604): No heartbeat from core client for 30 sec - exiting 19:52:07 (3604): No heartbeat from core client for 30 sec - exiting 19:52:08 (3604): No heartbeat from core client for 30 sec - exiting 19:52:09 (3604): No heartbeat from core client for 30 sec - exiting 19:52:11 (3604): No heartbeat from core client for 30 sec - exiting 19:52:12 (3604): No heartbeat from core client for 30 sec - exiting 19:52:13 (3604): No heartbeat from core client for 30 sec - exiting 19:52:14 (3604): No heartbeat from core client for 30 sec - exiting 19:52:15 (3604): No heartbeat from core client for 30 sec - exiting 19:52:16 (3604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6008, selfPID=2340, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:57:22 (4004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:11:06 (2000): No heartbeat from core client for 30 sec - exiting 19:11:07 (2000): No heartbeat from core client for 30 sec - exiting 19:11:08 (2000): No heartbeat from core client for 30 sec - exiting 19:11:09 (2000): No heartbeat from core client for 30 sec - exiting 19:11:10 (2000): No heartbeat from core client for 30 sec - exiting 19:11:11 (2000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=4464, iMonCtr=1 Model crash detected, will try to restart... 12:01:33 (4748): No heartbeat from core client for 30 sec - exiting 12:01:34 (4748): No heartbeat from core client for 30 sec - exiting 12:01:35 (4748): No heartbeat from core client for 30 sec - exiting 12:01:36 (4748): No heartbeat from core client for 30 sec - exiting 12:01:37 (4748): No heartbeat from core client for 30 sec - exiting 12:01:38 (4748): No heartbeat from core client for 30 sec - exiting 12:01:39 (4748): No heartbeat from core client for 30 sec - exiting 12:01:40 (4748): No heartbeat from core client for 30 sec - exiting 12:01:41 (4748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5660, selfPID=4164, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... C20:14:12 (5120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6124, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3460, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 22:16:21 (1112): No heartbeat from core client for 30 sec - exiting 22:16:22 (1112): No heartbeat from core client for 30 sec - exiting 22:16:23 (1112): No heartbeat from core client for 30 sec - exiting 22:16:24 (1112): No heartbeat from core client for 30 sec - exiting 22:16:25 (1112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=712, selfPID=4768, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5704, selfPID=4172, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5820, selfPID=4564, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 16:53:35 (5560): No heartbeat from core client for 30 sec - exiting 16:53:36 (5560): No heartbeat from core client for 30 sec - exiting 16:53:37 (5560): No heartbeat from core client for 30 sec - exiting 16:53:38 (5560): No heartbeat from core client for 30 sec - exiting 16:53:39 (5560): No heartbeat from core client for 30 sec - exiting 16:53:40 (5560): No heartbeat from core client for 30 sec - exiting 16:53:41 (5560): No heartbeat from core client for 30 sec - exiting 16:53:42 (5560): No heartbeat from core client for 30 sec - exiting 16:53:43 (5560): No heartbeat from core client for 30 sec - exiting 16:53:44 (5560): No heartbeat from core client for 30 sec - exiting 16:53:46 (5560): No heartbeat from core client for 30 sec - exiting 16:53:47 (5560): No heartbeat from core client for 30 sec - exiting 16:53:48 (5560): No heartbeat from core client for 30 sec - exiting 16:53:49 (5560): No heartbeat from core client for 30 sec - exiting 16:53:50 (5560): No heartbeat from core client for 30 sec - exiting 16:53:51 (5560): No heartbeat from core client for 30 sec - exiting 16:53:52 (5560): No heartbeat from core client for 30 sec - exiting 16:53:53 (5560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_64de_2001_1_007529061\tmp\xaakm.namelists Image PC Routine Line Source hadam3p_eu_um_6.0 0122A39A Unknown Unknown Unknown hadam3p_eu_um_6.0 011D2CD0 Unknown Unknown Unknown hadam3p_eu_um_6.0 011D1E9A Unknown Unknown Unknown hadam3p_eu_um_6.0 011B2819 Unknown Unknown Unknown hadam3p_eu_um_6.0 010B2287 Unknown Unknown Unknown hadam3p_eu_um_6.0 0114E7B2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0114F2DA Unknown Unknown Unknown hadam3p_eu_um_6.0 00EC9BD2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0120E638 Unknown Unknown Unknown kernel32.dll 7595D309 Unknown Unknown Unknown ntdll.dll 76EE16C3 Unknown Unknown Unknown ntdll.dll 76EE1696 Unknown Unknown Unknown forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_64de_2001_1_007529061\tmp\xaakg.namelists Image PC Routine Line Source hadrm3p_eu_um_6.0 00B5C52A Unknown Unknown Unknown hadrm3p_eu_um_6.0 00B04460 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00B0362A Unknown Unknown Unknown hadrm3p_eu_um_6.0 00AE2469 Unknown Unknown Unknown hadrm3p_eu_um_6.0 009E66EB Unknown Unknown Unknown hadrm3p_eu_um_6.0 00A82AE2 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00A835AF Unknown Unknown Unknown hadrm3p_eu_um_6.0 00829860 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00B40893 Unknown Unknown Unknown kernel32.dll 7595D309 Unknown Unknown Unknown ntdll.dll 76EE16C3 Unknown Unknown Unknown ntdll.dll 76EE1696 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5336, selfPID=5256, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_64de_2001_1_007529061_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_64de_2001_1_007529061_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_64de_2001_1_007529061_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_64de_2001_1_007529061_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_64de_2001_1_007529061_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Dec 2011 11:03:49 | 1019661 | 13612938 | hadam3p_eu_64de_2001_1_007529061_1 | 80,736 | 226,996 | 2.8116 |
07 Dec 2011 13:33:27 | 1019661 | 13612938 | hadam3p_eu_64de_2001_1_007529061_1 | 69,216 | 194,890 | 2.8157 |
04 Dec 2011 14:41:23 | 1019661 | 13612938 | hadam3p_eu_64de_2001_1_007529061_1 | 57,696 | 162,411 | 2.8149 |
03 Dec 2011 10:16:36 | 1019661 | 13612938 | hadam3p_eu_64de_2001_1_007529061_1 | 46,176 | 129,635 | 2.8074 |
27 Nov 2011 16:02:37 | 1019661 | 13612938 | hadam3p_eu_64de_2001_1_007529061_1 | 34,659 | 96,744 | 2.7913 |
26 Nov 2011 17:26:41 | 1019661 | 13612938 | hadam3p_eu_64de_2001_1_007529061_1 | 34,656 | 96,309 | 2.7790 |
19 Nov 2011 19:13:42 | 1019661 | 13612938 | hadam3p_eu_64de_2001_1_007529061_1 | 23,136 | 64,441 | 2.7853 |
15 Nov 2011 17:41:25 | 1019661 | 13612938 | hadam3p_eu_64de_2001_1_007529061_1 | 11,616 | 32,449 | 2.7935 |
©2024 cpdn.org