Task 13612938

Name hadam3p_eu_64de_2001_1_007529061_1
Workunit 7726293
Created 6 Nov 2011, 12:44:21 UTC
Sent 6 Nov 2011, 13:07:01 UTC
Report deadline 18 Oct 2012, 18:27:01 UTC
Received 9 Dec 2011, 15:55:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1019661
Run time 3 days 9 hours 49 min 51 sec
CPU time 2 days 22 hours 28 min 34 sec
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 2.32 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
Stderr
<core_client_version>6.6.38</core_client_version>
<![CDATA[
<stderr_txt>
18:34:48 (5020): No heartbeat from core client for 30 sec - exiting
18:34:49 (5020): No heartbeat from core client for 30 sec - exiting
18:34:50 (5020): No heartbeat from core client for 30 sec - exiting
18:34:51 (5020): No heartbeat from core client for 30 sec - exiting
18:34:52 (5020): No heartbeat from core client for 30 sec - exiting
18:34:53 (5020): No heartbeat from core client for 30 sec - exiting
18:34:54 (5020): No heartbeat from core client for 30 sec - exiting
18:34:55 (5020): No heartbeat from core client for 30 sec - exiting
18:34:56 (5020): No heartbeat from core client for 30 sec - exiting
18:34:57 (5020): No heartbeat from core client for 30 sec - exiting
18:34:59 (5020): No heartbeat from core client for 30 sec - exiting
18:35:00 (5020): No heartbeat from core client for 30 sec - exiting
18:35:01 (5020): No heartbeat from core client for 30 sec - exiting
18:35:02 (5020): No heartbeat from core client for 30 sec - exiting
18:35:03 (5020): No heartbeat from core client for 30 sec - exiting
19:49:10 (4264): No heartbeat from core client for 30 sec - exiting
19:49:11 (4264): No heartbeat from core client for 30 sec - exiting
19:49:12 (4264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4048, iMonCtr=2
Model crash detected, will try to restart...
19:52:05 (3604): No heartbeat from core client for 30 sec - exiting
19:52:06 (3604): No heartbeat from core client for 30 sec - exiting
19:52:07 (3604): No heartbeat from core client for 30 sec - exiting
19:52:08 (3604): No heartbeat from core client for 30 sec - exiting
19:52:09 (3604): No heartbeat from core client for 30 sec - exiting
19:52:11 (3604): No heartbeat from core client for 30 sec - exiting
19:52:12 (3604): No heartbeat from core client for 30 sec - exiting
19:52:13 (3604): No heartbeat from core client for 30 sec - exiting
19:52:14 (3604): No heartbeat from core client for 30 sec - exiting
19:52:15 (3604): No heartbeat from core client for 30 sec - exiting
19:52:16 (3604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6008, selfPID=2340, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:57:22 (4004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:11:06 (2000): No heartbeat from core client for 30 sec - exiting
19:11:07 (2000): No heartbeat from core client for 30 sec - exiting
19:11:08 (2000): No heartbeat from core client for 30 sec - exiting
19:11:09 (2000): No heartbeat from core client for 30 sec - exiting
19:11:10 (2000): No heartbeat from core client for 30 sec - exiting
19:11:11 (2000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
12:01:33 (4748): No heartbeat from core client for 30 sec - exiting
12:01:34 (4748): No heartbeat from core client for 30 sec - exiting
12:01:35 (4748): No heartbeat from core client for 30 sec - exiting
12:01:36 (4748): No heartbeat from core client for 30 sec - exiting
12:01:37 (4748): No heartbeat from core client for 30 sec - exiting
12:01:38 (4748): No heartbeat from core client for 30 sec - exiting
12:01:39 (4748): No heartbeat from core client for 30 sec - exiting
12:01:40 (4748): No heartbeat from core client for 30 sec - exiting
12:01:41 (4748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5660, selfPID=4164, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
C20:14:12 (5120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6124, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3460, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
22:16:21 (1112): No heartbeat from core client for 30 sec - exiting
22:16:22 (1112): No heartbeat from core client for 30 sec - exiting
22:16:23 (1112): No heartbeat from core client for 30 sec - exiting
22:16:24 (1112): No heartbeat from core client for 30 sec - exiting
22:16:25 (1112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=712, selfPID=4768, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5704, selfPID=4172, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5820, selfPID=4564, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:53:35 (5560): No heartbeat from core client for 30 sec - exiting
16:53:36 (5560): No heartbeat from core client for 30 sec - exiting
16:53:37 (5560): No heartbeat from core client for 30 sec - exiting
16:53:38 (5560): No heartbeat from core client for 30 sec - exiting
16:53:39 (5560): No heartbeat from core client for 30 sec - exiting
16:53:40 (5560): No heartbeat from core client for 30 sec - exiting
16:53:41 (5560): No heartbeat from core client for 30 sec - exiting
16:53:42 (5560): No heartbeat from core client for 30 sec - exiting
16:53:43 (5560): No heartbeat from core client for 30 sec - exiting
16:53:44 (5560): No heartbeat from core client for 30 sec - exiting
16:53:46 (5560): No heartbeat from core client for 30 sec - exiting
16:53:47 (5560): No heartbeat from core client for 30 sec - exiting
16:53:48 (5560): No heartbeat from core client for 30 sec - exiting
16:53:49 (5560): No heartbeat from core client for 30 sec - exiting
16:53:50 (5560): No heartbeat from core client for 30 sec - exiting
16:53:51 (5560): No heartbeat from core client for 30 sec - exiting
16:53:52 (5560): No heartbeat from core client for 30 sec - exiting
16:53:53 (5560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_64de_2001_1_007529061\tmp\xaakm.namelists
Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0122A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  011D2CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  011D1E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  011B2819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  010B2287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0114E7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0114F2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00EC9BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0120E638  Unknown               Unknown  Unknown
kernel32.dll       7595D309  Unknown               Unknown  Unknown
ntdll.dll          76EE16C3  Unknown               Unknown  Unknown
ntdll.dll          76EE1696  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_64de_2001_1_007529061\tmp\xaakg.namelists
Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  00B5C52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00B04460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00B0362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00AE2469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  009E66EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00A82AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00A835AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00829860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00B40893  Unknown               Unknown  Unknown
kernel32.dll       7595D309  Unknown               Unknown  Unknown
ntdll.dll          76EE16C3  Unknown               Unknown  Unknown
ntdll.dll          76EE1696  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5336, selfPID=5256, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_64de_2001_1_007529061_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_64de_2001_1_007529061_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_64de_2001_1_007529061_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_64de_2001_1_007529061_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_64de_2001_1_007529061_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
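
The stderr output above repeats the same "No heartbeat from core client" line roughly once per second for each affected process. As a reading aid, the sketch below tallies those lines per process ID. The regular expression and the `count_heartbeat_exits` helper are illustrative assumptions, not part of BOINC or CPDN; the sample lines are copied from the log above.

```python
import re
from collections import Counter

# Matches e.g. "19:49:10 (4264): No heartbeat from core client for 30 sec - exiting".
# search() rather than match() tolerates stray characters left by interleaved output.
HEARTBEAT_RE = re.compile(r"\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat from core client")

def count_heartbeat_exits(stderr_text):
    """Return a Counter mapping process ID -> number of heartbeat-timeout lines."""
    counts = Counter()
    for line in stderr_text.splitlines():
        m = HEARTBEAT_RE.search(line)
        if m:
            counts[int(m.group(1))] += 1
    return counts

# A few lines copied from the stderr section (hypothetical subset, for illustration).
sample = """\
18:35:03 (5020): No heartbeat from core client for 30 sec - exiting
19:49:10 (4264): No heartbeat from core client for 30 sec - exiting
19:49:11 (4264): No heartbeat from core client for 30 sec - exiting
19:49:12 (4264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
C20:14:12 (5120): No heartbeat from core client for 30 sec - exiting
"""

counts = count_heartbeat_exits(sample)
for pid, n in sorted(counts.items()):
    print(f"PID {pid}: {n} line(s)")
```

Grouping by process ID makes it easier to see how many separate times the model lost contact with the BOINC client, as opposed to how many lines each episode produced.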
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
08 Dec 2011 11:03:49  1019661  13612938   hadam3p_eu_64de_2001_1_007529061_1  80,736    226,996         2.8116
07 Dec 2011 13:33:27  1019661  13612938   hadam3p_eu_64de_2001_1_007529061_1  69,216    194,890         2.8157
04 Dec 2011 14:41:23  1019661  13612938   hadam3p_eu_64de_2001_1_007529061_1  57,696    162,411         2.8149
03 Dec 2011 10:16:36  1019661  13612938   hadam3p_eu_64de_2001_1_007529061_1  46,176    129,635         2.8074
27 Nov 2011 16:02:37  1019661  13612938   hadam3p_eu_64de_2001_1_007529061_1  34,659    96,744          2.7913
26 Nov 2011 17:26:41  1019661  13612938   hadam3p_eu_64de_2001_1_007529061_1  34,656    96,309          2.7790
19 Nov 2011 19:13:42  1019661  13612938   hadam3p_eu_64de_2001_1_007529061_1  23,136    64,441          2.7853
15 Nov 2011 17:41:25  1019661  13612938   hadam3p_eu_64de_2001_1_007529061_1  11,616    32,449          2.7935
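
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This is an inference from the figures in the table, not a documented formula. A minimal sketch that recomputes it from the rows above (the function name `average_sec_per_timestep` is hypothetical):

```python
# (timestep, cumulative CPU seconds) pairs copied from the trickle table, newest first.
trickles = [
    (80736, 226996),
    (69216, 194890),
    (57696, 162411),
    (46176, 129635),
    (34659, 96744),
    (34656, 96309),
    (23136, 64441),
    (11616, 32449),
]

def average_sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)

averages = [average_sec_per_timestep(ts, cpu) for ts, cpu in trickles]

for (ts, cpu), avg in zip(trickles, averages):
    print(f"{ts:>6} timesteps, {cpu:>7} s CPU -> {avg:.4f} sec/TS")
```

Under that assumption, each recomputed value agrees with the listed average, at about 2.8 seconds of CPU time per timestep throughout the run.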