climateprediction.net home page
Task 13924010

Task 13924010

Name hadcm3n_yian_1940_40_007682505_1
Workunit 7837592
Created 15 Jan 2012, 23:48:23 UTC
Sent 15 Jan 2012, 23:48:35 UTC
Report deadline 16 Apr 2012, 7:15:46 UTC
Received 2 Jul 2012, 11:26:23 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1017301
Run time 9 days 9 hours 37 min 30 sec
CPU time 7 days 21 hours 44 min 7 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 2.74 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
15:42:34 (6004): No heartbeat from core client for 30 sec - exiting
15:42:35 (6004): No heartbeat from core client for 30 sec - exiting
15:42:36 (6004): No heartbeat from core client for 30 sec - exiting
15:42:37 (6004): No heartbeat from core client for 30 sec - exiting
15:42:38 (6004): No heartbeat from core client for 30 sec - exiting
15:42:39 (6004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:45:05 (10692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
Ocean Restart file copy failed on yianko.dae1220
Ocean Restart file copy failed on yianko.dae4ch0
Ocean Restart file copy failed on yianko.dae4ci0
Ocean Restart file copy failed on yianko.daf4b20
Ocean Restart file copy failed on yianko.daf8740
Ocean Restart file copy failed on yianko.daf8750
Ocean Restart file copy failed on yianko.daf8760
Ocean Restart file copy failed on yianko.daf8770
Ocean Restart file copy failed on yianko.daf8780
Ocean Restart file copy failed on yianko.daf8790
Ocean Restart file copy failed on yianko.daf87a0
Ocean Restart file copy failed on yianko.daf87b0
Ocean Restart file copy failed on yianko.daf87c0
Ocean Restart file copy failed on yianko.daf87d0
Ocean Restart file copy failed on yianko.daf87e0
Ocean Restart file copy failed on yianko.daf87f0
Ocean Restart file copy failed on yianko.daf87g0
Ocean Restart file copy failed on yianko.daf87h0
Ocean Restart file copy failed on yianko.daf87i0
07:13:01 (18056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: U_MODEL: Illegal combination of submodels                                                                                                                                                                                                                       tmp/pipe_dummy                                                                  2048    
Atmos Hold Restart file rename failed on atmos_restart.hold

Model crashed: U_MODEL: Illegal combination of submodels                                                                                                                                                                                                                       tmp/pipe_dummy                                                                  2048    
Atmos Hold Restart file rename failed on atmos_restart.hold

Model crashed: U_MODEL: Illegal combination of submodels                                                                                                                                                                                                                       tmp/pipe_dummy                                                                  2048    
Atmos Hold Restart file rename failed on atmos_restart.hold

Model crashed: U_MODEL: Illegal combination of submodels                                                                                                                                                                                                                       tmp/pipe_dummy                                                                  2048    
Atmos Hold Restart file rename failed on atmos_restart.hold

Model crashed: U_MODEL: Illegal combination of submodels                                                                                                                                                                                                                       tmp/pipe_dummy                                                                  2048    
Atmos Hold Restart file rename failed on atmos_restart.hold

Model crashed: U_MODEL: Illegal combination of submodels                                                                                                                                                                                                                       tmp/pipe_dummy                                                                  2048    
Atmos Hold Restart file rename failed on atmos_restart.hold

Model crashed: U_MODEL: Illegal combination of submodels                                                                                                                                                                                                                       tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Jul 2012 11:29:05 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 440,640 659,692 1.4971
30 Jun 2012 13:46:11 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 414,720 619,452 1.4937
30 Jun 2012 00:24:58 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 388,800 578,974 1.4891
29 Jun 2012 11:25:30 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 362,880 538,421 1.4837
28 Jun 2012 22:44:10 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 336,960 498,564 1.4796
28 Jun 2012 11:26:29 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 311,040 460,161 1.4794
27 Jun 2012 22:54:06 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 285,120 421,668 1.4789
27 Jun 2012 11:35:39 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 259,200 384,444 1.4832
27 Jun 2012 00:14:42 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 233,280 347,458 1.4894
26 Jun 2012 13:05:00 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 207,360 310,830 1.4990
26 Jun 2012 01:41:59 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 181,440 274,384 1.5123
25 Jun 2012 14:41:44 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 155,520 237,877 1.5296
25 Jun 2012 02:42:44 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 129,600 200,862 1.5499
24 Jun 2012 09:35:39 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 103,680 164,056 1.5823
23 Jun 2012 21:07:13 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 77,760 126,209 1.6231
23 Jun 2012 09:08:44 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 51,840 88,689 1.7108
22 Jun 2012 21:50:17 1017301 13924010 hadcm3n_yian_1940_40_007682505_1 25,920 51,097 1.9713


©2024 cpdn.org