climateprediction.net home page
Task 16369302

Task 16369302

Name hadcm3n_7h6a_1980_40_008433541_3
Workunit 8584397
Created 15 Mar 2014, 2:45:33 UTC
Sent 15 Mar 2014, 2:45:38 UTC
Report deadline 16 Sep 2023, 8:05:38 UTC
Received 16 Jun 2014, 13:36:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1277257
Run time 20 days 6 hours 3 min 42 sec
CPU time 17 days 9 hours 39 min 26 sec
Validate state Invalid
Credit 11,197.44
Device peak FLOPS 2.92 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:38:43 (6408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:58:58 (6652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:59:47 (6856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/7h6ako.pjj1c10
Error converting file to netcdf: dataout/7h6ako.pij1c10
Error converting file to netcdf: dataout/7h6ako.pfj1c10
Error converting file to netcdf: dataout/7h6aka.phj1c10
Error converting file to netcdf: dataout/7h6aka.pgj1c10
Error converting file to netcdf: dataout/7h6aka.pej1c10
Error converting file to netcdf: dataout/7h6aka.pdj1c10
08:40:59 (8004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:13:43 (3592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8792, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Jun 2014 04:51:35 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 933,120 1,466,805 1.5719
10 Jun 2014 09:49:40 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 907,200 1,426,970 1.5729
10 Jun 2014 09:03:33 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 881,280 1,386,444 1.5732
10 Jun 2014 09:02:31 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 855,360 1,345,690 1.5732
10 Jun 2014 09:01:23 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 829,440 1,304,001 1.5721
10 Jun 2014 07:42:57 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 803,520 1,262,169 1.5708
05 Jun 2014 11:20:11 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 777,600 1,220,712 1.5698
04 Jun 2014 23:38:28 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 751,680 1,179,637 1.5693
04 Jun 2014 12:26:39 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 725,760 1,138,554 1.5688
04 Jun 2014 00:09:25 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 699,840 1,097,246 1.5679
03 Jun 2014 12:27:22 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 673,920 1,056,018 1.5670
03 Jun 2014 00:44:59 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 648,000 1,014,947 1.5663
02 Jun 2014 13:13:36 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 622,080 974,280 1.5662
02 Jun 2014 01:40:54 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 596,160 933,897 1.5665
01 Jun 2014 13:19:28 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 570,240 893,659 1.5672
01 Jun 2014 01:11:20 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 544,320 853,325 1.5677
31 May 2014 09:56:42 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 518,400 813,470 1.5692
30 May 2014 17:06:59 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 492,480 773,715 1.5711
27 May 2014 13:12:23 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 466,560 733,574 1.5723
26 May 2014 19:04:44 1277257 16369302 hadcm3n_7h6a_1980_40_008433541_3 440,640 691,236 1.5687


©2024 cpdn.org