Task 13464951

Name hadcm3n_u27g_1980_40_007458297_3
Workunit 7655800
Created 7 Oct 2011, 2:29:07 UTC
Sent 7 Oct 2011, 2:29:12 UTC
Report deadline 6 Jan 2012, 9:56:23 UTC
Received 5 Nov 2011, 21:07:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 878893
Run time 23 days 7 hours 31 min 42 sec
CPU time 20 days 12 hours 36 min 25 sec
Validate state Invalid
Credit 12,130.56
Device peak FLOPS 2.93 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3772, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:29:15 (3064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/u27gko.pjl5c10
Error converting file to netcdf: dataout/u27gko.pil5c10
Error converting file to netcdf: dataout/u27gko.pfl5c10
Error converting file to netcdf: dataout/u27gka.phl5c10
Error converting file to netcdf: dataout/u27gka.pgl5c10
Error converting file to netcdf: dataout/u27gka.pel5c10
Error converting file to netcdf: dataout/u27gka.pdl5c10
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3264, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3264, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3264, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3348, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3348, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3348, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep   CPU Time (sec)  Average (sec/TS)
04 Nov 2011 14:50:14  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  1,010,880  1,755,990       1.7371
03 Nov 2011 23:04:03  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  984,960    1,706,349       1.7324
03 Nov 2011 08:59:35  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  959,040    1,659,450       1.7303
02 Nov 2011 17:58:08  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  933,120    1,613,706       1.7294
02 Nov 2011 02:31:36  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  907,200    1,567,996       1.7284
01 Nov 2011 11:31:39  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  881,280    1,524,703       1.7301
31 Oct 2011 22:26:59  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  855,360    1,481,578       1.7321
31 Oct 2011 19:39:45  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  829,440    1,438,503       1.7343
31 Oct 2011 19:21:04  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  803,520    1,395,477       1.7367
31 Oct 2011 18:58:57  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  777,600    1,352,391       1.7392
31 Oct 2011 18:39:37  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  751,680    1,309,261       1.7418
31 Oct 2011 18:20:56  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  725,760    1,266,115       1.7445
31 Oct 2011 17:37:59  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  699,840    1,223,092       1.7477
31 Oct 2011 17:18:58  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  673,920    1,180,025       1.7510
31 Oct 2011 16:59:11  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  648,000    1,136,791       1.7543
31 Oct 2011 16:38:33  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  622,080    1,093,577       1.7579
31 Oct 2011 15:36:02  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  596,160    1,050,541       1.7622
31 Oct 2011 14:26:17  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  570,240    1,006,954       1.7658
31 Oct 2011 14:26:17  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  544,320    963,095         1.7694
31 Oct 2011 14:26:17  878893   13464951   hadcm3n_u27g_1980_40_007458297_3  518,400    919,664         1.7740
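
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That reading is an assumption, since the page does not define the column, but it matches the listed values. A minimal Python sketch of the check follows; the helper name parse_trickle is hypothetical, and the three sample rows are copied from the table above:

```python
def parse_trickle(line):
    """Split one whitespace-separated trickle row into its numeric fields.

    Assumed layout (not documented on the page): tokens 0-3 are the date
    and time, 4-6 the host ID, result ID and result name, and 7-9 the
    timestep, CPU time in seconds, and reported average.
    """
    parts = line.split()
    timestep = int(parts[7].replace(",", ""))
    cpu_sec = int(parts[8].replace(",", ""))
    reported_avg = float(parts[9])
    return timestep, cpu_sec, reported_avg


# Newest, second-newest and oldest rows from the table.
rows = [
    "04 Nov 2011 14:50:14 878893 13464951 hadcm3n_u27g_1980_40_007458297_3 1,010,880 1,755,990 1.7371",
    "03 Nov 2011 23:04:03 878893 13464951 hadcm3n_u27g_1980_40_007458297_3 984,960 1,706,349 1.7324",
    "31 Oct 2011 14:26:17 878893 13464951 hadcm3n_u27g_1980_40_007458297_3 518,400 919,664 1.7740",
]

for row in rows:
    timestep, cpu_sec, reported = parse_trickle(row)
    computed = cpu_sec / timestep  # seconds of CPU time per model timestep
    print(f"timestep={timestep:>9}  computed={computed:.4f}  reported={reported:.4f}")
```

In the rows shown, consecutive trickles are 25,920 timesteps apart, so the same parser could also be used to spot a missing trickle by differencing the timestep column.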

