Task 13584682

Name hadcm3n_y97i_1900_40_007521761_4
Workunit 7719236
Created 2 Nov 2011, 7:09:00 UTC
Sent 2 Nov 2011, 7:19:46 UTC
Report deadline 1 Feb 2012, 14:46:57 UTC
Received 19 Nov 2011, 2:49:07 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1177504
Run time 16 days 0 hours 11 min 35 sec
CPU time 13 days 13 hours 28 min 47 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 1.90 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
06:48:33 (4652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
06:10:27 (4692): No heartbeat from core client for 30 sec - exiting
06:10:28 (4692): No heartbeat from core client for 30 sec - exiting
06:10:29 (4692): No heartbeat from core client for 30 sec - exiting
06:10:30 (4692): No heartbeat from core client for 30 sec - exiting
06:10:31 (4692): No heartbeat from core client for 30 sec - exiting
06:10:32 (4692): No heartbeat from core client for 30 sec - exiting
06:10:33 (4692): No heartbeat from core client for 30 sec - exiting
06:10:34 (4692): No heartbeat from core client for 30 sec - exiting
06:10:35 (4692): No heartbeat from core client for 30 sec - exiting
06:10:36 (4692): No heartbeat from core client for 30 sec - exiting
06:10:38 (4692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:41:26 (2000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITHEAD: I/O error  tmp/pipe_dummy  2048
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA  tmp/pipe_dummy  2048
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA  tmp/pipe_dummy  2048
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA  tmp/pipe_dummy  2048
forrtl: There is not enough space on the disk.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10932, iMonCtr=1
Model crash detected, will try to restart...
forrtl: There is not enough space on the disk.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10932, iMonCtr=1
Model crash detected, will try to restart...
forrtl: There is not enough space on the disk.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10932, iMonCtr=1
Model crash detected, will try to restart...
forrtl: There is not enough space on the disk.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10932, iMonCtr=1
Model crash detected, will try to restart...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
18 Nov 2011 09:29:22  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  492,480   1,150,199       2.3355
17 Nov 2011 13:19:33  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  466,560   1,089,336       2.3348
16 Nov 2011 17:07:53  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  440,640   1,028,625       2.3344
15 Nov 2011 22:19:47  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  414,720   967,900         2.3339
15 Nov 2011 18:54:45  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  388,800   907,017         2.3329
15 Nov 2011 18:54:45  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  362,880   846,038         2.3315
15 Nov 2011 18:54:45  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  336,960   783,411         2.3249
15 Nov 2011 18:54:45  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  311,040   722,779         2.3237
15 Nov 2011 18:54:45  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  285,120   661,933         2.3216
15 Nov 2011 18:54:45  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  259,200   602,347         2.3239
09 Nov 2011 19:40:46  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  233,280   544,522         2.3342
08 Nov 2011 23:50:46  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  207,360   486,146         2.3445
08 Nov 2011 06:39:40  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  181,440   427,348         2.3553
07 Nov 2011 10:46:46  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  155,520   366,598         2.3572
06 Nov 2011 15:51:08  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  129,600   305,265         2.3554
05 Nov 2011 19:39:02  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  103,680   245,840         2.3711
04 Nov 2011 23:01:04  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  77,760    185,558         2.3863
04 Nov 2011 02:27:29  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  51,840    122,657         2.3661
03 Nov 2011 03:40:32  1177504  13584682   hadcm3n_y97i_1900_40_007521761_4  25,920    59,871          2.3098
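
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep reached. That relationship is an inference from the numbers, not something the page states. The short Python sketch below checks it against three rows copied from the table; the names `TRICKLES` and `seconds_per_timestep` are hypothetical and exist only for this illustration.

```python
# Rows copied from the trickle table:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (492_480, 1_150_199, 2.3355),
    (466_560, 1_089_336, 2.3348),
    (25_920, 59_871, 2.3098),
]


def seconds_per_timestep(cpu_seconds, timestep):
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # Print the computed value next to the reported one, both to 4 decimal places
    print(f"{timestep:>9,}  computed={computed:.4f}  reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree to four decimal places for every row, which suggests the average is a running figure over the whole task, not a per-trickle rate.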