climateprediction.net home page
Task 15558613

Task 15558613

Name hadcm3n_zm0e_1920_40_008255879_3
Workunit 8411003
Created 26 Jan 2013, 19:40:03 UTC
Sent 26 Jan 2013, 19:40:30 UTC
Report deadline 28 Apr 2013, 3:07:41 UTC
Received 28 Feb 2013, 19:58:12 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1213041
Run time 9 days 22 hours 24 min 55 sec
CPU time 9 days 8 hours 40 min 12 sec
Validate state Invalid
Credit 4,354.56
Device peak FLOPS 3.04 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
16:55:09 (3468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:12:27 (4968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:27:42 (3476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:55:54 (4664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:17:15 (4504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:20:25 (3556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:20:26 (3556): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
16:45:41 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3460, iMonCtr=1
Model cSuspended CPDN Monitor - Suspend request from BOINC...
17:44:05 (4460): No heartbeat from core client for 30 sec - exiting
17:44:06 (4460): No heartbeat from core client for 30 sec - exiting
17:44:07 (4460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:49:59 (3408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:01:58 (3552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:43:35 (4100): No heartbeat from core client for 30 sec - exiting
10:43:36 (4100): No heartbeat from core client for 30 sec - exiting
10:43:37 (4100): No heartbeat from core client for 30 sec - exiting
10:43:38 (4100): No heartbeat from core client for 30 sec - exiting
10:43:39 (4100): No heartbeat from core client for 30 sec - exiting
10:43:40 (4100): No heartbeat from core client for 30 sec - exiting
10:43:41 (4100): No heartbeat from core client for 30 sec - exiting
10:43:42 (4100): No heartbeat from core client for 30 sec - exiting
10:43:43 (4100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:16:28 (4236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
17:11:40 (3636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 116 - Return code = 16

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish
17:13:00 (4732): No heartbeat from core client for 30 sec - exiting

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Feb 2013 22:16:33 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 362,880 766,839 2.1132
24 Feb 2013 13:30:25 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 336,960 710,974 2.1100
22 Feb 2013 19:41:49 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 311,040 655,174 2.1064
21 Feb 2013 12:55:10 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 285,120 599,625 2.1031
19 Feb 2013 17:41:40 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 259,200 544,250 2.0997
17 Feb 2013 18:44:20 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 233,280 488,903 2.0958
16 Feb 2013 15:05:36 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 207,360 433,271 2.0895
13 Feb 2013 18:54:23 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 181,440 379,289 2.0904
10 Feb 2013 15:00:43 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 155,520 324,934 2.0893
06 Feb 2013 18:50:37 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 129,600 271,424 2.0943
03 Feb 2013 20:58:48 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 103,680 215,977 2.0831
02 Feb 2013 14:15:33 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 77,760 160,371 2.0624
30 Jan 2013 19:04:42 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 51,840 106,209 2.0488
28 Jan 2013 16:45:47 1213041 15558613 hadcm3n_zm0e_1920_40_008255879_3 25,920 52,836 2.0384


©2024 cpdn.org