climateprediction.net home page
Task 17428358

Task 17428358

Name hadcm3n_xcqq_1940_40_009152808_2
Workunit 9283144
Created 18 Nov 2014, 0:18:54 UTC
Sent 18 Nov 2014, 0:34:46 UTC
Report deadline 17 Feb 2015, 8:01:57 UTC
Received 3 Dec 2014, 21:51:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1254930
Run time 6 days 20 hours 32 min 9 sec
CPU time 5 days 21 hours 11 min 50 sec
Validate state Invalid
Credit 4,354.56
Device peak FLOPS 2.74 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:21:29 (7496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7032, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5768, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5768, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:50:36 (7520): No heartbeat from core client for 30 sec - exiting
06:50:37 (7520): No heartbeat from core client for 30 sec - exiting
06:50:38 (7520): No heartbeat from core client for 30 sec - exiting
06:50:39 (7520): No heartbeat from core client for 30 sec - exiting
06:50:40 (7520): No heartbeat from core client for 30 sec - exiting
06:50:41 (7520): No heartbeat from core client for 30 sec - exiting
06:50:42 (7520): No heartbeat from core client for 30 sec - exiting
06:50:43 (7520): No heartbeat from core client for 30 sec - exiting
06:50:44 (7520): No heartbeat from core client for 30 sec - exiting
06:50:45 (7520): No heartbeat from core client for 30 sec - exiting
06:50:46 (7520): No heartbeat from core client for 30 sec - exiting
06:50:47 (7520): No heartbeat from core client for 30 sec - exiting
06:50:48 (7520): No heartbeat from core client for 30 sec - exiting
06:50:49 (7520): No heartbeat from core client for 30 sec - exiting
06:50:50 (7520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1

Model crashed: SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1
SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1

Model crashed: SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1
SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1

Model crashed: SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1
SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1

Model crashed: SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1
SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1

Model crashed: SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1
SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1

Model crashed: SETPOS: Unit 41 to Word Address 1065772379 Failed with Error Code -1
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Dec 2014 09:13:10 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 362,880 491,151 1.3535
02 Dec 2014 23:14:08 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 336,960 456,915 1.3560
02 Dec 2014 07:22:29 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 311,040 423,260 1.3608
30 Nov 2014 06:45:42 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 285,120 389,232 1.3652
29 Nov 2014 19:48:17 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 259,200 354,387 1.3672
29 Nov 2014 01:01:45 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 233,280 319,460 1.3694
28 Nov 2014 05:46:21 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 207,360 284,785 1.3734
27 Nov 2014 19:25:49 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 181,440 250,063 1.3782
27 Nov 2014 08:37:03 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 155,520 214,938 1.3821
26 Nov 2014 22:07:33 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 129,600 179,854 1.3878
26 Nov 2014 09:58:24 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 103,680 144,294 1.3917
25 Nov 2014 22:43:37 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 77,760 108,586 1.3964
25 Nov 2014 07:02:08 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 51,840 72,535 1.3992
24 Nov 2014 17:04:06 1254930 17428358 hadcm3n_xcqq_1940_40_009152808_2 25,920 36,070 1.3916


©2024 climateprediction.net