climateprediction.net home page
Task 12748180

Task 12748180

Name hadcm3n_o6af_1900_40_007203482_1
Workunit 7401762
Created 28 Mar 2011, 14:16:30 UTC
Sent 29 Mar 2011, 6:54:04 UTC
Report deadline 28 Jun 2011, 14:21:15 UTC
Received 3 May 2011, 3:33:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1070684
Run time 14 days 7 hours 36 min 12 sec
CPU time 13 days 11 hours 14 min 29 sec
Validate state Invalid
Credit 7,153.92
Device peak FLOPS 2.07 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
21:50:03 (6536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:51:55 (7388): Can't acquire lockfile (32) - waiting 35s
21:52:18 (1112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
01:15:31 (7328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:43:35 (1104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:35:22 (5588): No heartbeat from core client for 30 sec - exiting
18:35:23 (5588): No heartbeat from core client for 30 sec - exiting
18:35:24 (5588): No heartbeat from core client for 30 sec - exiting
18:35:25 (5588): No heartbeat from core client for 30 sec - exiting
18:35:26 (5588): No heartbeat from core client for 30 sec - exiting
18:35:27 (5588): No heartbeat from core client for 30 sec - exiting
18:35:28 (5588): No heartbeat from core client for 30 sec - exiting
18:35:29 (5588): No heartbeat from core client for 30 sec - exiting
18:35:30 (5588): No heartbeat from core client for 30 sec - exiting
18:35:31 (5588): No heartbeat from core client for 30 sec - exiting
18:35:32 (5588): No heartbeat from core client for 30 sec - exiting
18:35:33 (5588): No heartbeat from core client for 30 sec - exiting
18:35:34 (5588): No heartbeat from core client for 30 sec - exiting
18:35:35 (5588): No heartbeat from core client for 30 sec - exiting
18:35:36 (5588): No heartbeat from core client for 30 sec - exiting
18:35:37 (5588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3004, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3004, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3004, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 May 2011 03:34:56 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 596,160 1,157,928 1.9423
03 May 2011 03:34:56 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 570,240 1,107,088 1.9414
03 May 2011 03:34:56 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 544,320 1,059,283 1.9461
03 May 2011 03:34:56 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 518,400 1,010,860 1.9500
03 May 2011 03:34:56 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 492,480 961,255 1.9519
03 May 2011 03:34:56 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 466,560 910,374 1.9512
03 May 2011 03:34:55 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 440,640 860,741 1.9534
24 Apr 2011 23:07:31 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 414,720 811,606 1.9570
12 Apr 2011 09:05:04 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 388,800 762,196 1.9604
12 Apr 2011 09:05:03 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 362,880 710,431 1.9578
12 Apr 2011 09:04:03 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 336,960 658,480 1.9542
12 Apr 2011 09:03:35 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 311,040 606,905 1.9512
12 Apr 2011 09:03:35 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 285,120 554,701 1.9455
08 Apr 2011 00:24:50 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 259,200 503,316 1.9418
08 Apr 2011 00:24:50 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 233,280 452,309 1.9389
08 Apr 2011 00:24:50 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 207,360 400,739 1.9326
08 Apr 2011 00:24:50 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 181,440 349,526 1.9264
03 Apr 2011 22:27:10 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 155,520 298,504 1.9194
03 Apr 2011 22:27:10 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 129,600 248,092 1.9143
03 Apr 2011 22:27:10 1070684 12748180 hadcm3n_o6af_1900_40_007203482_1 103,680 197,294 1.9029


©2024 cpdn.org