climateprediction.net home page
Task 15907871

Task 15907871

Name hadcm3n_n26p_1880_40_008374255_1
Workunit 8525114
Created 25 Jul 2013, 16:02:31 UTC
Sent 25 Jul 2013, 16:03:08 UTC
Report deadline 24 Oct 2013, 23:30:19 UTC
Received 23 Aug 2013, 13:46:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1251442
Run time 13 days 18 hours 5 min 47 sec
CPU time 9 days 11 hours 53 min 46 sec
Validate state Invalid
Credit 3,732.48
Device peak FLOPS 2.05 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6156, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7176, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4780, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1532, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6128, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6128, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3008, iMonCtr=1
Model crash detected, will try to restart...
12:02:04 (4412): No heartbeat from core client for 30 sec - exiting
12:02:05 (4412): No heartbeat from core client for 30 sec - exiting
12:02:06 (4412): No heartbeat from core client for 30 sec - exiting
12:02:07 (4412): No heartbeat from core client for 30 sec - exiting
12:02:08 (4412): No heartbeat from core client for 30 sec - exiting
12:02:09 (4412): No heartbeat from core client for 30 sec - exiting
12:02:10 (4412): No heartbeat from core client for 30 sec - exiting
12:02:11 (4412): No heartbeat from core client for 30 sec - exiting
12:02:12 (4412): No heartbeat from core client for 30 sec - exiting
12:02:14 (4412): No heartbeat from core client for 30 sec - exiting
12:02:15 (4412): No heartbeat from core client for 30 sec - exiting
12:02:16 (4412): No heartbeat from core client for 30 sec - exiting
12:02:17 (4412): No heartbeat from core client for 30 sec - exiting
12:02:18 (4412): No heartbeat from core client for 30 sec - exiting
12:02:19 (4412): No heartbeat from core client for 30 sec - exiting
12:02:20 (4412): No heartbeat from core client for 30 sec - exiting
12:02:21 (4412): No heartbeat from core client for 30 sec - exiting
12:02:22 (4412): No heartbeat from core client for 30 sec - exiting
12:02:23 (4412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:06:31 (3464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:12:26 (1664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3856, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=776, iMonCtr=1
Model crash detected, will try to restart...
10:24:50 (5064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:24:51 (5064): No heartbeat from core client for 30 sec - exiting
10:24:52 (5064): No heartbeat from core client for 30 sec - exiting
10:24:53 (5064): No heartbeat from core client for 30 sec - exiting
10:24:54 (5064): No heartbeat from core client for 30 sec - exiting
10:24:55 (5064): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1
Model crash detected, will try to restart...
13:32:20 (6088): No heartbeat from core client for 30 sec - exiting
13:32:22 (6088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
15:38:12 (5492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2136, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3804, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6544, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3020, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2984, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5432, iMonCtr=1
Model crash detected, will try to restart...
11:54:35 (6184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6012, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Aug 2013 19:59:25 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 311,040 796,950 2.5622
19 Aug 2013 19:47:09 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 285,120 728,746 2.5559
17 Aug 2013 02:08:30 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 259,200 663,809 2.5610
15 Aug 2013 14:58:00 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 233,280 597,974 2.5633
14 Aug 2013 17:49:28 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 207,360 532,812 2.5695
14 Aug 2013 17:49:28 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 181,440 465,629 2.5663
14 Aug 2013 17:49:28 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 155,520 391,468 2.5172
14 Aug 2013 17:49:28 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 129,600 323,607 2.4970
14 Aug 2013 17:49:28 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 103,680 255,933 2.4685
14 Aug 2013 17:49:28 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 77,760 189,956 2.4428
14 Aug 2013 17:49:28 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 51,840 127,842 2.4661
29 Jul 2013 14:34:52 1251442 15907871 hadcm3n_n26p_1880_40_008374255_1 25,920 63,856 2.4636


©2024 climateprediction.net