climateprediction.net home page
Task 16132819

Task 16132819

Name hadcm3n_o8qj_1900_40_008466446_3
Workunit 8617285
Created 5 Dec 2013, 15:12:34 UTC
Sent 5 Dec 2013, 15:18:01 UTC
Report deadline 6 Mar 2014, 22:45:12 UTC
Received 23 Dec 2013, 10:31:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1296485
Run time 10 days 9 hours 51 min 36 sec
CPU time 9 days 17 hours 7 min 52 sec
Validate state Invalid
Credit 3,732.48
Device peak FLOPS 1.99 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
Enheden genkender ikke kommandoen.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
10:18:35 (5152): No heartbeat from core client for 30 sec - exiting
10:18:36 (5152): No heartbeat from core client for 30 sec - exiting
10:18:37 (5152): No heartbeat from core client for 30 sec - exiting
10:18:38 (5152): No heartbeat from core client for 30 sec - exiting
10:18:39 (5152): No heartbeat from core client for 30 sec - exiting
10:18:40 (5152): No heartbeat from core client for 30 sec - exiting
10:18:41 (5152): No heartbeat from core client for 30 sec - exiting
10:18:42 (5152): No heartbeat from core client for 30 sec - exiting
10:18:43 (5152): No heartbeat from core client for 30 sec - exiting
10:18:44 (5152): No heartbeat from core client for 30 sec - exiting
10:18:45 (5152): No heartbeat from core client for 30 sec - exiting
10:18:46 (5152): No heartbeat from core client for 30 sec - exiting
10:18:47 (5152): No heartbeat from core client for 30 sec - exiting
10:18:48 (5152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
10:02:52 (5312): No heartbeat from core client for 30 sec - exiting
10:02:53 (5312): No heartbeat from core client for 30 sec - exiting
10:02:54 (5312): No heartbeat from core client for 30 sec - exiting
10:02:55 (5312): No heartbeat from core client for 30 sec - exiting
10:02:56 (5312): No heartbeat from core client for 30 sec - exiting
10:02:57 (5312): No heartbeat from core client for 30 sec - exiting
10:02:58 (5312): No heartbeat from core client for 30 sec - exiting
10:02:59 (5312): No heartbeat from core client for 30 sec - exiting
10:03:00 (5312): No heartbeat from core client for 30 sec - exiting
10:03:01 (5312): No heartbeat from core client for 30 sec - exiting
10:03:02 (5312): No heartbeat from core client for 30 sec - exiting
10:03:03 (5312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4284, iMonCtr=1
Model crash detected, will try to restart...
10:23:00 (4240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4232, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
10:30:34 (5568): No heartbeat from core client for 30 sec - exiting
10:30:35 (5568): No heartbeat from core client for 30 sec - exiting
10:30:36 (5568): No heartbeat from core client for 30 sec - exiting
10:30:37 (5568): No heartbeat from core client for 30 sec - exiting
10:30:38 (5568): No heartbeat from core client for 30 sec - exiting
10:30:39 (5568): No heartbeat from core client for 30 sec - exiting
10:30:40 (5568): No heartbeat from core client for 30 sec - exiting
10:30:41 (5568): No heartbeat from core client for 30 sec - exiting
10:30:42 (5568): No heartbeat from core client for 30 sec - exiting
10:30:43 (5568): No heartbeat from core client for 30 sec - exiting
10:30:44 (5568): No heartbeat from core client for 30 sec - exiting
10:30:45 (5568): No heartbeat from core client for 30 sec - exiting
10:30:46 (5568): No heartbeat from core client for 30 sec - exiting
10:30:47 (5568): No heartbeat from core client for 30 sec - exiting
10:30:48 (5568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=1
Model crash detected, will try to restart...
10:47:06 (5784): No heartbeat from core client for 30 sec - exiting
10:47:07 (5784): No heartbeat from core client for 30 sec - exiting
10:47:08 (5784): No heartbeat from core client for 30 sec - exiting
10:47:09 (5784): No heartbeat from core client for 30 sec - exiting
10:47:10 (5784): No heartbeat from core client for 30 sec - exiting
10:47:11 (5784): No heartbeat from core client for 30 sec - exiting
10:47:12 (5784): No heartbeat from core client for 30 sec - exiting
10:47:13 (5784): No heartbeat from core client for 30 sec - exiting
10:47:14 (5784): No heartbeat from core client for 30 sec - exiting
10:47:15 (5784): No heartbeat from core client for 30 sec - exiting
10:47:16 (5784): No heartbeat from core client for 30 sec - exiting
10:47:17 (5784): No heartbeat from core client for 30 sec - exiting
10:47:18 (5784): No heartbeat from core client for 30 sec - exiting
10:47:19 (5784): No heartbeat from core client for 30 sec - exiting
10:47:20 (5784): No heartbeat from core client for 30 sec - exiting
10:47:21 (5784): No heartbeat from core client for 30 sec - exiting
10:47:22 (5784): No heartbeat from core client for 30 sec - exiting
10:47:23 (5784): No heartbeat from core client for 30 sec - exiting
10:47:24 (5784): No heartbeat from core client for 30 sec - exiting
10:47:25 (5784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Dec 2013 18:00:48 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 311,040 817,127 2.6271
20 Dec 2013 14:20:41 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 285,120 752,002 2.6375
18 Dec 2013 23:37:36 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 259,200 684,105 2.6393
17 Dec 2013 15:42:07 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 233,280 615,262 2.6374
16 Dec 2013 12:23:34 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 207,360 546,673 2.6363
14 Dec 2013 22:33:32 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 181,440 483,848 2.6667
13 Dec 2013 18:05:23 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 155,520 415,645 2.6726
12 Dec 2013 11:26:37 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 129,600 346,226 2.6715
10 Dec 2013 20:33:57 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 103,680 276,392 2.6658
09 Dec 2013 14:05:32 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 77,760 206,889 2.6606
08 Dec 2013 10:00:20 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 51,840 142,317 2.7453
06 Dec 2013 21:31:28 1296485 16132819 hadcm3n_o8qj_1900_40_008466446_3 25,920 72,650 2.8029


©2024 cpdn.org