climateprediction.net home page
Task 16005845

Task 16005845

Name hadcm3n_n1wj_1880_40_008374197_2
Workunit 8525056
Created 6 Sep 2013, 15:41:50 UTC
Sent 6 Sep 2013, 16:04:56 UTC
Report deadline 6 Dec 2013, 23:32:07 UTC
Received 9 Oct 2013, 2:33:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1179495
Run time 18 days 15 hours 48 min 32 sec
CPU time 14 days 23 hours 33 min 56 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 1.48 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
Das Gerät erkennt den Befehl nicht.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3388, iMonCtr=1
Model crash detected, will try to restart...
16:43:41 (3916): No heartbeat from core client for 30 sec - exiting
16:43:43 (3916): No heartbeat from core client for 30 sec - exiting
16:43:44 (3916): No heartbeat from core client for 30 sec - exiting
16:43:45 (3916): No heartbeat from core client for 30 sec - exiting
16:43:46 (3916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3420, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3420, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3416, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3416, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1296, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
16:34:41 (1188): No heartbeat from core client for 30 sec - exiting
16:34:43 (1188): No heartbeat from core client for 30 sec - exiting
16:34:44 (1188): No heartbeat from core client for 30 sec - exiting
16:34:45 (1188): No heartbeat from core client for 30 sec - exiting
16:34:46 (1188): No heartbeat from core client for 30 sec - exiting
16:34:47 (1188): No heartbeat from core client for 30 sec - exiting
16:34:48 (1188): No heartbeat from core client for 30 sec - exiting
16:34:49 (1188): No heartbeat from core client for 30 sec - exiting
16:34:50 (1188): No heartbeat from core client for 30 sec - exiting
16:34:51 (1188): No heartbeat from core client for 30 sec - exiting
16:34:52 (1188): No heartbeat from core client for 30 sec - exiting
16:34:53 (1188): No heartbeat from core client for 30 sec - exiting
16:34:54 (1188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:34:55 (1188): No heartbeat from core client for 30 sec - exiting
16:34:56 (1188): No heartbeat from core client for 30 sec - exiting
16:34:57 (1188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Skipping gmts_generator due to netcdf error 13 - Permission denied
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
Skipping gmts_generator due to netcdf error -49 - Variable not found
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Skipping gmts_generator due to netcdf error 13 - Permission denied
Skipping gmts_generator due to netcdf error 13 - Permission denied
Skipping gmts_generator due to netcdf error 13 - Permission denied
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Skipping gmts_generator due to netcdf error 13 - Permission denied
Skipping gmts_generator due to netcdf error 13 - Permission denied
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6232, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Oct 2013 01:45:19 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 388,800 1,227,988 3.1584
05 Oct 2013 13:05:59 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 362,880 1,144,362 3.1536
03 Oct 2013 13:49:07 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 336,960 1,059,884 3.1454
30 Sep 2013 17:48:05 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 311,040 979,018 3.1476
28 Sep 2013 15:20:29 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 285,120 894,507 3.1373
26 Sep 2013 23:39:25 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 259,200 815,821 3.1475
25 Sep 2013 09:26:54 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 233,280 733,710 3.1452
23 Sep 2013 14:35:37 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 207,360 649,653 3.1330
21 Sep 2013 03:33:57 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 181,440 565,385 3.1161
18 Sep 2013 22:39:34 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 155,520 483,317 3.1077
16 Sep 2013 19:29:44 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 129,600 404,131 3.1183
14 Sep 2013 22:31:22 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 103,680 326,002 3.1443
12 Sep 2013 22:53:45 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 77,760 244,593 3.1455
10 Sep 2013 19:57:23 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 51,840 164,861 3.1802
08 Sep 2013 04:05:20 1179495 16005845 hadcm3n_n1wj_1880_40_008374197_2 25,920 83,792 3.2327


©2024 cpdn.org