climateprediction.net home page
Task 15444436

Task 15444436

Name hadcm3n_ze5j_1880_40_008247337_1
Workunit 8402461
Created 21 Nov 2012, 7:00:16 UTC
Sent 21 Nov 2012, 7:00:22 UTC
Report deadline 20 Feb 2013, 14:27:33 UTC
Received 25 Jan 2013, 7:06:57 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 1183496
Run time 17 days 10 hours 21 min 46 sec
CPU time 17 days 3 hours 55 min 46 sec
Validate state Valid
Credit 12,441.60
Device peak FLOPS 2.48 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
08:02:31 (4648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5084, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3648, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4288, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4884, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
07:21:27 (5312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:09:44 (2680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5316, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
08:08:28 (2528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
11:00:40 (436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=576, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3296, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3296, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3296, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4728, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=580, iMonCtr=1
Model crash detected, will try to restart...
07:59:31 (1076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=1
Model crash detected, will try to restart...
15:55:54 (5100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4404, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=468, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
cpdnmonitor: cannot open input file dataout/ocean_restart.day after 11 attempts
cpdnmonitor: cannot open input file dataout/atmos_restart.hold after 11 attempts
cpdnmonitor: cannot open input file dataout/ocean_restart.day after 11 attempts
OPEN:  Unable to Open File dataout/ze5jka.pdb3c10 for Read/Write

Model crashed: STWORK  : Error opening output PP file on unit 63                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_se_6.07_windows_intelx86.dll after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_um_6.07_windows_intelx86.exe after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/jobs/xabnk.namelists after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/dataout/ocean_restart.day after 11 attempts
09:17:36 (5032): Can't open init data file - running in standalone mode
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_se_6.07_windows_intelx86.dll after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_um_6.07_windows_intelx86.exe after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/jobs/xabnk.ihist after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/jobs/xabnk.namelists after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ze5j_1880_40_008247337/dataout/ocean_restart.day after 11 attempts
09:19:57 (5032): Can't open init data file - running in standalone mode
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4288, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5092, iMonCtr=1
Model crash detected, will try to restart...
10:52:26 (1148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1
Model crash detected, will try to restart...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Jan 2013 15:35:39 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 1,036,800 1,482,365 1.4298
23 Jan 2013 20:57:28 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 1,010,880 1,445,404 1.4298
23 Jan 2013 07:12:56 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 984,960 1,408,521 1.4300
21 Jan 2013 15:50:51 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 959,040 1,371,828 1.4304
20 Jan 2013 14:15:37 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 933,120 1,335,436 1.4312
17 Jan 2013 14:31:08 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 907,200 1,298,424 1.4312
16 Jan 2013 13:40:29 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 881,280 1,261,517 1.4315
15 Jan 2013 12:25:06 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 855,360 1,223,793 1.4307
12 Jan 2013 13:40:15 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 829,440 1,188,003 1.4323
10 Jan 2013 09:53:50 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 803,520 1,150,851 1.4323
09 Jan 2013 08:20:08 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 777,600 1,114,080 1.4327
08 Jan 2013 07:26:42 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 751,680 1,077,402 1.4333
05 Jan 2013 19:40:40 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 725,760 1,039,694 1.4326
03 Jan 2013 11:06:26 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 699,840 1,002,206 1.4321
02 Jan 2013 09:50:20 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 673,920 964,942 1.4318
31 Dec 2012 10:53:53 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 648,000 927,761 1.4317
30 Dec 2012 10:19:51 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 622,080 890,943 1.4322
29 Dec 2012 12:15:08 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 596,160 854,315 1.4330
25 Dec 2012 20:09:41 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 570,240 816,880 1.4325
19 Dec 2012 16:03:13 1183496 15444436 hadcm3n_ze5j_1880_40_008247337_1 544,320 778,282 1.4298


©2024 cpdn.org