climateprediction.net home page
Task 14346640

Task 14346640

Name hadam3p_eu_2oho_1990_1_007852917_0
Workunit 8008029
Created 2 Apr 2012, 14:44:58 UTC
Sent 2 Apr 2012, 14:45:02 UTC
Report deadline 15 Mar 2013, 20:05:02 UTC
Received 21 Apr 2012, 4:23:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1104312
Run time 7 days 8 hours 45 min 55 sec
CPU time 6 days 23 hours 7 min 25 sec
Validate state Invalid
Credit 2,378.11
Device peak FLOPS 1.77 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:23:14 (5420): Can't acquire lockfile (32) - waiting 35s
01:23:49 (5420): Can't acquire lockfile (32) - exiting
01:23:49 (5420): Error: Der Prozess kann nicht auf die Datei zugreifen, da sie von einem anderen Prozess verwendet wird. (0x20)
CPDN Monitor - Quit request from BOINC...
Atmos Restart file copy failed on 2ohoma.daj1470
Precis Restart file copy #1 failed on 2ohoga.daj1470
Atmos Restart file copy failed on atmos_restart.day
Atmos Restart file copy failed on 2ohoma.daj1480
Precis Restart file copy #1 failed on 2ohoga.daj1480
Atmos Restart file copy failed on atmos_restart.day
Atmos Restart file copy failed on 2ohoma.daj1490
Precis Restart file copy #1 failed on 2ohoga.daj1490
Atmos Restart file copy failed on atmos_restart.day
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1184, selfPID=1184, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
01:24:51 (1924): Can't acquire lockfile (32) - waiting 35s
01:25:26 (1924): Can't acquire lockfile (32) - exiting
01:25:26 (1924): Error: Der Prozess kann nicht auf die Datei zugreifen, da sie von einem anderen Prozess verwendet wird. (0x20)
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3564, selfPID=3564, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1764, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3968, selfPID=3968, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2152, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Precis Restart file copy #1 failed on 2ohoga.daj18f0
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3452, selfPID=3452, iMonCtr=2
21:38:42 (4416): No heartbeat from core client for 30 sec - exiting
21:38:43 (4416): No heartbeat from core client for 30 sec - exiting
21:38:44 (4416): No heartbeat from core client for 30 sec - exiting
21:38:46 (4416): No heartbeat from core client for 30 sec - exiting
21:38:47 (4416): No heartbeat from core client for 30 sec - exiting
21:38:48 (4416): No heartbeat from core client for 30 sec - exiting
21:38:49 (4416): No heartbeat from core client for 30 sec - exiting
21:38:50 (4416): No heartbeat from core client for 30 sec - exiting
21:38:51 (4416): No heartbeat from core client for 30 sec - exiting
21:38:52 (4416): No heartbeat from core client for 30 sec - exiting
21:38:53 (4416): No heartbeat from core client for 30 sec - exiting
21:38:54 (4416): No heartbeat from core client for 30 sec - exiting
21:38:55 (4416): No heartbeat from core client for 30 sec - exiting
21:38:56 (4416): No heartbeat from core client for 30 sec - exiting
21:38:58 (4416): No heartbeat from core client for 30 sec - exiting
21:38:59 (4416): No heartbeat from core client for 30 sec - exiting
21:39:00 (4416): No heartbeat from core client for 30 sec - exiting
21:39:01 (4416): No heartbeat from core client for 30 sec - exiting
21:39:02 (4416): No heartbeat from core client for 30 sec - exiting
21:39:03 (4416): No heartbeat from core client for 30 sec - exiting
21:39:04 (4416): No heartbeat from core client for 30 sec - exiting
21:39:05 (4416): No heartbeat from core client for 30 sec - exiting
21:39:06 (4416): No heartbeat from core client for 30 sec - exiting
21:39:08 (4416): No heartbeat from core client for 30 sec - exiting
21:39:09 (4416): No heartbeat from core client for 30 sec - exiting
21:39:10 (4416): No heartbeat from core client for 30 sec - exiting
21:39:11 (4416): No heartbeat from core client for 30 sec - exiting
21:39:12 (4416): No heartbeat from core client for 30 sec - exiting
21:39:13 (4416): No heartbeat from core client for 30 sec - exiting
21:39:14 (4416): No heartbeat from core client for 30 sec - exiting
21:39:15 (4416): No heartbeat from core client for 30 sec - exiting
21:39:16 (4416): No heartbeat from core client for 30 sec - exiting
21:39:17 (4416): No heartbeat from core client for 30 sec - exiting
21:39:19 (4416): No heartbeat from core client for 30 sec - exiting
21:39:20 (4416): No heartbeat from core client for 30 sec - exiting
21:39:21 (4416): No heartbeat from core client for 30 sec - exiting
21:39:22 (4416): No heartbeat from core client for 30 sec - exiting
21:39:23 (4416): No heartbeat from core client for 30 sec - exiting
21:39:24 (4416): No heartbeat from core client for 30 sec - exiting
21:39:25 (4416): No heartbeat from core client for 30 sec - exiting
21:39:26 (4416): No heartbeat from core client for 30 sec - exiting
21:39:27 (4416): No heartbeat from core client for 30 sec - exiting
21:39:28 (4416): No heartbeat from core client for 30 sec - exiting
21:39:30 (4416): No heartbeat from core client for 30 sec - exiting
21:39:31 (4416): No heartbeat from core client for 30 sec - exiting
21:39:32 (4416): No heartbeat from core client for 30 sec - exiting
21:39:33 (4416): No heartbeat from core client for 30 sec - exiting
21:39:34 (4416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5380, selfPID=5192, iMonCtr=1
Model crash detected, will try to restart...
Precis Restart file copy #1 failed on 2ohoga.daj1980
Atmos Restart file copy failed on atmos_restart.day
Atmos Restart file copy failed on 2ohoma.daj1990
Precis Restart file copy #1 failed on 2ohoga.daj1990
Atmos Restart file copy failed on atmos_restart.day
Atmos Restart file copy failed on 2ohoma.daj19a0
Precis Restart file copy #1 failed on 2ohoga.daj19a0
Atmos Restart file copy failed on atmos_restart.day
Atmos Restart file copy failed on 2ohoma.daj19b0
Precis Restart file copy #1 failed on 2ohoga.daj19b0
Atmos Restart file copy failed on atmos_restart.day
Atmos Restart file copy failed on 2ohoma.daj19c0
Precis Restart file copy #1 failed on 2ohoga.daj19c0
Atmos Restart file copy failed on atmos_restart.day
Atmos Restart file copy failed on 2ohoma.daj19d0
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5276, selfPID=5276, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5544, selfPID=5544, iMonCtr=2
Error converting file to netcdf: C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_2oho_1990_1_007852917/dataout\2ohoma.pcj1sep
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=960, selfPID=3088, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4252, selfPID=4076, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1344, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
08:32:47 (4296): Can't acquire lockfile (32) - waiting 35s
08:33:22 (4296): Can't acquire lockfile (32) - exiting
08:33:22 (4296): Error: Der Prozess kann nicht auf die Datei zugreifen, da sie von einem anderen Prozess verwendet wird. (0x20)
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5052, selfPID=5052, iMonCtr=2
Error converting file to netcdf: C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_2oho_1990_1_007852917/dataout\2ohoma.pcj1oct
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Error converting file to netcdf: C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_2oho_1990_1_007852917/dataout\2ohoma.pcj1nov
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_2oho_1990_1_007852917_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Apr 2012 03:56:47 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 137,856 599,997 4.3523
19 Apr 2012 00:23:54 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 126,336 550,773 4.3596
17 Apr 2012 05:25:16 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 114,816 501,307 4.3662
15 Apr 2012 22:16:42 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 103,776 451,837 4.3540
13 Apr 2012 21:43:57 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 92,267 401,488 4.3514
13 Apr 2012 17:30:40 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 92,260 400,745 4.3436
13 Apr 2012 10:28:49 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 92,256 400,045 4.3362
12 Apr 2012 03:13:16 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 80,736 350,723 4.3441
11 Apr 2012 01:23:25 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 69,216 301,997 4.3631
09 Apr 2012 16:04:00 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 57,696 252,873 4.3829
07 Apr 2012 16:36:12 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 46,176 202,784 4.3915
06 Apr 2012 13:31:18 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 34,656 152,542 4.4016
05 Apr 2012 04:41:30 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 23,136 102,514 4.4309
04 Apr 2012 12:26:11 1104312 14346640 hadam3p_eu_2oho_1990_1_007852917_0 11,616 51,647 4.4462


©2024 climateprediction.net