climateprediction.net home page
Task 14102891

Task 14102891

Name hadcm3n_y8qx_1940_40_007753082_1
Workunit 7908191
Created 16 Feb 2012, 23:07:46 UTC
Sent 16 Feb 2012, 23:08:03 UTC
Report deadline 18 May 2012, 6:35:14 UTC
Received 28 Mar 2012, 20:54:11 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 725427
Run time 13 days 11 hours 33 min 30 sec
CPU time 9 days 19 hours 16 min 52 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.18 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9112, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2444, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4980, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1344, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5584, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8672, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9928, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6212, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7768, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2596, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7632, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=620, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10500, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4680, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
15:53:17 (4760): No heartbeat from core client for 30 sec - exiting
15:53:18 (4760): No heartbeat from core client for 30 sec - exiting
15:53:19 (4760): No heartbeat from core client for 30 sec - exiting
15:53:20 (4760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6064, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6968, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2248, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5336, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5336, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5264, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1700, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1124, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/y8qxko.pje5c10
Error converting file to netcdf: dataout/y8qxko.pie5c10
Error converting file to netcdf: dataout/y8qxko.pfe5c10
Error converting file to netcdf: dataout/y8qxka.phe5c10
Error converting file to netcdf: dataout/y8qxka.pge5c10
Error converting file to netcdf: dataout/y8qxka.pee5c10
Error converting file to netcdf: dataout/y8qxka.pde5c10
16:13:58 (1700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5820, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6016, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=504, iMonCtr=1
Model crash detected, will try to restart...
16:00:23 (6444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6948, iMonCtr=1
Model crash detected, will try to restart...
07:56:06 (5308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:00:23 (4520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:00:25 (4520): No heartbeat from core client for 30 sec - exiting
08:00:26 (4520): No heartbeat from core client for 30 sec - exiting
08:00:27 (4520): No heartbeat from core client for 30 sec - exiting
08:00:28 (4520): No heartbeat from core client for 30 sec - exiting
08:00:29 (4520): No heartbeat from core client for 30 sec - exiting
08:00:30 (4520): No heartbeat from core client for 30 sec - exiting
08:00:31 (4520): No heartbeat from core client for 30 sec - exiting
08:00:32 (4520): No heartbeat from core client for 30 sec - exiting
08:00:33 (4520): No heartbeat from core client for 30 sec - exiting
08:00:34 (4520): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5208, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1108, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6188, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=1
Model crash detected, will try to restart...
16:33:18 (5996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:33:19 (5996): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7688, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5784, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3472, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4288, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
16:27:12 (4468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:52:55 (5440): No heartbeat from core client for 30 sec - exiting
20:52:56 (5440): No heartbeat from core client for 30 sec - exiting
20:52:57 (5440): No heartbeat from core client for 30 sec - exiting
20:52:58 (5440): No heartbeat from core client for 30 sec - exiting
20:52:59 (5440): No heartbeat from core client for 30 sec - exiting
20:53:00 (5440): No heartbeat from core client for 30 sec - exiting
20:53:01 (5440): No heartbeat from core client for 30 sec - exiting
20:53:02 (5440): No heartbeat from core client for 30 sec - exiting
20:53:03 (5440): No heartbeat from core client for 30 sec - exiting
20:53:04 (5440): No heartbeat from core client for 30 sec - exiting
20:53:05 (5440): No heartbeat from core client for 30 sec - exiting
20:53:06 (5440): No heartbeat from core client for 30 sec - exiting
20:53:07 (5440): No heartbeat from core client for 30 sec - exiting
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Mar 2012 19:54:22 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 259,200 847,008 3.2678
25 Mar 2012 11:23:05 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 233,280 763,638 3.2735
22 Mar 2012 18:07:44 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 207,360 677,759 3.2685
19 Mar 2012 12:39:50 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 181,440 593,529 3.2712
16 Mar 2012 20:30:15 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 155,520 505,819 3.2524
12 Mar 2012 16:16:38 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 129,600 424,749 3.2774
09 Mar 2012 18:52:17 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 103,680 338,976 3.2694
06 Mar 2012 19:29:35 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 77,760 251,820 3.2384
02 Mar 2012 13:46:07 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 51,840 166,442 3.2107
19 Feb 2012 15:17:42 725427 14102891 hadcm3n_y8qx_1940_40_007753082_1 25,920 81,660 3.1505


©2024 cpdn.org