climateprediction.net home page
Task 13019458

Task 13019458

Name hadcm3n_t0bw_1940_40_007313629_0
Workunit 7511059
Created 28 Jun 2011, 11:37:23 UTC
Sent 28 Jun 2011, 11:37:37 UTC
Report deadline 27 Sep 2011, 19:04:48 UTC
Received 26 Jul 2011, 23:33:45 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1135852
Run time 18 days 2 hours 26 min 59 sec
CPU time 17 days 0 hours 16 min 12 sec
Validate state Invalid
Credit 5,598.72
Device peak FLOPS 1.20 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4920, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4628, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4968, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6964, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1896, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:51:42 (7956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:57:03 (5936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:59:49 (3420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:05:04 (6784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3132, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=792, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=792, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=792, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5200, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5412, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Jul 2011 19:05:52 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 466,560 1,455,367 3.1194
25 Jul 2011 22:54:26 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 440,640 1,371,790 3.1132
25 Jul 2011 20:57:11 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 414,720 1,290,107 3.1108
25 Jul 2011 19:14:13 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 388,800 1,207,998 3.1070
25 Jul 2011 19:05:56 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 362,880 1,125,461 3.1015
25 Jul 2011 17:55:07 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 336,960 1,042,988 3.0953
25 Jul 2011 16:28:18 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 311,040 958,644 3.0821
25 Jul 2011 14:20:21 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 285,120 875,489 3.0706
25 Jul 2011 14:20:21 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 259,200 792,594 3.0578
25 Jul 2011 14:20:21 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 233,280 709,842 3.0429
25 Jul 2011 14:20:21 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 207,360 628,269 3.0298
09 Jul 2011 15:02:29 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 181,440 546,885 3.0141
07 Jul 2011 21:59:55 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 155,520 495,054 3.1832
07 Jul 2011 15:54:15 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 129,600 414,227 3.1962
04 Jul 2011 18:55:51 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 103,680 332,364 3.2057
03 Jul 2011 19:23:53 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 77,760 250,377 3.2199
01 Jul 2011 19:16:37 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 51,840 167,374 3.2287
29 Jun 2011 19:44:11 1135852 13019458 hadcm3n_t0bw_1940_40_007313629_0 25,920 83,059 3.2044


©2024 cpdn.org