Task 13628368

Name hadcm3n_o2um_1940_40_007543935_0
Workunit 7741167
Created 10 Nov 2011, 3:32:06 UTC
Sent 16 Nov 2011, 2:50:33 UTC
Report deadline 15 Feb 2012, 10:17:44 UTC
Received 12 Dec 2011, 18:40:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1165859
Run time 13 days 3 hours 53 min 16 sec
CPU time 11 days 21 hours 55 min 12 sec
Validate state Invalid
Credit 6,842.88
Device peak FLOPS 2.20 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CSuspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3208, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3208, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4048, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:11:55 (3100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:11:56 (3100): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
11:40:16 (2888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:35:22 (576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
13:02:14 (2148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:37:35 (3640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:37:36 (3640): No heartbeat from core client for 30 sec - exiting
19:37:37 (3640): No heartbeat from core client for 30 sec - exiting
23:36:28 (2208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:51:32 (3532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:42:10 (5560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:42:11 (5560): No heartbeat from core client for 30 sec - exiting
15:42:12 (5560): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3284, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3284, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3284, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3284, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
15:34:23 (3832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:32:52 (1460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:31:19 (6064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:59:20 (3560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:57:57 (1644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:19:19 (3564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:19:21 (3564): No heartbeat from core client for 30 sec - exiting
21:19:22 (3564): No heartbeat from core client for 30 sec - exiting
21:19:24 (3564): No heartbeat from core client for 30 sec - exiting
21:19:25 (3564): No heartbeat from core client for 30 sec - exiting
21:19:26 (3564): No heartbeat from core client for 30 sec - exiting
01:17:36 (3412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:38:18 (3488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:36:17 (4680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:35:17 (3572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:34:13 (4808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:34:14 (4808): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2040, iMonCtr=1
Model crash detected, will try to restart...
11:36:38 (2088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:36:39 (2088): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
16:49:59 (5840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:50:00 (5840): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:36:36 (4744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:36:38 (4744): No heartbeat from core client for 30 sec - exiting
00:36:39 (4744): No heartbeat from core client for 30 sec - exiting
00:36:40 (4744): No heartbeat from core client for 30 sec - exiting
00:36:41 (4744): No heartbeat from core client for 30 sec - exiting
00:36:42 (4744): No heartbeat from core client for 30 sec - exiting
00:36:43 (4744): No heartbeat from core client for 30 sec - exiting
00:36:44 (4744): No heartbeat from core client for 30 sec - exiting
00:36:45 (4744): No heartbeat from core client for 30 sec - exiting
00:36:46 (4744): No heartbeat from core client for 30 sec - exiting
CSuspended CPDN Monitor - Suspend request from BOINC...
20:55:59 (4052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:55:08 (4308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:32:56 (4740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:33:04 (4740): No heartbeat from core client for 30 sec - exiting
18:33:05 (4740): No heartbeat from core client for 30 sec - exiting
18:33:06 (4740): No heartbeat from core client for 30 sec - exiting
18:33:07 (4740): No heartbeat from core client for 30 sec - exiting
18:33:08 (4740): No heartbeat from core client for 30 sec - exiting
18:33:09 (4740): No heartbeat from core client for 30 sec - exiting
18:33:10 (4740): No heartbeat from core client for 30 sec - exiting
18:33:11 (4740): No heartbeat from core client for 30 sec - exiting
18:33:12 (4740): No heartbeat from core client for 30 sec - exiting
18:33:13 (4740): No heartbeat from core client for 30 sec - exiting
18:33:14 (4740): No heartbeat from core client for 30 sec - exiting
18:33:16 (4740): No heartbeat from core client for 30 sec - exiting
18:33:17 (4740): No heartbeat from core client for 30 sec - exiting
18:33:18 (4740): No heartbeat from core client for 30 sec - exiting
18:33:19 (4740): No heartbeat from core client for 30 sec - exiting
18:33:20 (4740): No heartbeat from core client for 30 sec - exiting
18:33:21 (4740): No heartbeat from core client for 30 sec - exiting
18:33:22 (4740): No heartbeat from core client for 30 sec - exiting
18:33:23 (4740): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
00:42:30 (580): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
00:42:36 (580): No heartbeat from core client for 30 sec - exiting
00:42:37 (580): No heartbeat from core client for 30 sec - exiting
00:42:38 (580): No heartbeat from core client for 30 sec - exiting
00:42:39 (580): No heartbeat from core client for 30 sec - exiting
00:42:40 (580): No heartbeat from core client for 30 sec - exiting
00:42:41 (580): No heartbeat from core client for 30 sec - exiting
00:42:42 (580): No heartbeat from core client for 30 sec - exiting
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o2um_1940_40_007543935/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o2um_1940_40_007543935/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o2um_1940_40_007543935/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o2um_1940_40_007543935/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o2um_1940_40_007543935/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o2um_1940_40_007543935/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
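The stderr output above is long and repetitive, so a tally by message type can make the sequence easier to read: repeated suspend requests, lost heartbeats, model restarts, and finally an unreadable ocean_restart.day file. The following is a minimal sketch of such a tally, not a CPDN or BOINC tool. The substrings are copied from the log above; the category labels, the `tally` function, and the `sample` text are hypothetical names introduced only for illustration.

```python
from collections import Counter

# Substrings copied from the stderr text above.
# The dictionary keys are illustrative labels, not official BOINC/CPDN terms.
PATTERNS = {
    "suspend_request": "Suspend request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "model_restart": "Model crash detected, will try to restart",
    "restart_file_unreadable": "cannot open input file",
    "read_flh_io_error": "READ_FLH: I/O error",
}


def tally(stderr_text: str) -> Counter:
    """Count stderr lines per category; each line is counted at most once."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
                break
    return counts


# A short excerpt in the same shape as the log above (used here as a sample).
sample = """Suspended CPDN Monitor - Suspend request from BOINC...
Model crash detected, will try to restart...
20:11:55 (3100): No heartbeat from core client for 30 sec - exiting
Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
"""

for label, count in tally(sample).items():
    print(label, count)
```

To summarise the whole task, paste the text between the stderr_txt tags into `sample`. Lines that match none of the patterns, such as the final "Sorry, too many model crashes!" line, are left uncounted.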
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
10 Dec 2011 21:58:48  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  570,240   990,607         1.7372
09 Dec 2011 19:42:20  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  544,320   945,778         1.7375
08 Dec 2011 19:38:52  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  518,400   901,121         1.7383
07 Dec 2011 17:04:47  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  492,480   858,236         1.7427
06 Dec 2011 13:26:17  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  466,560   811,369         1.7390
05 Dec 2011 11:11:34  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  440,640   763,640         1.7330
04 Dec 2011 11:55:42  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  414,720   719,181         1.7341
02 Dec 2011 23:38:52  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  388,800   674,202         1.7341
02 Dec 2011 01:24:56  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  362,880   627,307         1.7287
01 Dec 2011 11:42:09  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  336,960   581,025         1.7243
29 Nov 2011 22:18:14  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  311,040   537,718         1.7288
28 Nov 2011 18:08:11  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  285,120   492,426         1.7271
27 Nov 2011 12:51:47  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  259,200   447,518         1.7265
26 Nov 2011 09:14:39  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  233,280   400,101         1.7151
25 Nov 2011 09:26:07  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  207,360   354,401         1.7091
23 Nov 2011 21:48:17  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  181,440   310,866         1.7133
22 Nov 2011 08:18:42  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  155,520   265,532         1.7074
21 Nov 2011 09:23:38  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  129,600   219,577         1.6943
19 Nov 2011 19:49:16  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  103,680   176,564         1.7030
18 Nov 2011 15:05:55  1165859  13628368   hadcm3n_o2um_1940_40_007543935_0  77,760    134,446         1.7290
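
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. That reading is an inference from the column names, not something the page states. A minimal Python sketch that checks it against three rows copied from the table (the function name `sec_per_timestep` is hypothetical):

```python
def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places like the table."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, reported_average) copied from the first, second,
# and last rows of the trickle table.
rows = [
    (570_240, 990_607, 1.7372),
    (544_320, 945_778, 1.7375),
    (77_760, 134_446, 1.7290),
]

for timestep, cpu_time_sec, reported in rows:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # True means the computed value agrees with the table to 4 decimal places.
    print(timestep, computed, abs(computed - reported) < 1e-4)
```

The timestep count rises by 25,920 between consecutive trickles in the table, so the roughly constant 1.7 sec/TS suggests the host was running at a steady rate until the task failed.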

