climateprediction.net home page
Task 11640574

Task 11640574

Name hadsm3dhet2_u0bh_006669686_7
Workunit 6872940
Created 9 Aug 2010, 15:30:35 UTC
Sent 26 Sep 2010, 20:42:47 UTC
Report deadline 9 Sep 2011, 2:02:47 UTC
Received 11 Oct 2010, 19:37:15 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1079445
Run time 5 days 17 hours 13 min 41 sec
CPU time 5 days 16 hours 14 min 2 sec
Validate state Invalid
Credit 3,572.76
Device peak FLOPS 2.52 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4968, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1892, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
MainError:	05:51:18 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9476, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5532, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Oct 2010 06:21:24 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 129,624 486,771 1.2518
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 118,822 473,421 1.2522
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 108,020 459,061 1.2499
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 97,218 445,068 1.2486
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 86,416 431,462 1.2482
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 75,614 419,519 1.2528
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 64,812 408,513 1.2606
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 54,010 397,550 1.2691
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 43,208 385,491 1.2745
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 32,406 373,760 1.2815
10 Oct 2010 19:50:02 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 21,604 361,800 1.2882
06 Oct 2010 09:55:43 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 10,802 348,036 1.2888
06 Oct 2010 06:14:30 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 259,248 333,731 1.2873
05 Oct 2010 20:09:09 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 248,446 319,988 1.2880
05 Oct 2010 07:26:44 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 237,644 305,502 1.2855
04 Oct 2010 21:25:56 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 226,842 291,458 1.2849
04 Oct 2010 08:29:37 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 216,040 277,457 1.2843
03 Oct 2010 22:41:21 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 205,238 263,307 1.2829
02 Oct 2010 09:44:00 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 194,436 249,458 1.2830
02 Oct 2010 06:07:35 1079445 11640574 hadsm3dhet2_u0bh_006669686_7 183,634 235,951 1.2849


©2024 cpdn.org