climateprediction.net home page
Task 10960782

Task 10960782

Name hadsm3dhet2_jkrj_006589905_6
Workunit 6793278
Created 15 Mar 2010, 11:52:45 UTC
Sent 22 Oct 2010, 14:08:03 UTC
Report deadline 4 Oct 2011, 19:28:03 UTC
Received 4 Nov 2010, 13:58:56 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1092796
Run time 4 days 17 hours 59 min 47 sec
CPU time 4 days 13 hours 34 min 37 sec
Validate state Invalid
Credit 1,885.62
Device peak FLOPS 1.92 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: 
cpdnmonitor: error reading file C:\Documents and Settings\All Users.WINDOWS\Application Data\BOINC/projects/climateprediction.net/hadsm3_se_6.07_windows_intelx86.exe
cpdnmonitor: error reading file C:\Documents and Settings\All Users.WINDOWS\Application Data\BOINC/projects/climateprediction.net/hadsm3_um_6.07_windows_intelx86.exe
cpdnmonitor: error reading file C:\Documents and Settings\All Users.WINDOWS\Application Data\BOINC/projects/climateprediction.net/hadsm3dhet2_jkrj_006589905/dataout/restart.day
Could not launch model process. Last Error=193
called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Nov 2010 07:43:02 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 205,238 371,559 1.8104
02 Nov 2010 19:48:33 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 194,436 351,859 1.8096
02 Nov 2010 08:35:03 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 183,634 332,457 1.8104
01 Nov 2010 21:20:43 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 172,832 313,063 1.8114
01 Nov 2010 09:59:53 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 162,030 293,395 1.8107
31 Oct 2010 21:53:37 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 151,228 273,795 1.8105
31 Oct 2010 09:50:28 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 140,426 254,376 1.8115
30 Oct 2010 20:17:10 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 129,624 234,982 1.8128
30 Oct 2010 03:31:11 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 118,822 215,562 1.8142
29 Oct 2010 16:05:28 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 108,020 195,610 1.8109
29 Oct 2010 03:41:08 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 97,218 176,239 1.8128
28 Oct 2010 13:45:55 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 86,416 156,925 1.8159
28 Oct 2010 03:28:41 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 75,614 137,310 1.8159
27 Oct 2010 15:18:15 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 64,812 118,036 1.8212
27 Oct 2010 04:05:24 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 54,010 98,314 1.8203
26 Oct 2010 16:58:12 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 43,208 78,971 1.8277
25 Oct 2010 12:50:32 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 32,406 59,216 1.8273
23 Oct 2010 23:37:06 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 21,604 39,709 1.8380
23 Oct 2010 11:59:26 1092796 10960782 hadsm3dhet2_jkrj_006589905_6 10,802 19,545 1.8094


©2024 cpdn.org