climateprediction.net home page
Task 11112390

Task 11112390

Name hadsm3dhet2_jwgn_006605065_2
Workunit 6808438
Created 15 Mar 2010, 12:13:52 UTC
Sent 30 May 2010, 10:10:42 UTC
Report deadline 12 May 2011, 15:30:42 UTC
Received 3 Jul 2010, 20:56:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1068724
Run time 12 days 21 hours 23 min 2 sec
CPU time 11 days 8 hours 0 min 35 sec
Validate state Invalid
Credit 4,664.44
Device peak FLOPS 0.00 GFLOPS
Application version ---
Stderr
<core_client_version>6.10.43</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5152, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: 7R
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4336, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4336, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2340, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2340, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2340, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2340, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2340, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2340, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5740, iMonCtr=1
Model crash detected, will try to restart...

Model crashed: 7R

Model crashed: 7R

Model crashed: 7R
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
MainError:	12:46:33 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1476, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 248,446 729,406 1.4367
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 237,644 718,174 1.4453
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 226,842 706,941 1.4543
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 216,040 695,736 1.4638
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 205,238 684,398 1.4735
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 194,436 672,725 1.4828
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 183,634 661,334 1.4933
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 172,832 649,142 1.5024
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 162,030 637,045 1.5122
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 151,228 625,839 1.5247
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 140,426 614,717 1.5380
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 129,624 603,148 1.5510
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 118,822 591,884 1.5655
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 108,020 580,159 1.5797
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 97,218 568,955 1.5961
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 86,416 557,859 1.6139
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 75,614 546,761 1.6328
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 64,812 535,654 1.6529
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 54,010 524,533 1.6744
27 Jun 2010 16:12:08 1068724 11112390 hadsm3dhet2_jwgn_006605065_2 43,208 513,166 1.6967


©2024 climateprediction.net