climateprediction.net home page
Task 11276104

Task 11276104

Name hadsm3dhet2_k93d_006621435_5
Workunit 6824808
Created 15 Mar 2010, 12:35:13 UTC
Sent 13 Apr 2010, 2:01:01 UTC
Report deadline 26 Mar 2011, 7:21:01 UTC
Received 14 Sep 2010, 19:15:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1001975
Run time 94 days 2 hours 4 min 15 sec
CPU time 84 days 8 hours 59 min 23 sec
Validate state Invalid
Credit 4,168.22
Device peak FLOPS 0.00 GFLOPS
Application version ---
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
MainError:	09:32:12 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3520, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3520, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3520, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3520, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3520, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1924, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1268, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1268, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=592, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=592, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=592, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=592, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4864, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9480, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1668, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1812, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2016, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2016, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=876, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1124, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=868, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1756, iMonCtr=1
Model crash detected, will try to restart...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: 
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2328, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2328, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1136, iMonCtr=1
Model crash detected, will try to restart...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: 
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1552, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1552, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1552, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1552, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1396, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1396, iMonCtr=1
Model crash detected, will try to restart...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: 
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1924, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1924, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1924, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1924, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1924, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1852, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1852, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1852, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1852, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1852, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1852, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Sep 2010 19:16:28 1001975 11276104 hadsm3dhet2_k93d_006621435_5 194,436 7,248,951 15.9780
29 Apr 2010 18:46:55 1001975 11276104 hadsm3dhet2_k93d_006621435_5 183,634 1,257,423 2.8392
28 Apr 2010 14:30:21 1001975 11276104 hadsm3dhet2_k93d_006621435_5 172,832 1,213,492 2.8085
28 Apr 2010 00:24:48 1001975 11276104 hadsm3dhet2_k93d_006621435_5 162,030 1,171,039 2.7797
27 Apr 2010 11:18:08 1001975 11276104 hadsm3dhet2_k93d_006621435_5 151,228 1,128,210 2.7485
27 Apr 2010 00:14:43 1001975 11276104 hadsm3dhet2_k93d_006621435_5 140,426 1,091,562 2.7311
26 Apr 2010 11:20:56 1001975 11276104 hadsm3dhet2_k93d_006621435_5 129,624 1,051,195 2.7032
25 Apr 2010 23:20:06 1001975 11276104 hadsm3dhet2_k93d_006621435_5 118,822 1,011,629 2.6758
25 Apr 2010 10:45:12 1001975 11276104 hadsm3dhet2_k93d_006621435_5 108,020 976,435 2.6586
25 Apr 2010 02:12:15 1001975 11276104 hadsm3dhet2_k93d_006621435_5 97,218 947,582 2.6583
24 Apr 2010 17:41:32 1001975 11276104 hadsm3dhet2_k93d_006621435_5 86,416 918,683 2.6577
24 Apr 2010 09:06:24 1001975 11276104 hadsm3dhet2_k93d_006621435_5 75,614 889,584 2.6566
24 Apr 2010 00:36:22 1001975 11276104 hadsm3dhet2_k93d_006621435_5 64,812 860,583 2.6556
23 Apr 2010 16:04:38 1001975 11276104 hadsm3dhet2_k93d_006621435_5 54,010 831,444 2.6542
23 Apr 2010 07:35:30 1001975 11276104 hadsm3dhet2_k93d_006621435_5 43,208 802,590 2.6536
22 Apr 2010 23:21:21 1001975 11276104 hadsm3dhet2_k93d_006621435_5 32,406 774,911 2.6570
22 Apr 2010 14:37:59 1001975 11276104 hadsm3dhet2_k93d_006621435_5 21,604 745,266 2.6536
22 Apr 2010 06:04:44 1001975 11276104 hadsm3dhet2_k93d_006621435_5 10,802 716,228 2.6522
21 Apr 2010 21:32:47 1001975 11276104 hadsm3dhet2_k93d_006621435_5 259,248 687,248 2.6509
21 Apr 2010 13:07:44 1001975 11276104 hadsm3dhet2_k93d_006621435_5 248,446 658,930 2.6522


©2024 climateprediction.net