Name | hadsm3dhet2_jvhu_006603812_2 |
Workunit | 6807185 |
Created | 15 Mar 2010, 12:12:16 UTC |
Sent | 3 Jun 2010, 21:36:21 UTC |
Report deadline | 17 May 2011, 2:56:21 UTC |
Received | 21 Jun 2010, 21:16:24 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 21 (0x00000015) Unknown error code |
Computer ID | 942634 |
Run time | 2 days 12 hours 41 min 55 sec |
CPU time | 2 days 12 hours 41 min 55 sec |
Validate state | Invalid |
Credit | 1,885.62 |
Device peak FLOPS | 3.18 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.2.18</core_client_version> <![CDATA[ <message> The device is not ready. (0x15) - exit code 21 (0x15) </message> <stderr_txt> CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2188, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2188, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2188, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2188, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2188, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2188, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1624, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1624, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1624, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1624, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1624, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1624, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1624, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2280, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2164, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2164, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2260, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2172, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1404, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1404, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1404, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1404, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1404, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6556, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2720, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2720, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 0, checkPID=2143289344, selfPID=2143289344, iMonCtr=0 Model crash detected, will try to restart... Post-processing failed! :-( called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Jun 2010 21:14:18 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 205,238 | 218,209 | 1.0632 |
21 Jun 2010 17:19:35 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 194,436 | 208,943 | 1.0746 |
20 Jun 2010 19:15:30 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 183,634 | 199,836 | 1.0882 |
20 Jun 2010 15:06:08 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 172,832 | 190,009 | 1.0994 |
19 Jun 2010 21:29:00 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 162,030 | 180,635 | 1.1148 |
19 Jun 2010 08:03:38 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 151,228 | 171,373 | 1.1332 |
17 Jun 2010 17:50:43 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 140,426 | 162,727 | 1.1588 |
16 Jun 2010 19:25:30 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 129,624 | 153,924 | 1.1875 |
14 Jun 2010 19:17:41 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 118,822 | 144,876 | 1.2193 |
13 Jun 2010 18:40:05 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 108,020 | 136,013 | 1.2591 |
13 Jun 2010 15:13:56 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 97,218 | 126,367 | 1.2998 |
13 Jun 2010 11:12:11 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 86,416 | 115,115 | 1.3321 |
12 Jun 2010 18:53:25 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 75,614 | 101,033 | 1.3362 |
12 Jun 2010 14:50:35 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 64,812 | 87,354 | 1.3478 |
12 Jun 2010 09:01:49 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 54,010 | 72,457 | 1.3415 |
10 Jun 2010 21:43:16 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 43,208 | 58,280 | 1.3488 |
10 Jun 2010 17:03:30 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 32,406 | 44,403 | 1.3702 |
08 Jun 2010 21:45:06 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 21,604 | 29,582 | 1.3693 |
08 Jun 2010 17:12:19 | 942634 | 11099854 | hadsm3dhet2_jvhu_006603812_2 | 10,802 | 15,156 | 1.4031 |
©2024 cpdn.org