Name | hadsm3dhet2_jqcg_006597138_7 |
Workunit | 6800511 |
Created | 15 Mar 2010, 12:03:33 UTC |
Sent | 28 Sep 2010, 16:20:48 UTC |
Report deadline | 10 Sep 2011, 21:40:48 UTC |
Received | 1 Nov 2010, 5:24:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1095664 |
Run time | 1 days 20 hours 13 min 38 sec |
CPU time | 1 days 18 hours 36 min 34 sec |
Validate state | Invalid |
Credit | 1,091.68 |
Device peak FLOPS | 2.62 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=1976, selfPID=1976, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=7116, selfPID=7116, iMonCtr=1 CPDN Monitor - Quit request from BOINC... forrtl: Access is denied. CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=3948, selfPID=3948, iMonCtr=1 CPDN Monitor - Quit request from BOINC... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jqcg_006597138/dataout/restart.day forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6060, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jqcg_006597138/dataout/restart.day forrtl: Access is denied. CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=3496, selfPID=3496, iMonCtr=1 CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=3044, selfPID=3044, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=4680, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5432, iMonCtr=1 CPDN Monitor - Quit request from BOINC... forrtl: Access is denied. CPDN Monitor - Quit request from BOINC... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=2628, selfPID=2628, iMonCtr=1 CPDN Monitor - Quit request from BOINC... forrtl: Access is denied. CPDN Monitor - Quit request from BOINC... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=2492, selfPID=2492, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=4528, selfPID=4528, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=2620, selfPID=2620, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=5868, selfPID=5868, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=1948, selfPID=1948, iMonCtr=1 No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN process is not running, exiting, bRetVal = 1, checkPID=3152, selfPID=3152, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=1168, selfPID=1168, iMonCtr=1 No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 CPDN Monitor - Quit request from BOINC... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=1752, selfPID=1752, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=2640, selfPID=2640, iMonCtr=1 No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN process is not running, exiting, bRetVal = 1, checkPID=5976, selfPID=5976, iMonCtr=1 CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... Model crashed: 7R Model crashed: 7R Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
31 Oct 2010 19:04:55 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 118,822 | 142,531 | 1.1995 |
31 Oct 2010 16:01:03 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 108,020 | 131,585 | 1.2182 |
31 Oct 2010 12:06:23 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 97,218 | 120,567 | 1.2402 |
31 Oct 2010 08:53:18 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 86,416 | 109,535 | 1.2675 |
30 Oct 2010 04:55:06 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 75,614 | 98,548 | 1.3033 |
29 Oct 2010 16:20:58 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 64,812 | 87,350 | 1.3477 |
28 Oct 2010 07:51:40 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 54,010 | 76,135 | 1.4096 |
26 Oct 2010 17:59:22 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 43,208 | 64,701 | 1.4974 |
26 Oct 2010 14:45:11 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 32,406 | 53,584 | 1.6535 |
26 Oct 2010 11:31:49 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 21,604 | 42,514 | 1.9679 |
21 Oct 2010 13:51:28 | 1095664 | 11033115 | hadsm3dhet2_jqcg_006597138_7 | 10,802 | 10,955 | 1.0142 |
©2024 cpdn.org