Name | hadcm3n_t10y_1980_40_007449533_1 |
Workunit | 7647036 |
Created | 10 Sep 2011, 0:47:36 UTC |
Sent | 10 Sep 2011, 0:47:42 UTC |
Report deadline | 10 Dec 2011, 8:14:53 UTC |
Received | 23 Sep 2011, 19:31:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1168793 |
Run time | 8 days 21 hours 33 min 20 sec |
CPU time | 8 days 9 hours 38 min 30 sec |
Validate state | Invalid |
Credit | 4,976.64 |
Device peak FLOPS | 4.31 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:00:48 (3848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Sep 2011 00:08:52 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 414,720 | 686,747 | 1.6559 |
21 Sep 2011 21:35:31 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 388,800 | 642,530 | 1.6526 |
21 Sep 2011 03:18:09 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 362,880 | 600,192 | 1.6540 |
19 Sep 2011 21:44:54 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 336,960 | 558,121 | 1.6563 |
19 Sep 2011 07:01:40 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 311,040 | 515,308 | 1.6567 |
18 Sep 2011 08:01:03 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 285,120 | 472,839 | 1.6584 |
17 Sep 2011 10:51:18 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 259,200 | 430,114 | 1.6594 |
15 Sep 2011 20:45:42 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 233,280 | 387,117 | 1.6595 |
15 Sep 2011 06:16:30 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 207,360 | 344,716 | 1.6624 |
14 Sep 2011 13:36:25 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 181,440 | 303,007 | 1.6700 |
14 Sep 2011 01:34:42 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 155,520 | 260,326 | 1.6739 |
13 Sep 2011 13:24:14 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 129,600 | 217,635 | 1.6793 |
12 Sep 2011 21:56:40 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 103,680 | 174,557 | 1.6836 |
12 Sep 2011 08:04:02 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 77,760 | 130,987 | 1.6845 |
11 Sep 2011 19:35:51 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 51,840 | 87,368 | 1.6853 |
11 Sep 2011 05:21:41 | 1168793 | 13364015 | hadcm3n_t10y_1980_40_007449533_1 | 25,920 | 43,468 | 1.6770 |
©2024 cpdn.org