Name | hadcm3n_o699_1900_40_007203440_0 |
Workunit | 7401720 |
Created | 28 Mar 2011, 14:16:25 UTC |
Sent | 29 Mar 2011, 7:32:19 UTC |
Report deadline | 28 Jun 2011, 14:59:30 UTC |
Received | 12 May 2011, 15:28:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Done |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 213772 |
Run time | |
CPU time | 24 days 21 hours 24 min 3 sec |
Validate state | Invalid |
Credit | 5,598.72 |
Device peak FLOPS | 1.19 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>5.8.16</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 17:33:23 (2268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:52:27 (2840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:53:45 (8016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:59:58 (3884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:57:01 (2316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:55:12 (3404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:57:09 (4156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:13:46 (5780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 03:50:30 (5844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:55:05 (2740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:57:46 (5012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 04:02:56 (3712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:30:44 (4324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 02:52:22 (5008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 08:19:09 (5184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=336, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=336, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=336, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=336, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=336, iMonCtr=1 Model crash detected, will try to restart... 13:01:59 (336): No heartbeat from core client for 30 sec - exiting 13:02:00 (336): No heartbeat from core client for 30 sec - exiting 13:02:01 (336): No heartbeat from core client for 30 sec - exiting 13:02:02 (336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4004, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
12 May 2011 01:31:35 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 466,560 | 2,142,253 | 4.5916 |
07 May 2011 14:46:47 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 440,640 | 2,029,913 | 4.6067 |
04 May 2011 13:26:04 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 414,720 | 1,911,433 | 4.6090 |
02 May 2011 11:36:43 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 388,800 | 1,788,554 | 4.6002 |
30 Apr 2011 20:37:02 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 362,880 | 1,669,729 | 4.6013 |
29 Apr 2011 01:08:43 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 336,960 | 1,548,567 | 4.5957 |
27 Apr 2011 10:30:06 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 311,040 | 1,428,587 | 4.5929 |
25 Apr 2011 18:16:46 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 285,120 | 1,310,346 | 4.5958 |
24 Apr 2011 04:04:58 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 259,200 | 1,193,250 | 4.6036 |
22 Apr 2011 14:59:47 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 233,280 | 1,079,457 | 4.6273 |
20 Apr 2011 20:13:07 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 207,360 | 965,583 | 4.6566 |
20 Apr 2011 20:13:07 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 181,440 | 846,657 | 4.6663 |
20 Apr 2011 20:13:07 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 155,520 | 728,714 | 4.6857 |
20 Apr 2011 20:13:07 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 129,600 | 606,752 | 4.6817 |
12 Apr 2011 01:02:35 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 103,680 | 481,872 | 4.6477 |
10 Apr 2011 09:59:22 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 77,760 | 363,428 | 4.6737 |
08 Apr 2011 15:04:36 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 51,840 | 247,818 | 4.7804 |
06 Apr 2011 10:36:22 | 213772 | 12748095 | hadcm3n_o699_1900_40_007203440_0 | 25,920 | 122,590 | 4.7296 |
©2024 cpdn.org