Name | hadcm3n_p68w_1900_40_007225680_0 |
Workunit | 7423920 |
Created | 26 Apr 2011, 15:35:32 UTC |
Sent | 27 Apr 2011, 15:15:18 UTC |
Report deadline | 27 Jul 2011, 22:42:29 UTC |
Received | 15 Jul 2011, 16:44:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1122411 |
Run time | 60 days 13 hours 23 min 21 sec |
CPU time | 47 days 23 hours 33 min 41 sec |
Validate state | Invalid |
Credit | 11,508.48 |
Device peak FLOPS | 1.43 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:02:03 (4312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:02:04 (4312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5544, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5544, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5544, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Jul 2011 06:54:47 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 959,040 | 4,116,412 | 4.2922 |
27 Jun 2011 07:28:27 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 933,120 | 4,003,757 | 4.2907 |
24 Jun 2011 08:51:47 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 907,200 | 3,890,680 | 4.2887 |
22 Jun 2011 19:21:45 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 881,280 | 3,778,790 | 4.2878 |
21 Jun 2011 07:03:55 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 855,360 | 3,667,951 | 4.2882 |
19 Jun 2011 22:17:43 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 829,440 | 3,551,636 | 4.2820 |
19 Jun 2011 22:03:39 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 803,520 | 3,438,918 | 4.2798 |
16 Jun 2011 15:02:34 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 777,600 | 3,328,601 | 4.2806 |
14 Jun 2011 20:58:33 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 751,680 | 3,217,356 | 4.2802 |
12 Jun 2011 03:46:30 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 725,760 | 3,103,990 | 4.2769 |
09 Jun 2011 13:12:03 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 699,840 | 2,997,190 | 4.2827 |
07 Jun 2011 23:58:21 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 673,920 | 2,887,278 | 4.2843 |
06 Jun 2011 10:21:08 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 648,000 | 2,776,452 | 4.2846 |
04 Jun 2011 20:29:30 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 622,080 | 2,663,335 | 4.2813 |
03 Jun 2011 07:36:37 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 596,160 | 2,551,442 | 4.2798 |
01 Jun 2011 18:44:30 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 570,240 | 2,439,813 | 4.2786 |
31 May 2011 06:37:44 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 544,320 | 2,330,465 | 4.2814 |
29 May 2011 16:57:37 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 518,400 | 2,217,632 | 4.2778 |
28 May 2011 02:42:52 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 492,480 | 2,104,580 | 4.2734 |
26 May 2011 12:56:16 | 1122411 | 12831287 | hadcm3n_p68w_1900_40_007225680_0 | 466,560 | 1,993,362 | 4.2725 |
©2024 cpdn.org