Name | hadcm3n_zhlr_1920_40_008316212_3 |
Workunit | 8467347 |
Created | 9 Apr 2013, 13:49:20 UTC |
Sent | 9 Apr 2013, 13:49:36 UTC |
Report deadline | 9 Jul 2013, 21:16:47 UTC |
Received | 7 May 2013, 15:49:13 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1192595 |
Run time | 14 days 2 hours 39 min 27 sec |
CPU time | 11 days 16 hours 34 min 13 sec |
Validate state | Invalid |
Credit | 5,909.76 |
Device peak FLOPS | 2.08 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 09:46:42 (752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:46:43 (752): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=1 Model crash detected, will try to restart... 07:55:14 (3516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:45:36 (2440): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 09:45:38 (2440): No heartbeat from core client for 30 sec - exiting 09:45:39 (2440): No heartbeat from core client for 30 sec - exiting 09:45:41 (2440): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:13:47 (3660): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 06:13:50 (3660): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 12:18:50 (5008): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 07:28:45 (3392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:36:31 (4020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:24:53 (4196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:24:56 (4196): No heartbeat from core client for 30 sec - exiting 09:27:12 (1776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:28:29 (3212): No heartbeat from core client for 30 sec - exiting 09:28:30 (3212): No heartbeat from core client for 30 sec - exiting 09:28:31 (3212): No heartbeat from core client for 30 sec - exiting 09:28:32 (3212): No heartbeat from core client for 30 sec - exiting 09:28:33 (3212): No heartbeat from core client for 30 sec - exiting 09:28:34 (3212): No heartbeat from core client for 30 sec - exiting 09:28:35 (3212): No heartbeat from core client for 30 sec - exiting 09:28:36 (3212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
09:28:37 (3212): No heartbeat from core client for 30 sec - exiting 09:29:38 (3628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:30:16 (5052): No heartbeat from core client for 30 sec - exiting 09:30:17 (5052): No heartbeat from core client for 30 sec - exiting 09:30:18 (5052): No heartbeat from core client for 30 sec - exiting 09:30:19 (5052): No heartbeat from core client for 30 sec - exiting 09:30:20 (5052): No heartbeat from core client for 30 sec - exiting 09:30:21 (5052): No heartbeat from core client for 30 sec - exiting 09:30:22 (5052): No heartbeat from core client for 30 sec - exiting 09:30:23 (5052): No heartbeat from core client for 30 sec - exiting 09:30:24 (5052): No heartbeat from core client for 30 sec - exiting 09:30:25 (5052): No heartbeat from core client for 30 sec - exiting 09:30:26 (5052): No heartbeat from core client for 30 sec - exiting 09:30:27 (5052): No heartbeat from core client for 30 sec - exiting 09:30:28 (5052): No heartbeat from core client for 30 sec - exiting 09:30:29 (5052): No heartbeat from core client for 30 sec - exiting 09:30:30 (5052): No heartbeat from core client for 30 sec - exiting 09:30:31 (5052): No heartbeat from core client for 30 sec - exiting 09:30:32 (5052): No heartbeat from core client for 30 sec - exiting 09:30:33 (5052): No heartbeat from core client for 30 sec - exiting 09:30:34 (5052): No heartbeat from core client for 30 sec - exiting 09:30:35 (5052): No heartbeat from core client for 30 sec - exiting 09:30:36 (5052): No heartbeat from core client for 30 sec - exiting 09:30:37 (5052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
09:31:39 (1520): No heartbeat from core client for 30 sec - exiting 09:31:40 (1520): No heartbeat from core client for 30 sec - exiting 09:31:41 (1520): No heartbeat from core client for 30 sec - exiting 09:31:42 (1520): No heartbeat from core client for 30 sec - exiting 09:31:43 (1520): No heartbeat from core client for 30 sec - exiting 09:31:44 (1520): No heartbeat from core client for 30 sec - exiting 09:31:45 (1520): No heartbeat from core client for 30 sec - exiting 09:31:46 (1520): No heartbeat from core client for 30 sec - exiting 09:31:47 (1520): No heartbeat from core client for 30 sec - exiting 09:31:48 (1520): No heartbeat from core client for 30 sec - exiting 09:31:49 (1520): No heartbeat from core client for 30 sec - exiting 09:31:50 (1520): No heartbeat from core client for 30 sec - exiting 09:31:51 (1520): No heartbeat from core client for 30 sec - exiting 09:31:52 (1520): No heartbeat from core client for 30 sec - exiting 09:31:53 (1520): No heartbeat from core client for 30 sec - exiting 09:31:54 (1520): No heartbeat from core client for 30 sec - exiting 09:31:55 (1520): No heartbeat from core client for 30 sec - exiting 09:31:56 (1520): No heartbeat from core client for 30 sec - exiting 09:31:57 (1520): No heartbeat from core client for 30 sec - exiting 09:31:58 (1520): No heartbeat from core client for 30 sec - exiting 09:31:59 (1520): No heartbeat from core client for 30 sec - exiting 09:32:00 (1520): No heartbeat from core client for 30 sec - exiting 09:32:01 (1520): No heartbeat from core client for 30 sec - exiting 09:32:02 (1520): No heartbeat from core client for 30 sec - exiting 09:32:03 (1520): No heartbeat from core client for 30 sec - exiting 09:32:04 (1520): No heartbeat from core client for 30 sec - exiting 09:32:05 (1520): No heartbeat from core client for 30 sec - exiting 09:32:06 (1520): No heartbeat from core client for 30 sec - exiting 09:32:07 (1520): No heartbeat from core client for 30 sec - exiting 09:32:08 (1520): No 
heartbeat from core client for 30 sec - exiting 09:32:09 (1520): No heartbeat from core client for 30 sec - exiting 09:32:10 (1520): No heartbeat from core client for 30 sec - exiting 09:32:11 (1520): No heartbeat from core client for 30 sec - exiting 09:32:12 (1520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:33:56 (1556): No heartbeat from core client for 30 sec - exiting 09:33:57 (1556): No heartbeat from core client for 30 sec - exiting 09:33:58 (1556): No heartbeat from core client for 30 sec - exiting 09:33:59 (1556): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3440, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3440, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
02 May 2013 01:38:47 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 492,480 | 970,936 | 1.9715 |
30 Apr 2013 06:04:33 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 466,560 | 921,431 | 1.9749 |
27 Apr 2013 11:33:47 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 440,640 | 871,203 | 1.9771 |
26 Apr 2013 09:33:12 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 414,720 | 817,097 | 1.9702 |
25 Apr 2013 10:57:52 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 388,800 | 760,895 | 1.9570 |
24 Apr 2013 12:51:27 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 362,880 | 703,858 | 1.9396 |
23 Apr 2013 14:45:35 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 336,960 | 651,040 | 1.9321 |
22 Apr 2013 15:20:45 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 311,040 | 600,970 | 1.9321 |
21 Apr 2013 05:56:18 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 285,120 | 553,041 | 1.9397 |
19 Apr 2013 22:05:05 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 259,200 | 504,830 | 1.9476 |
19 Apr 2013 07:28:11 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 233,280 | 453,433 | 1.9437 |
18 Apr 2013 16:57:10 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 207,360 | 402,300 | 1.9401 |
17 Apr 2013 20:22:46 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 181,440 | 351,481 | 1.9372 |
17 Apr 2013 05:50:56 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 155,520 | 300,476 | 1.9321 |
16 Apr 2013 15:25:51 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 129,600 | 249,781 | 1.9273 |
15 Apr 2013 07:09:49 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 103,680 | 200,160 | 1.9306 |
14 Apr 2013 17:23:05 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 77,760 | 151,046 | 1.9425 |
14 Apr 2013 03:24:40 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 51,840 | 102,078 | 1.9691 |
10 Apr 2013 17:26:24 | 1192595 | 15718837 | hadcm3n_zhlr_1920_40_008316212_3 | 25,920 | 53,041 | 2.0463 |
©2024 cpdn.org