Name | hadcm3n_ziml_1880_40_008250163_1 |
Workunit | 8405287 |
Created | 22 Nov 2012, 3:46:08 UTC |
Sent | 22 Nov 2012, 3:46:24 UTC |
Report deadline | 21 Feb 2013, 11:13:35 UTC |
Received | 3 Feb 2013, 2:45:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1233028 |
Run time | 6 days 21 hours 9 min 42 sec |
CPU time | 5 days 22 hours 33 min 29 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.39 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 18:58:24 (4652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:12:25 (3444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:12:26 (3444): No heartbeat from core client for 30 sec - exiting C19:54:31 (5044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:54:34 (5044): No heartbeat from core client for 30 sec - exiting 19:54:35 (5044): No heartbeat from core client for 30 sec - exiting 19:54:36 (5044): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1304, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4304, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3800, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=968, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2900, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2900, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2272, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2272, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3300, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3300, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3504, iMonCtr=1 Model crash detected, will try to restart... 10:45:59 (4484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1 Model crash detected, will try to restart... 19:30:27 (2960): No heartbeat from core client for 30 sec - exiting 19:30:29 (2960): No heartbeat from core client for 30 sec - exiting 19:30:30 (2960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1 Model crash detected, will try to restart... 14:52:30 (3348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:39:13 (1056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:15:56 (4196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:15:40 (4528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 15:18:25 (3212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7092, iMonCtr=1 Model crash detected, will try to restart... 20:56:54 (4104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:33:29 (3752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:01:05 (7124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=1 Model crash detected, will try to restart... 19:09:08 (1488): No heartbeat from core client for 30 sec - exiting 19:09:09 (1488): No heartbeat from core client for 30 sec - exiting 19:09:10 (1488): No heartbeat from core client for 30 sec - exiting 19:09:11 (1488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2264, iMonCtr=1 Model crash detected, will try to restart... 10:22:21 (5492): No heartbeat from core client for 30 sec - exiting 10:22:23 (5492): No heartbeat from core client for 30 sec - exiting 10:22:24 (5492): No heartbeat from core client for 30 sec - exiting 10:22:25 (5492): No heartbeat from core client for 30 sec - exiting 10:22:26 (5492): No heartbeat from core client for 30 sec - exiting 10:22:27 (5492): No heartbeat from core client for 30 sec - exiting 10:22:28 (5492): No heartbeat from core client for 30 sec - exiting 10:22:29 (5492): No heartbeat from core client for 30 sec - exiting 10:22:30 (5492): No heartbeat from core client for 30 sec - exiting 10:22:31 (5492): No heartbeat from core client for 30 sec - exiting 10:22:32 (5492): No heartbeat from core client for 30 sec - exiting 10:22:33 (5492): No heartbeat from core client for 30 sec - exiting 10:22:34 (5492): No heartbeat from core client for 30 sec - exiting 10:22:35 (5492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:22:36 (5492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 15:20:55 (5240): No heartbeat from core client for 30 sec - exiting 15:20:56 (5240): No heartbeat from core client for 30 sec - exiting 15:20:57 (5240): No heartbeat from core client for 30 sec - exiting 15:20:58 (5240): No heartbeat from core client for 30 sec - exiting 15:20:59 (5240): No heartbeat from core client for 30 sec - exiting 15:21:00 (5240): No heartbeat from core client for 30 sec - exiting 15:21:01 (5240): No heartbeat from core client for 30 sec - exiting 15:21:02 (5240): No heartbeat from core client for 30 sec - exiting 15:21:04 (5240): No heartbeat from core client for 30 sec - exiting 15:21:05 (5240): No heartbeat from core client for 30 sec - exiting 15:21:06 (5240): No heartbeat from core client for 30 sec - exiting 15:21:07 (5240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:21:08 (5240): No heartbeat from core client for 30 sec - exiting 15:21:09 (5240): No heartbeat from core client for 30 sec - exiting 15:21:10 (5240): No heartbeat from core client for 30 sec - exiting 15:21:11 (5240): No heartbeat from core client for 30 sec - exiting 15:21:12 (5240): No heartbeat from core client for 30 sec - exiting 15:21:13 (5240): No heartbeat from core client for 30 sec - exiting 15:21:14 (5240): No heartbeat from core client for 30 sec - exiting 15:25:15 (6060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:29:16 (5616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:41:55 (5824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:00:15 (5192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:35:56 (5456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... C18:29:08 (2940): No heartbeat from core client for 30 sec - exiting 18:29:09 (2940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:23:42 (4244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:45:12 (2652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:07:39 (2984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:49:28 (3252): No heartbeat from core client for 30 sec - exiting 20:49:29 (3252): No heartbeat from core client for 30 sec - exiting 20:49:30 (3252): No heartbeat from core client for 30 sec - exiting 20:49:31 (3252): No heartbeat from core client for 30 sec - exiting 20:49:32 (3252): No heartbeat from core client for 30 sec - exiting 20:49:33 (3252): No heartbeat from core client for 30 sec - exiting 20:49:34 (3252): No heartbeat from core client for 30 sec - exiting 20:49:35 (3252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:49:36 (3252): No heartbeat from core client for 30 sec - exiting 21:04:34 (4840): No heartbeat from core client for 30 sec - exiting 21:04:35 (4840): No heartbeat from core client for 30 sec - exiting 21:04:36 (4840): No heartbeat from core client for 30 sec - exiting 21:04:37 (4840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:04:38 (4840): No heartbeat from core client for 30 sec - exiting 21:39:00 (4876): No heartbeat from core client for 30 sec - exiting 21:39:01 (4876): No heartbeat from core client for 30 sec - exiting 21:39:02 (4876): No heartbeat from core client for 30 sec - exiting 21:39:03 (4876): No heartbeat from core client for 30 sec - exiting 21:39:04 (4876): No heartbeat from core client for 30 sec - exiting 21:39:05 (4876): No heartbeat from core client for 30 sec - exiting 21:39:06 (4876): No heartbeat from core client for 30 sec - exiting 21:39:07 (4876): No heartbeat from core client for 30 sec - exiting 21:39:08 (4876): No heartbeat from core client for 30 sec - exiting 21:39:09 (4876): No heartbeat from core client for 30 sec - exiting 21:39:10 (4876): No heartbeat from core client for 30 sec - exiting 21:39:11 (4876): No heartbeat from core client for 30 sec - exiting 21:39:12 (4876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:39:13 (4876): No heartbeat from core client for 30 sec - exiting 21:39:14 (4876): No heartbeat from core client for 30 sec - exiting 20:27:33 (4532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:10:35 (4400): No heartbeat from core client for 30 sec - exiting 17:10:36 (4400): No heartbeat from core client for 30 sec - exiting 17:10:37 (4400): No heartbeat from core client for 30 sec - exiting 17:10:39 (4400): No heartbeat from core client for 30 sec - exiting 17:10:40 (4400): No heartbeat from core client for 30 sec - exiting 17:10:41 (4400): No heartbeat from core client for 30 sec - exiting 17:10:42 (4400): No heartbeat from core client for 30 sec - exiting 17:10:43 (4400): No heartbeat from core client for 30 sec - exiting 17:10:44 (4400): No heartbeat from core client for 30 sec - exiting 17:10:45 (4400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:00:13 (4300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1 Model crash detected, will try to restart... 15:53:40 (4612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:53:41 (4612): No heartbeat from core client for 30 sec - exiting 20:50:57 (4812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:46:08 (4868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:27:04 (4632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:06:30 (4788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... C18:15:53 (5028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:36:24 (3312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3476, iMonCtr=1 Model crash detected, will try to restart... 21:10:21 (5388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Feb 2013 02:46:18 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 259,200 | 513,206 | 1.9800 |
27 Jan 2013 16:02:25 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 233,280 | 461,092 | 1.9766 |
20 Jan 2013 16:26:04 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 207,360 | 408,847 | 1.9717 |
13 Jan 2013 23:31:21 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 181,440 | 356,630 | 1.9656 |
11 Jan 2013 03:03:21 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 155,520 | 306,089 | 1.9682 |
03 Jan 2013 16:26:06 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 129,600 | 254,380 | 1.9628 |
27 Dec 2012 00:22:33 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 103,680 | 202,769 | 1.9557 |
17 Dec 2012 00:28:07 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 77,760 | 152,493 | 1.9611 |
16 Dec 2012 00:47:57 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 51,840 | 103,370 | 1.9940 |
01 Dec 2012 18:08:13 | 1233028 | 15451053 | hadcm3n_ziml_1880_40_008250163_1 | 25,920 | 51,788 | 1.9980 |
©2024 cpdn.org