Name | hadcm3n_zd57_1880_40_008250963_3 |
Workunit | 8406087 |
Created | 6 Dec 2012, 17:21:24 UTC |
Sent | 6 Dec 2012, 17:21:27 UTC |
Report deadline | 8 Mar 2013, 0:48:38 UTC |
Received | 5 Jan 2013, 8:01:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1184315 |
Run time | 24 days 2 hours 34 min |
CPU time | 22 days 5 hours 5 min 26 sec |
Validate state | Invalid |
Credit | 8,087.04 |
Device peak FLOPS | 1.92 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 06:37:41 (15070): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:40:00 (5244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:24:48 (5772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:27:18 (23227): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:29:39 (23927): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:36:14 (24601): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:38:34 (25934): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:59 (26597): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:43:23 (27283): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:45:43 (27955): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:48:07 (28624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:50:27 (29295): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:52:46 (29952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:04:35 (30609): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:09:00 (20934): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:11:19 (21911): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:13:39 (22585): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 23:32:48 (3605): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 11:06:42 (2936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:50:13 (4525): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:52:32 (13827): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:45:35 (14491): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:47:53 (8525): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:18:41 (9187): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 00:03:38 (3568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:03:39 (3568): No heartbeat from core client for 30 sec - exiting 00:03:40 (3568): No heartbeat from core client for 30 sec - exiting 00:03:41 (3568): No heartbeat from core client for 30 sec - exiting 00:03:42 (3568): No heartbeat from core client for 30 sec - exiting 00:03:43 (3568): No heartbeat from core client for 30 sec - exiting 00:03:44 (3568): No heartbeat from core client for 30 sec - exiting 00:03:45 (3568): No heartbeat from core client for 30 sec - exiting 00:03:46 (3568): No heartbeat from core client for 30 sec - exiting 00:03:47 (3568): No heartbeat from core client for 30 sec - exiting 00:03:48 (3568): No heartbeat from core client for 30 sec - exiting 00:03:49 (3568): No heartbeat from core client for 30 sec - exiting 00:03:50 (3568): No heartbeat from core client for 30 sec - exiting 00:03:51 (3568): No heartbeat from core client for 30 sec - exiting 00:03:52 (3568): No heartbeat from core client for 30 sec - exiting 00:03:53 (3568): No heartbeat from core client for 30 sec - exiting 00:03:54 (3568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:18:33 (3325): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:55:55 (6809): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:58:14 (26445): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 11:17:39 (4398): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:20:00 (4992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:25:42 (5517): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:28:05 (17679): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... SIGABRT: abort called Stack trace (10 frames): /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xffffe400] [0xffffe430] /lib/i686/libc.so.6(gsignal+0x51)[0xb75f50f1] /lib/i686/libc.so.6(abort+0x17e)[0xb75f6c1e] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i686/libc.so.6(__libc_start_main+0xe6)[0xb75e1ca6] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3197, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xffffe400] [0xffffe430] /lib/i686/libc.so.6(gsignal+0x51)[0xb76b70f1] /lib/i686/libc.so.6(abort+0x17e)[0xb76b8c1e] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i686/libc.so.6(__libc_start_main+0xe6)[0xb76a3ca6] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3197, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xffffe400] [0xffffe430] /lib/i686/libc.so.6(gsignal+0x51)[0xb75e80f1] /lib/i686/libc.so.6(abort+0x17e)[0xb75e9c1e] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i686/libc.so.6(__libc_start_main+0xe6)[0xb75d4ca6] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3197, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xffffe400] [0xffffe430] /lib/i686/libc.so.6(gsignal+0x51)[0xb76760f1] /lib/i686/libc.so.6(abort+0x17e)[0xb7677c1e] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i686/libc.so.6(__libc_start_main+0xe6)[0xb7662ca6] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3197, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xffffe400] [0xffffe430] /lib/i686/libc.so.6(gsignal+0x51)[0xb76df0f1] /lib/i686/libc.so.6(abort+0x17e)[0xb76e0c1e] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i686/libc.so.6(__libc_start_main+0xe6)[0xb76cbca6] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3197, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (10 frames): /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xffffe400] [0xffffe430] /lib/i686/libc.so.6(gsignal+0x51)[0xb763c0f1] /lib/i686/libc.so.6(abort+0x17e)[0xb763dc1e] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i686/libc.so.6(__libc_start_main+0xe6)[0xb7628ca6] /opt/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3197, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 Dec 2012 19:48:57 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 673,920 | 1,346,523 | 1.9980 |
26 Dec 2012 23:52:27 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 648,000 | 1,293,177 | 1.9956 |
26 Dec 2012 08:23:51 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 622,080 | 1,239,821 | 1.9930 |
25 Dec 2012 19:49:21 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 596,160 | 1,186,393 | 1.9901 |
24 Dec 2012 22:30:05 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 570,240 | 1,132,968 | 1.9868 |
23 Dec 2012 01:26:30 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 544,320 | 1,079,607 | 1.9834 |
22 Dec 2012 03:03:14 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 518,400 | 1,027,395 | 1.9819 |
21 Dec 2012 20:02:35 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 492,480 | 977,875 | 1.9856 |
20 Dec 2012 22:22:27 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 466,560 | 928,279 | 1.9896 |
20 Dec 2012 08:26:44 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 440,640 | 878,605 | 1.9939 |
19 Dec 2012 19:18:14 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 414,720 | 828,785 | 1.9984 |
19 Dec 2012 04:21:11 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 388,800 | 779,168 | 2.0040 |
18 Dec 2012 07:31:26 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 362,880 | 729,633 | 2.0107 |
18 Dec 2012 07:31:26 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 336,960 | 680,092 | 2.0183 |
18 Dec 2012 07:31:26 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 311,040 | 630,430 | 2.0268 |
18 Dec 2012 07:31:26 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 285,120 | 580,612 | 2.0364 |
18 Dec 2012 07:31:26 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 259,200 | 530,245 | 2.0457 |
18 Dec 2012 07:31:26 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 233,280 | 476,891 | 2.0443 |
18 Dec 2012 07:31:26 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 207,360 | 423,851 | 2.0440 |
18 Dec 2012 07:31:26 | 1184315 | 15472233 | hadcm3n_zd57_1880_40_008250963_3 | 181,440 | 370,823 | 2.0438 |
©2024 cpdn.org