Name | hadcm3n_zik1_1920_40_008316144_3 |
Workunit | 8467279 |
Created | 19 May 2013, 22:46:07 UTC |
Sent | 19 May 2013, 22:46:08 UTC |
Report deadline | 19 Aug 2013, 6:13:19 UTC |
Received | 30 May 2013, 22:19:47 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 10 days 14 hours 45 min 43 sec |
CPU time | 10 days 9 hours 28 min 26 sec |
Validate state | Invalid |
Credit | 3,732.48 |
Device peak FLOPS | 2.01 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 17:25:14 (2335): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:35:20 (2999): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:56:34 (13127): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:12:22 (52349): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:40:35 (37890): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:40:36 (37890): No heartbeat from core client for 30 sec - exiting 06:40:37 (37890): No heartbeat from core client for 30 sec - exiting 06:40:38 (37890): No heartbeat from core client for 30 sec - exiting 06:40:39 (37890): No heartbeat from core client for 30 sec - exiting 06:40:40 (37890): No heartbeat from core client for 30 sec - exiting 06:40:41 (37890): No heartbeat from core client for 30 sec - exiting 06:40:42 (37890): No heartbeat from core client for 30 sec - exiting 06:40:43 (37890): No heartbeat from core client for 30 sec - exiting 06:40:44 (37890): No heartbeat from core client for 30 sec - exiting 06:40:45 (37890): No heartbeat from core client for 30 sec - exiting 06:40:46 (37890): No heartbeat from core client for 30 sec - exiting 06:40:47 (37890): No heartbeat from core client for 30 sec - exiting 06:40:48 (37890): No heartbeat from core client for 30 sec - exiting 06:40:49 (37890): No heartbeat from core client for 30 sec - exiting 06:40:50 (37890): No heartbeat from core client for 30 sec - exiting 06:40:51 (37890): No heartbeat from core client for 30 sec - exiting 06:40:52 (37890): No heartbeat from core client for 30 sec - exiting 06:40:53 (37890): No heartbeat from core client for 30 sec - exiting 06:40:54 (37890): No heartbeat from core client for 30 sec - exiting 06:40:55 (37890): No heartbeat from core client for 30 sec - exiting 06:40:56 (37890): No heartbeat from core client for 30 sec - exiting 06:40:57 (37890): No heartbeat from core client for 30 sec - exiting 06:45:35 (38928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:36:02 (39099): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:28 (41726): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:29 (41726): No heartbeat from core client for 30 sec - exiting 10:40:30 (41726): No heartbeat from core client for 30 sec - exiting 10:40:31 (41726): No heartbeat from core client for 30 sec - exiting 10:40:32 (41726): No heartbeat from core client for 30 sec - exiting 10:40:33 (41726): No heartbeat from core client for 30 sec - exiting 10:40:34 (41726): No heartbeat from core client for 30 sec - exiting 10:40:35 (41726): No heartbeat from core client for 30 sec - exiting 10:40:36 (41726): No heartbeat from core client for 30 sec - exiting 10:40:37 (41726): No heartbeat from core client for 30 sec - exiting 10:40:38 (41726): No heartbeat from core client for 30 sec - exiting 10:40:39 (41726): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 14:03:47 (41938): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:13:24 (44007): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:51:26 (44381): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:01:02 (44891): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:05:47 (45094): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:42:25 (45321): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:31:37 (45769): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:29:13 (46490): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:29:14 (46490): No heartbeat from core client for 30 sec - exiting 17:38:37 (47400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:23:44 (47660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:23:45 (47660): No heartbeat from core client for 30 sec - exiting 18:23:46 (47660): No heartbeat from core client for 30 sec - exiting 18:23:47 (47660): No heartbeat from core client for 30 sec - exiting 18:23:48 (47660): No heartbeat from core client for 30 sec - exiting 18:23:49 (47660): No heartbeat from core client for 30 sec - exiting 18:23:50 (47660): No heartbeat from core client for 30 sec - exiting 18:23:51 (47660): No heartbeat from core client for 30 sec - exiting 18:23:52 (47660): No heartbeat from core client for 30 sec - exiting 18:23:53 (47660): No heartbeat from core client for 30 sec - exiting 18:23:54 (47660): No heartbeat from core client for 30 sec - exiting 18:23:55 (47660): No heartbeat from core client for 30 sec - exiting 18:28:22 (48371): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:41:08 (48559): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:51:14 (48821): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:56:00 (49596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:56:01 (49596): No heartbeat from core client for 30 sec - exiting 19:56:02 (49596): No heartbeat from core client for 30 sec - exiting 19:56:03 (49596): No heartbeat from core client for 30 sec - exiting 19:56:04 (49596): No heartbeat from core client for 30 sec - exiting 19:56:05 (49596): No heartbeat from core client for 30 sec - exiting 19:56:06 (49596): No heartbeat from core client for 30 sec - exiting 19:56:07 (49596): No heartbeat from core client for 30 sec - exiting 19:56:08 (49596): No heartbeat from core client for 30 sec - exiting 19:56:09 (49596): No heartbeat from core client for 30 sec - exiting 19:56:10 (49596): No heartbeat from core client for 30 sec - exiting 19:56:11 (49596): No heartbeat from core client for 30 sec - exiting 19:56:12 (49596): No heartbeat from core client for 30 sec - exiting 19:56:13 (49596): No heartbeat from core client for 30 sec - exiting 19:56:14 (49596): No heartbeat from core client for 30 sec - exiting 19:56:15 (49596): No heartbeat from core client for 30 sec - exiting 19:56:16 (49596): No heartbeat from core client for 30 sec - exiting 19:56:17 (49596): No heartbeat from core client for 30 sec - exiting 19:56:18 (49596): No heartbeat from core client for 30 sec - exiting 19:56:19 (49596): No heartbeat from core client for 30 sec - exiting 19:56:20 (49596): No heartbeat from core client for 30 sec - exiting 19:56:21 (49596): No heartbeat from core client for 30 sec - exiting 19:56:22 (49596): No heartbeat from core client for 30 sec - exiting 19:56:23 (49596): No heartbeat from core client for 30 sec - exiting 19:56:24 (49596): No heartbeat from core client for 30 sec - exiting 19:56:25 (49596): No heartbeat from core client for 30 sec - exiting 20:00:33 (49783): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 21:12:14 (49978): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:17:02 (50742): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:17:03 (50742): No heartbeat from core client for 30 sec - exiting 21:17:04 (50742): No heartbeat from core client for 30 sec - exiting 21:17:05 (50742): No heartbeat from core client for 30 sec - exiting 21:17:06 (50742): No heartbeat from core client for 30 sec - exiting 21:17:07 (50742): No heartbeat from core client for 30 sec - exiting 21:17:08 (50742): No heartbeat from core client for 30 sec - exiting 21:17:09 (50742): No heartbeat from core client for 30 sec - exiting 21:17:10 (50742): No heartbeat from core client for 30 sec - exiting 21:17:11 (50742): No heartbeat from core client for 30 sec - exiting 21:17:12 (50742): No heartbeat from core client for 30 sec - exiting 21:17:13 (50742): No heartbeat from core client for 30 sec - exiting 21:17:14 (50742): No heartbeat from core client for 30 sec - exiting 21:17:15 (50742): No heartbeat from core client for 30 sec - exiting 21:17:16 (50742): No heartbeat from core client for 30 sec - exiting 21:17:17 (50742): No heartbeat from core client for 30 sec - exiting 21:17:18 (50742): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7793400] [0xf7793425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b01df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b3825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759b4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76eb400] [0xf76eb425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75081df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf750b825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74f34d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf779b400] [0xf779b425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b81df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75bb825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75a34d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf772d400] [0xf772d425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf754a1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf754d825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75354d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7787400] [0xf7787425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75a41df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a7825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf758f4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76ff400] [0xf76ff425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf751c1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf751f825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75074d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
30 May 2013 01:19:59 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 311,040 | 840,399 | 2.7019 |
29 May 2013 04:51:33 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 285,120 | 768,606 | 2.6957 |
28 May 2013 07:53:26 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 259,200 | 697,867 | 2.6924 |
27 May 2013 11:09:46 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 233,280 | 626,064 | 2.6837 |
26 May 2013 15:01:58 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 207,360 | 555,508 | 2.6790 |
25 May 2013 19:03:45 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 181,440 | 487,955 | 2.6893 |
25 May 2013 00:19:10 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 155,520 | 422,475 | 2.7165 |
24 May 2013 05:51:55 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 129,600 | 356,801 | 2.7531 |
23 May 2013 10:38:43 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 103,680 | 290,868 | 2.8054 |
22 May 2013 13:10:00 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 77,760 | 218,100 | 2.8048 |
21 May 2013 16:52:08 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 51,840 | 145,769 | 2.8119 |
20 May 2013 19:36:04 | 1282401 | 15790416 | hadcm3n_zik1_1920_40_008316144_3 | 25,920 | 72,486 | 2.7965 |
©2024 cpdn.org