Name | hadcm3n_n7jw_1880_40_008376885_0 |
Workunit | 8527744 |
Created | 30 May 2013, 3:46:28 UTC |
Sent | 30 May 2013, 4:40:07 UTC |
Report deadline | 29 Aug 2013, 12:07:18 UTC |
Received | 15 Jun 2013, 4:24:50 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 839248 |
Run time | 15 days 13 hours 49 min 8 sec |
CPU time | 11 days 3 hours 53 min 41 sec |
Validate state | Invalid |
Credit | 10,886.40 |
Device peak FLOPS | 2.02 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.65</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
08:36:08 (14136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:37:31 (1506): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:38:40 (1522): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:52:07 (1548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGSEGV: segmentation violation
Stack trace (9 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf76f9400]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x80cb8e1]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x807a232]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x837e9f4]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839982e]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f8b7]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf5)[0xf74c6cc5]
Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7619, iMonCtr=1
Model crash detected, will try to restart...
14:31:19 (7619): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
15 Jun 2013 03:17:28 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 907,200 | 964,439 | 1.0631 |
14 Jun 2013 16:02:31 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 881,280 | 928,850 | 1.0540 |
14 Jun 2013 05:51:17 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 855,360 | 893,246 | 1.0443 |
13 Jun 2013 19:48:43 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 829,440 | 857,708 | 1.0341 |
13 Jun 2013 09:19:00 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 803,520 | 822,327 | 1.0234 |
12 Jun 2013 23:14:52 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 777,600 | 787,545 | 1.0128 |
12 Jun 2013 13:10:27 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 751,680 | 751,964 | 1.0004 |
12 Jun 2013 03:28:42 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 725,760 | 717,664 | 0.9888 |
11 Jun 2013 18:02:46 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 699,840 | 684,237 | 0.9777 |
11 Jun 2013 08:28:04 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 673,920 | 650,846 | 0.9658 |
10 Jun 2013 22:48:32 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 648,000 | 616,738 | 0.9518 |
10 Jun 2013 12:49:26 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 622,080 | 581,543 | 0.9348 |
10 Jun 2013 02:10:03 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 596,160 | 547,022 | 0.9176 |
09 Jun 2013 15:17:11 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 570,240 | 513,167 | 0.8999 |
09 Jun 2013 03:52:48 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 544,320 | 720,359 | 1.3234 |
08 Jun 2013 16:42:25 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 518,400 | 685,041 | 1.3215 |
08 Jun 2013 05:07:06 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 492,480 | 648,766 | 1.3173 |
07 Jun 2013 18:21:05 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 466,560 | 615,142 | 1.3185 |
07 Jun 2013 07:19:48 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 440,640 | 578,928 | 1.3138 |
06 Jun 2013 21:50:05 | 839248 | 15805882 | hadcm3n_n7jw_1880_40_008376885_0 | 414,720 | 545,155 | 1.3145 |
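
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table; the function name `sec_per_timestep` is hypothetical and the formula is an inference from the figures, not something stated on this page.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# The division below is an assumed reading of the "Average (sec/TS)" column.
rows = [
    (907_200, 964_439, 1.0631),
    (570_240, 513_167, 0.8999),
    (544_320, 720_359, 1.3234),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per cumulative timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in rows:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    print(f"TS {timestep:>7}: computed {computed:.4f}, reported {reported:.4f}")
```

Under this reading, the drop in reported CPU time between timesteps 544,320 and 570,240 is enough on its own to account for the average falling from 1.3234 to 0.8999.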