Name | hadcm3n_zbe9_1880_40_008251930_3 |
Workunit | 8407054 |
Created | 22 Feb 2013, 3:57:58 UTC |
Sent | 22 Feb 2013, 3:58:01 UTC |
Report deadline | 24 May 2013, 11:25:12 UTC |
Received | 4 Mar 2013, 5:08:11 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1240735 |
Run time | 9 days 22 hours 50 min 22 sec |
CPU time | 9 days 20 hours 3 min 32 sec |
Validate state | Invalid |
Credit | 6,842.88 |
Device peak FLOPS | 3.08 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:49:46 (10016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:35:15 (10732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:45:22 (29083): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:45:37 (29083): No heartbeat from core client for 30 sec - exiting 23:45:38 (29083): No heartbeat from core client for 30 sec - exiting 23:45:39 (29083): No heartbeat from core client for 30 sec - exiting 23:45:40 (29083): No heartbeat from core client for 30 sec - exiting 23:45:41 (29083): No heartbeat from core client for 30 sec - exiting 23:45:42 (29083): No heartbeat from core client for 30 sec - exiting 23:45:43 (29083): No heartbeat from core client for 30 sec - exiting 23:45:44 (29083): No heartbeat from core client for 30 sec - exiting 23:45:45 (29083): No heartbeat from core client for 30 sec - exiting 23:45:46 (29083): No heartbeat from core client for 30 sec - exiting 23:45:47 (29083): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 23:47:14 (31838): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:47:52 (31838): No heartbeat from core client for 30 sec - exiting 23:47:53 (31838): No heartbeat from core client for 30 sec - exiting 23:47:54 (31838): No heartbeat from core client for 30 sec - exiting 23:47:55 (31838): No heartbeat from core client for 30 sec - exiting 23:47:56 (31838): No heartbeat from core client for 30 sec - exiting 23:47:57 (31838): No heartbeat from core client for 30 sec - exiting 23:47:58 (31838): No heartbeat from core client for 30 sec - exiting 23:47:59 (31838): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7792400] [0xf7792430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75a51df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a8825] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75904d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32415, iMonCtr=1 SIGABRT: abort called Stack trace (9 frames): /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf771d400] [0xf771d430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75301df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7533825] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf751b4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf76e5400] [0xf76e5430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf74f81df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf74fb825] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74e34d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77be400] [0xf77be430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75d11df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75d4825] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75bc4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77b2400] [0xf77b2430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75c51df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c8825] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75b04d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7770400] [0xf7770430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75831df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7586825] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf756e4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf779e400] [0xf779e430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b11df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b4825] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759c4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Mar 2013 20:32:32 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 570,240 | 821,106 | 1.4399 |
03 Mar 2013 10:06:11 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 544,320 | 783,762 | 1.4399 |
02 Mar 2013 23:23:32 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 518,400 | 745,635 | 1.4383 |
02 Mar 2013 12:55:06 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 492,480 | 708,164 | 1.4380 |
02 Mar 2013 02:20:26 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 466,560 | 670,515 | 1.4371 |
01 Mar 2013 15:15:51 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 440,640 | 632,488 | 1.4354 |
01 Mar 2013 04:41:29 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 414,720 | 594,928 | 1.4345 |
28 Feb 2013 18:00:23 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 388,800 | 556,730 | 1.4319 |
28 Feb 2013 07:17:24 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 362,880 | 518,833 | 1.4298 |
27 Feb 2013 20:33:11 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 336,960 | 480,530 | 1.4261 |
27 Feb 2013 10:02:06 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 311,040 | 443,098 | 1.4246 |
26 Feb 2013 23:19:28 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 285,120 | 404,898 | 1.4201 |
26 Feb 2013 12:45:10 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 259,200 | 367,254 | 1.4169 |
26 Feb 2013 02:08:04 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 233,280 | 329,290 | 1.4116 |
25 Feb 2013 15:41:56 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 207,360 | 291,897 | 1.4077 |
25 Feb 2013 05:13:31 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 181,440 | 254,723 | 1.4039 |
24 Feb 2013 18:27:21 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 155,520 | 215,787 | 1.3875 |
24 Feb 2013 07:44:59 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 129,600 | 178,036 | 1.3737 |
23 Feb 2013 20:28:53 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 103,680 | 140,921 | 1.3592 |
23 Feb 2013 11:07:37 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 77,760 | 109,329 | 1.4060 |
©2024 cpdn.org