Name | hadcm3n_38np_1980_40_008318566_0 |
Workunit | 8469701 |
Created | 24 Feb 2013, 5:58:52 UTC |
Sent | 24 Feb 2013, 5:59:04 UTC |
Report deadline | 26 May 2013, 13:26:15 UTC |
Received | 3 Mar 2013, 13:40:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1269677 |
Run time | 19 hours 46 min 51 sec |
CPU time | 12 hours 27 min 36 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 3.42 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 17:25:03 (1861): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:25:04 (1861): No heartbeat from core client for 30 sec - exiting 17:25:05 (1861): No heartbeat from core client for 30 sec - exiting 17:25:06 (1861): No heartbeat from core client for 30 sec - exiting 17:25:07 (1861): No heartbeat from core client for 30 sec - exiting 17:25:08 (1861): No heartbeat from core client for 30 sec - exiting 17:25:09 (1861): No heartbeat from core client for 30 sec - exiting 17:25:10 (1861): No heartbeat from core client for 30 sec - exiting 17:25:11 (1861): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 17:39:03 (1895): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:39:04 (1895): No heartbeat from core client for 30 sec - exiting 17:39:05 (1895): No heartbeat from core client for 30 sec - exiting 17:40:17 (1914): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:40:24 (1914): No heartbeat from core client for 30 sec - exiting 17:40:26 (1914): No heartbeat from core client for 30 sec - exiting 17:40:27 (1914): No heartbeat from core client for 30 sec - exiting 17:40:28 (1914): No heartbeat from core client for 30 sec - exiting 17:40:29 (1914): No heartbeat from core client for 30 sec - exiting 17:40:30 (1914): No heartbeat from core client for 30 sec - exiting 17:40:31 (1914): No heartbeat from core client for 30 sec - exiting 17:40:32 (1914): No heartbeat from core client for 30 sec - exiting 17:40:33 (1914): No heartbeat from core client for 30 sec - exiting 17:40:34 (1914): No heartbeat from core client for 30 sec - exiting 17:40:35 (1914): No heartbeat from core client for 30 sec - exiting 17:40:36 (1914): No heartbeat from core client for 30 sec - exiting 19:30:22 (1929): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:30:23 (1929): No heartbeat from core client for 30 sec - exiting 19:30:24 (1929): No heartbeat from core client for 30 sec - exiting 19:30:25 (1929): No heartbeat from core client for 30 sec - exiting 19:30:26 (1929): No heartbeat from core client for 30 sec - exiting 19:30:27 (1929): No heartbeat from core client for 30 sec - exiting 19:30:28 (1929): No heartbeat from core client for 30 sec - exiting 19:30:29 (1929): No heartbeat from core client for 30 sec - exiting 19:30:30 (1929): No heartbeat from core client for 30 sec - exiting 20:24:23 (1990): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:24:24 (1990): No heartbeat from core client for 30 sec - exiting 20:24:25 (1990): No heartbeat from core client for 30 sec - exiting 20:24:26 (1990): No heartbeat from core client for 30 sec - exiting 20:24:27 (1990): No heartbeat from core client for 30 sec - exiting 20:24:28 (1990): No heartbeat from core client for 30 sec - exiting 20:24:29 (1990): No heartbeat from core client for 30 sec - exiting 20:24:30 (1990): No heartbeat from core client for 30 sec - exiting 20:24:31 (1990): No heartbeat from core client for 30 sec - exiting 20:24:32 (1990): No heartbeat from core client for 30 sec - exiting 20:24:33 (1990): No heartbeat from core client for 30 sec - exiting 20:24:34 (1990): No heartbeat from core client for 30 sec - exiting 20:24:35 (1990): No heartbeat from core client for 30 sec - exiting 20:24:36 (1990): No heartbeat from core client for 30 sec - exiting 21:16:14 (2023): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:34:08 (2052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:34:09 (2052): No heartbeat from core client for 30 sec - exiting 22:10:59 (2070): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:11:00 (2070): No heartbeat from core client for 30 sec - exiting 22:11:01 (2070): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 01:27:23 (2111): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:05:47 (2210): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:08:09 (2265): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:08:10 (2265): No heartbeat from core client for 30 sec - exiting 03:08:11 (2265): No heartbeat from core client for 30 sec - exiting 03:08:12 (2265): No heartbeat from core client for 30 sec - exiting 03:08:13 (2265): No heartbeat from core client for 30 sec - exiting 03:08:14 (2265): No heartbeat from core client for 30 sec - exiting 03:08:15 (2265): No heartbeat from core client for 30 sec - exiting 03:08:16 (2265): No heartbeat from core client for 30 sec - exiting 04:06:30 (2275): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:06:31 (2275): No heartbeat from core client for 30 sec - exiting 04:06:32 (2275): No heartbeat from core client for 30 sec - exiting 06:45:21 (2311): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:45:22 (2311): No heartbeat from core client for 30 sec - exiting 06:45:23 (2311): No heartbeat from core client for 30 sec - exiting 06:45:24 (2311): No heartbeat from core client for 30 sec - exiting 06:45:25 (2311): No heartbeat from core client for 30 sec - exiting 06:45:26 (2311): No heartbeat from core client for 30 sec - exiting 06:45:27 (2311): No heartbeat from core client for 30 sec - exiting 06:45:28 (2311): No heartbeat from core client for 30 sec - exiting 06:45:29 (2311): No heartbeat from core client for 30 sec - exiting 06:45:30 (2311): No heartbeat from core client for 30 sec - exiting 06:45:31 (2311): No heartbeat from core client for 30 sec - exiting 06:45:32 (2311): No heartbeat from core client for 30 sec - exiting 06:45:33 (2311): No heartbeat from core client for 30 sec - exiting 06:45:34 (2311): No heartbeat from core client for 30 sec - exiting 06:45:35 (2311): No heartbeat from core client for 30 sec - exiting 06:45:36 (2311): No heartbeat from core client for 30 sec - exiting 06:45:37 (2311): No heartbeat from core client for 30 sec - exiting 06:45:38 (2311): No heartbeat from core client for 30 sec - exiting 06:45:39 (2311): No heartbeat from core client for 30 sec - exiting 06:45:40 (2311): No heartbeat from core client for 30 sec - exiting 06:45:41 (2311): No heartbeat from core client for 30 sec - exiting 06:45:42 (2311): No heartbeat from core client for 30 sec - exiting 06:45:43 (2311): No heartbeat from core client for 30 sec - exiting 06:45:44 (2311): No heartbeat from core client for 30 sec - exiting 06:45:45 (2311): No heartbeat from core client for 30 sec - exiting 06:45:46 (2311): No heartbeat from core client for 30 sec - exiting 06:45:47 (2311): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Atmos Hold Restart file rename failed on atmos_restart.hold 07:15:50 (2526): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:15:51 (2526): No heartbeat from core client for 30 sec - exiting 07:15:52 (2526): No heartbeat from core client for 30 sec - exiting 07:15:53 (2526): No heartbeat from core client for 30 sec - exiting 07:15:54 (2526): No heartbeat from core client for 30 sec - exiting 07:15:55 (2526): No heartbeat from core client for 30 sec - exiting 08:16:35 (2572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:16:36 (2572): No heartbeat from core client for 30 sec - exiting 08:16:37 (2572): No heartbeat from core client for 30 sec - exiting 08:16:38 (2572): No heartbeat from core client for 30 sec - exiting 08:16:39 (2572): No heartbeat from core client for 30 sec - exiting 08:16:40 (2572): No heartbeat from core client for 30 sec - exiting 08:16:41 (2572): No heartbeat from core client for 30 sec - exiting 08:16:42 (2572): No heartbeat from core client for 30 sec - exiting 08:37:55 (2610): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:53:59 (2633): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:54:00 (2633): No heartbeat from core client for 30 sec - exiting 08:54:01 (2633): No heartbeat from core client for 30 sec - exiting 08:54:02 (2633): No heartbeat from core client for 30 sec - exiting 08:54:03 (2633): No heartbeat from core client for 30 sec - exiting 08:54:04 (2633): No heartbeat from core client for 30 sec - exiting 08:54:05 (2633): No heartbeat from core client for 30 sec - exiting 08:54:06 (2633): No heartbeat from core client for 30 sec - exiting 09:29:04 (2654): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:13:38 (2683): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:13:39 (2683): No heartbeat from core client for 30 sec - exiting 10:13:40 (2683): No heartbeat from core client for 30 sec - exiting 10:13:41 (2683): No heartbeat from core client for 30 sec - exiting 10:13:42 (2683): No heartbeat from core client for 30 sec - exiting 10:13:43 (2683): No heartbeat from core client for 30 sec - exiting 10:13:44 (2683): No heartbeat from core client for 30 sec - exiting 10:13:45 (2683): No heartbeat from core client for 30 sec - exiting 10:13:46 (2683): No heartbeat from core client for 30 sec - exiting 12:40:41 (2712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:40:42 (2712): No heartbeat from core client for 30 sec - exiting 12:40:43 (2712): No heartbeat from core client for 30 sec - exiting 12:40:44 (2712): No heartbeat from core client for 30 sec - exiting 12:40:45 (2712): No heartbeat from core client for 30 sec - exiting 12:40:46 (2712): No heartbeat from core client for 30 sec - exiting 12:40:47 (2712): No heartbeat from core client for 30 sec - exiting 12:43:48 (2786): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:43:49 (2786): No heartbeat from core client for 30 sec - exiting 12:43:50 (2786): No heartbeat from core client for 30 sec - exiting 12:43:51 (2786): No heartbeat from core client for 30 sec - exiting 12:43:52 (2786): No heartbeat from core client for 30 sec - exiting 12:43:53 (2786): No heartbeat from core client for 30 sec - exiting 12:43:54 (2786): No heartbeat from core client for 30 sec - exiting 12:43:55 (2786): No heartbeat from core client for 30 sec - exiting 12:43:56 (2786): No heartbeat from core client for 30 sec - exiting 14:37:21 (2798): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:37:22 (2798): No heartbeat from core client for 30 sec - exiting 14:37:23 (2798): No heartbeat from core client for 30 sec - exiting 14:37:24 (2798): No heartbeat from core client for 30 sec - exiting 14:37:25 (2798): No heartbeat from core client for 30 sec - exiting 14:37:26 (2798): No heartbeat from core client for 30 sec - exiting 14:37:27 (2798): No heartbeat from core client for 30 sec - exiting 14:37:28 (2798): No heartbeat from core client for 30 sec - exiting 14:37:29 (2798): No heartbeat from core client for 30 sec - exiting 14:37:30 (2798): No heartbeat from core client for 30 sec - exiting 14:37:31 (2798): No heartbeat from core client for 30 sec - exiting 14:37:32 (2798): No heartbeat from core client for 30 sec - exiting 14:37:33 (2798): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77bc400] [0xf77bc430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75e31df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75e6825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75ce4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2868, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7760400] [0xf7760430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75871df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf758a825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75724d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2868, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7774400] [0xf7774430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf759b1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf759e825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75864d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2868, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7776400] [0xf7776430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf759d1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a0825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75884d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2868, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf770b400] [0xf770b430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75321df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7535825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf751d4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2868, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf770e400] [0xf770e430] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75351df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7538825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75204d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2868, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Mar 2013 05:50:12 | 1269677 | 15632649 | hadcm3n_38np_1980_40_008318566_0 | 25,920 | 29,312 | 1.1309 |
©2024 cpdn.org