Name | hadcm3n_4icu_1940_40_008311853_2 |
Workunit | 8462988 |
Created | 14 May 2013, 10:30:42 UTC |
Sent | 14 May 2013, 10:31:49 UTC |
Report deadline | 13 Aug 2013, 17:59:00 UTC |
Received | 30 May 2013, 22:19:47 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 16 days 2 hours 20 min 38 sec |
CPU time | 15 days 18 hours 32 min 14 sec |
Validate state | Invalid |
Credit | 5,909.76 |
Device peak FLOPS | 2.01 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 15:42:45 (22208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:25:14 (2329): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:25:16 (2329): No heartbeat from core client for 30 sec - exiting 17:25:17 (2329): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 09:35:20 (2961): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:35:21 (2961): No heartbeat from core client for 30 sec - exiting 09:35:22 (2961): No heartbeat from core client for 30 sec - exiting 09:35:23 (2961): No heartbeat from core client for 30 sec - exiting 09:35:24 (2961): No heartbeat from core client for 30 sec - exiting 09:35:25 (2961): No heartbeat from core client for 30 sec - exiting 09:35:26 (2961): No heartbeat from core client for 30 sec - exiting 09:35:27 (2961): No heartbeat from core client for 30 sec - exiting 09:35:28 (2961): No heartbeat from core client for 30 sec - exiting 09:35:29 (2961): No heartbeat from core client for 30 sec - exiting 09:35:30 (2961): No heartbeat from core client for 30 sec - exiting 09:35:31 (2961): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 02:54:28 (13119): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:40:35 (10373): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:40:36 (10373): No heartbeat from core client for 30 sec - exiting 06:40:37 (10373): No heartbeat from core client for 30 sec - exiting 06:40:38 (10373): No heartbeat from core client for 30 sec - exiting 06:40:39 (10373): No heartbeat from core client for 30 sec - exiting 06:40:40 (10373): No heartbeat from core client for 30 sec - exiting 06:40:41 (10373): No heartbeat from core client for 30 sec - exiting 06:40:42 (10373): No heartbeat from core client for 30 sec - exiting 06:40:43 (10373): No heartbeat from core client for 30 sec - exiting 06:40:44 (10373): No heartbeat from core client for 30 sec - exiting 06:40:45 (10373): No heartbeat from core client for 30 sec - exiting 06:40:46 (10373): No heartbeat from core client for 30 sec - exiting 06:40:47 (10373): No heartbeat from core client for 30 sec - exiting 06:40:48 (10373): No heartbeat from core client for 30 sec - exiting 06:40:49 (10373): No heartbeat from core client for 30 sec - exiting 06:40:50 (10373): No heartbeat from core client for 30 sec - exiting 06:40:51 (10373): No heartbeat from core client for 30 sec - exiting 06:40:52 (10373): No heartbeat from core client for 30 sec - exiting 06:40:53 (10373): No heartbeat from core client for 30 sec - exiting 06:40:54 (10373): No heartbeat from core client for 30 sec - exiting 06:40:55 (10373): No heartbeat from core client for 30 sec - exiting 06:40:56 (10373): No heartbeat from core client for 30 sec - exiting 06:40:57 (10373): No heartbeat from core client for 30 sec - exiting 06:45:35 (38883): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:45:36 (38883): No heartbeat from core client for 30 sec - exiting 06:45:37 (38883): No heartbeat from core client for 30 sec - exiting 06:45:38 (38883): No heartbeat from core client for 30 sec - exiting 06:45:39 (38883): No heartbeat from core client for 30 sec - exiting 06:45:40 (38883): No heartbeat from core client for 30 sec - exiting 06:45:41 (38883): No heartbeat from core client for 30 sec - exiting 06:45:42 (38883): No heartbeat from core client for 30 sec - exiting 06:45:43 (38883): No heartbeat from core client for 30 sec - exiting 06:45:44 (38883): No heartbeat from core client for 30 sec - exiting 06:45:45 (38883): No heartbeat from core client for 30 sec - exiting 06:45:46 (38883): No heartbeat from core client for 30 sec - exiting 06:45:47 (38883): No heartbeat from core client for 30 sec - exiting 06:45:48 (38883): No heartbeat from core client for 30 sec - exiting 06:45:49 (38883): No heartbeat from core client for 30 sec - exiting 06:45:50 (38883): No heartbeat from core client for 30 sec - exiting 10:36:02 (39084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:27 (41718): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:28 (41718): No heartbeat from core client for 30 sec - exiting 10:40:29 (41718): No heartbeat from core client for 30 sec - exiting 10:40:30 (41718): No heartbeat from core client for 30 sec - exiting 10:40:31 (41718): No heartbeat from core client for 30 sec - exiting 10:40:32 (41718): No heartbeat from core client for 30 sec - exiting 10:40:33 (41718): No heartbeat from core client for 30 sec - exiting 10:40:34 (41718): No heartbeat from core client for 30 sec - exiting 10:40:35 (41718): No heartbeat from core client for 30 sec - exiting 10:40:36 (41718): No heartbeat from core client for 30 sec - exiting 10:40:37 (41718): No heartbeat from core client for 30 sec - exiting 10:40:38 (41718): No heartbeat from core client for 30 sec - exiting 10:40:39 (41718): No heartbeat from core client for 30 sec - exiting 14:03:47 (41930): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:13:25 (43998): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:13:26 (43998): No heartbeat from core client for 30 sec - exiting 14:13:27 (43998): No heartbeat from core client for 30 sec - exiting 14:13:28 (43998): No heartbeat from core client for 30 sec - exiting 14:13:29 (43998): No heartbeat from core client for 30 sec - exiting 14:51:26 (44372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:51:27 (44372): No heartbeat from core client for 30 sec - exiting 15:05:46 (44883): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:05:47 (44883): No heartbeat from core client for 30 sec - exiting 15:05:48 (44883): No heartbeat from core client for 30 sec - exiting 15:05:49 (44883): No heartbeat from core client for 30 sec - exiting 15:05:50 (44883): No heartbeat from core client for 30 sec - exiting 15:05:51 (44883): No heartbeat from core client for 30 sec - exiting 15:05:52 (44883): No heartbeat from core client for 30 sec - exiting 15:05:53 (44883): No heartbeat from core client for 30 sec - exiting 15:05:54 (44883): No heartbeat from core client for 30 sec - exiting 15:05:55 (44883): No heartbeat from core client for 30 sec - exiting 15:05:56 (44883): No heartbeat from core client for 30 sec - exiting 15:05:57 (44883): No heartbeat from core client for 30 sec - exiting 15:05:58 (44883): No heartbeat from core client for 30 sec - exiting 15:05:59 (44883): No heartbeat from core client for 30 sec - exiting 15:06:00 (44883): No heartbeat from core client for 30 sec - exiting 15:06:01 (44883): No heartbeat from core client for 30 sec - exiting 15:06:02 (44883): No heartbeat from core client for 30 sec - exiting 15:06:03 (44883): No heartbeat from core client for 30 sec - exiting 15:06:04 (44883): No heartbeat from core client for 30 sec - exiting 15:47:25 (45298): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:47:26 (45298): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 16:31:37 (45868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:31:38 (45868): No heartbeat from core client for 30 sec - exiting 16:31:39 (45868): No heartbeat from core client for 30 sec - exiting 16:31:40 (45868): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 17:29:13 (46498): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:29:14 (46498): No heartbeat from core client for 30 sec - exiting 17:29:15 (46498): No heartbeat from core client for 30 sec - exiting 17:29:16 (46498): No heartbeat from core client for 30 sec - exiting 17:29:17 (46498): No heartbeat from core client for 30 sec - exiting 17:29:18 (46498): No heartbeat from core client for 30 sec - exiting 17:29:19 (46498): No heartbeat from core client for 30 sec - exiting 17:29:20 (46498): No heartbeat from core client for 30 sec - exiting 17:29:21 (46498): No heartbeat from core client for 30 sec - exiting 17:29:22 (46498): No heartbeat from core client for 30 sec - exiting 17:29:23 (46498): No heartbeat from core client for 30 sec - exiting 17:29:24 (46498): No heartbeat from core client for 30 sec - exiting 17:38:36 (47406): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:38:37 (47406): No heartbeat from core client for 30 sec - exiting 17:38:38 (47406): No heartbeat from core client for 30 sec - exiting 17:38:39 (47406): No heartbeat from core client for 30 sec - exiting 17:38:40 (47406): No heartbeat from core client for 30 sec - exiting 17:38:41 (47406): No heartbeat from core client for 30 sec - exiting 17:38:42 (47406): No heartbeat from core client for 30 sec - exiting 18:23:44 (47664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:28:22 (48388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:28:23 (48388): No heartbeat from core client for 30 sec - exiting 18:28:24 (48388): No heartbeat from core client for 30 sec - exiting 18:28:25 (48388): No heartbeat from core client for 30 sec - exiting 19:51:15 (48575): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:56:00 (49540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:00:34 (49732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:01:05 (49732): No heartbeat from core client for 30 sec - exiting 20:01:06 (49732): No heartbeat from core client for 30 sec - exiting 20:01:07 (49732): No heartbeat from core client for 30 sec - exiting 20:01:08 (49732): No heartbeat from core client for 30 sec - exiting 21:12:14 (49922): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:17:02 (50697): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:17:03 (50697): No heartbeat from core client for 30 sec - exiting 21:17:04 (50697): No heartbeat from core client for 30 sec - exiting 21:17:05 (50697): No heartbeat from core client for 30 sec - exiting 21:17:06 (50697): No heartbeat from core client for 30 sec - exiting 21:17:07 (50697): No heartbeat from core client for 30 sec - exiting 21:17:08 (50697): No heartbeat from core client for 30 sec - exiting 21:17:09 (50697): No heartbeat from core client for 30 sec - exiting 21:17:10 (50697): No heartbeat from core client for 30 sec - exiting 21:17:11 (50697): No heartbeat from core client for 30 sec - exiting 21:17:12 (50697): No heartbeat from core client for 30 sec - exiting 21:17:13 (50697): No heartbeat from core client for 30 sec - exiting 21:17:14 (50697): No heartbeat from core client for 30 sec - exiting 21:17:15 (50697): No heartbeat from core client for 30 sec - exiting 21:17:16 (50697): No heartbeat from core client for 30 sec - exiting 21:17:17 (50697): No heartbeat from core client for 30 sec - exiting 21:17:18 (50697): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7756400] [0xf7756425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75731df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7576825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf755e4d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50885, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7748400] [0xf7748425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75651df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7568825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75504d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50885, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf7761400] [0xf7761425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf757e1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7581825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75694d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50885, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf773e400] [0xf773e425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf755b1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf755e825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75464d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50885, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77a0400] [0xf77a0425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75bd1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c0825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75a84d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50885, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf776d400] [0xf776d425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf758a1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf758d825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75754d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50885, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
30 May 2013 16:54:09 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 492,480 | 1,349,884 | 2.7410 |
29 May 2013 19:58:25 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 466,560 | 1,285,176 | 2.7546 |
29 May 2013 01:50:30 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 440,640 | 1,221,928 | 2.7731 |
28 May 2013 07:53:26 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 414,720 | 1,159,053 | 2.7948 |
27 May 2013 11:09:46 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 388,800 | 1,090,342 | 2.8044 |
26 May 2013 15:01:58 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 362,880 | 1,020,196 | 2.8114 |
25 May 2013 19:03:45 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 336,960 | 949,919 | 2.8191 |
24 May 2013 21:58:01 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 311,040 | 879,339 | 2.8271 |
24 May 2013 02:15:42 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 285,120 | 810,252 | 2.8418 |
23 May 2013 06:37:20 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 259,200 | 741,262 | 2.8598 |
22 May 2013 10:03:49 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 233,280 | 669,984 | 2.8720 |
21 May 2013 10:43:47 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 207,360 | 587,294 | 2.8322 |
20 May 2013 10:56:40 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 181,440 | 504,534 | 2.7807 |
19 May 2013 12:18:23 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 155,520 | 422,268 | 2.7152 |
18 May 2013 13:05:12 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 129,600 | 341,273 | 2.6333 |
17 May 2013 16:18:30 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 103,680 | 269,246 | 2.5969 |
16 May 2013 19:52:47 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 77,760 | 197,144 | 2.5353 |
16 May 2013 00:19:12 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 51,840 | 130,137 | 2.5104 |
15 May 2013 05:09:16 | 1282401 | 15782964 | hadcm3n_4icu_1940_40_008311853_2 | 25,920 | 64,232 | 2.4781 |
©2024 climateprediction.net