Name | hadcm3n_o7fh_1940_40_008382305_2 |
Workunit | 8533164 |
Created | 10 Jun 2013, 7:42:21 UTC |
Sent | 10 Jun 2013, 7:57:56 UTC |
Report deadline | 9 Sep 2013, 15:25:07 UTC |
Received | 11 Jun 2013, 6:42:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 19 hours 32 min 18 sec |
CPU time | 17 hours 44 min 24 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 12:49:13 (47143): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:49:14 (47143): No heartbeat from core client for 30 sec - exiting 12:53:27 (49166): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:09 (49321): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:10 (49321): No heartbeat from core client for 30 sec - exiting 12:57:11 (49321): No heartbeat from core client for 30 sec - exiting 21:38:25 (49454): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:42:19 (53918): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:42:20 (53918): No heartbeat from core client for 30 sec - exiting 21:42:21 (53918): No heartbeat from core client for 30 sec - exiting 21:42:22 (53918): No heartbeat from core client for 30 sec - exiting 21:42:23 (53918): No heartbeat from core client for 30 sec - exiting 21:42:24 (53918): No heartbeat from core client for 30 sec - exiting 21:42:25 (53918): No heartbeat from core client for 30 sec - exiting 21:42:26 (53918): No heartbeat from core client for 30 sec - exiting 21:42:27 (53918): No heartbeat from core client for 30 sec - exiting 21:42:28 (53918): No heartbeat from core client for 30 sec - exiting 21:42:29 (53918): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 21:58:25 (54086): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:06 (54334): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:07 (54334): No heartbeat from core client for 30 sec - exiting 22:02:08 (54334): No heartbeat from core client for 30 sec - exiting 22:02:09 (54334): No heartbeat from core client for 30 sec - exiting 22:02:10 (54334): No heartbeat from core client for 30 sec - exiting 22:02:11 (54334): No heartbeat from core client for 30 sec - exiting 22:02:12 (54334): No heartbeat from core client for 30 sec - exiting 22:02:13 (54334): No heartbeat from core client for 30 sec - exiting 22:02:14 (54334): No heartbeat from core client for 30 sec - exiting 22:02:15 (54334): No heartbeat from core client for 30 sec - exiting 22:02:16 (54334): No heartbeat from core client for 30 sec - exiting 22:02:17 (54334): No heartbeat from core client for 30 sec - exiting 22:02:18 (54334): No heartbeat from core client for 30 sec - exiting 22:02:19 (54334): No heartbeat from core client for 30 sec - exiting 22:02:20 (54334): No heartbeat from core client for 30 sec - exiting 22:02:21 (54334): No heartbeat from core client for 30 sec - exiting 22:02:22 (54334): No heartbeat from core client for 30 sec - exiting 22:02:23 (54334): No heartbeat from core client for 30 sec - exiting 22:02:24 (54334): No heartbeat from core client for 30 sec - exiting 22:02:25 (54334): No heartbeat from core client for 30 sec - exiting 22:06:19 (54493): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:06:20 (54493): No heartbeat from core client for 30 sec - exiting 22:06:21 (54493): No heartbeat from core client for 30 sec - exiting 22:06:22 (54493): No heartbeat from core client for 30 sec - exiting 22:06:23 (54493): No heartbeat from core client for 30 sec - exiting 22:06:24 (54493): No heartbeat from core client for 30 sec - exiting 22:06:25 (54493): No heartbeat from core client for 30 sec - exiting 22:06:26 (54493): No heartbeat from core client for 30 sec - exiting 22:06:27 (54493): No heartbeat from core client for 30 sec - exiting 22:06:28 (54493): No heartbeat from core client for 30 sec - exiting 22:06:29 (54493): No heartbeat from core client for 30 sec - exiting 22:06:30 (54493): No heartbeat from core client for 30 sec - exiting 22:06:31 (54493): No heartbeat from core client for 30 sec - exiting 22:06:32 (54493): No heartbeat from core client for 30 sec - exiting 22:06:33 (54493): No heartbeat from core client for 30 sec - exiting 22:06:34 (54493): No heartbeat from core client for 30 sec - exiting 22:10:59 (54666): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:11:00 (54666): No heartbeat from core client for 30 sec - exiting 22:11:01 (54666): No heartbeat from core client for 30 sec - exiting 22:11:02 (54666): No heartbeat from core client for 30 sec - exiting 00:32:53 (54849): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:36:30 (56192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 00:55:17 (56348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:59:16 (56638): No heartbeat from core client for 30 sec - exiting 00:59:17 (56638): No heartbeat from core client for 30 sec - exiting 00:59:18 (56638): No heartbeat from core client for 30 sec - exiting 00:59:19 (56638): No heartbeat from core client for 30 sec - exiting 00:59:20 (56638): No heartbeat from core client for 30 sec - exiting 00:59:21 (56638): No heartbeat from core client for 30 sec - exiting 00:59:22 (56638): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:03:53 (56803): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:11:39 (56925): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:15:33 (57111): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:15:34 (57111): No heartbeat from core client for 30 sec - exiting 01:19:28 (57270): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:19:29 (57270): No heartbeat from core client for 30 sec - exiting 01:19:30 (57270): No heartbeat from core client for 30 sec - exiting 04:28:21 (57424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:28:22 (57424): No heartbeat from core client for 30 sec - exiting 04:28:23 (57424): No heartbeat from core client for 30 sec - exiting 04:28:24 (57424): No heartbeat from core client for 30 sec - exiting 04:28:25 (57424): No heartbeat from core client for 30 sec - exiting 04:40:35 (59144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:40:36 (59144): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 04:44:22 (59342): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:44:23 (59342): No heartbeat from core client for 30 sec - exiting 04:44:24 (59342): No heartbeat from core client for 30 sec - exiting 04:44:25 (59342): No heartbeat from core client for 30 sec - exiting 04:44:26 (59342): No heartbeat from core client for 30 sec - exiting 04:44:27 (59342): No heartbeat from core client for 30 sec - exiting 04:44:28 (59342): No heartbeat from core client for 30 sec - exiting 04:44:29 (59342): No heartbeat from core client for 30 sec - exiting 04:44:30 (59342): No heartbeat from core client for 30 sec - exiting 04:44:31 (59342): No heartbeat from core client for 30 sec - exiting 04:44:32 (59342): No heartbeat from core client for 30 sec - exiting 04:44:34 (59342): No heartbeat from core client for 30 sec - exiting 04:44:35 (59342): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 05:01:46 (59497): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Jun 2013 03:41:07 | 1282401 | 15837317 | hadcm3n_o7fh_1940_40_008382305_2 | 25,920 | 62,987 | 2.4301 |
©2024 climateprediction.net