Name | hadcm3n_82mm_1980_40_008461345_0 |
Workunit | 8612201 |
Created | 30 Aug 2013, 22:04:02 UTC |
Sent | 2 Sep 2013, 7:16:40 UTC |
Report deadline | 2 Dec 2013, 14:43:51 UTC |
Received | 15 Sep 2013, 3:23:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1291356 |
Run time | 4 days 8 hours 56 min 44 sec |
CPU time | 4 days 4 hours 2 min 34 sec |
Validate state | Invalid |
Credit | 1,555.20 |
Device peak FLOPS | 2.08 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Error: Input file: dataout/82mmko.pji2c10 is not a valid UM file. Error converting file to netcdf: dataout/82mmko.pji2c10 Error: Input file: dataout/82mmko.pii2c10 is not a valid UM file. Error converting file to netcdf: dataout/82mmko.pii2c10 Error: Input file: dataout/82mmko.pfi2c10 is not a valid UM file. Error converting file to netcdf: dataout/82mmko.pfi2c10 Error: Input file: dataout/82mmka.phi2c10 is not a valid UM file. Error converting file to netcdf: dataout/82mmka.phi2c10 Error: Input file: dataout/82mmka.pgi2c10 is not a valid UM file. Error converting file to netcdf: dataout/82mmka.pgi2c10 Error: Input file: dataout/82mmka.pei2c10 is not a valid UM file. Error converting file to netcdf: dataout/82mmka.pei2c10 Error: Input file: dataout/82mmka.pdi2c10 is not a valid UM file. Error converting file to netcdf: dataout/82mmka.pdi2c10 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:16:15 (24966): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:36:48 (10220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:41:15 (1462): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:41:17 (1462): No heartbeat from core client for 30 sec - exiting 02:41:18 (1462): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 07:03:30 (1717): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:03:33 (1717): No heartbeat from core client for 30 sec - exiting 07:03:34 (1717): No heartbeat from core client for 30 sec - exiting 07:03:35 (1717): No heartbeat from core client for 30 sec - exiting 07:03:36 (1717): No heartbeat from core client for 30 sec - exiting 07:03:37 (1717): No heartbeat from core client for 30 sec - exiting 07:03:38 (1717): No heartbeat from core client for 30 sec - exiting 07:03:39 (1717): No heartbeat from core client for 30 sec - exiting 07:03:40 (1717): No heartbeat from core client for 30 sec - exiting 07:03:41 (1717): No heartbeat from core client for 30 sec - exiting 07:03:42 (1717): No heartbeat from core client for 30 sec - exiting 07:03:43 (1717): No heartbeat from core client for 30 sec - exiting 07:03:44 (1717): No heartbeat from core client for 30 sec - exiting 07:03:45 (1717): No heartbeat from core client for 30 sec - exiting 07:03:46 (1717): No heartbeat from core client for 30 sec - exiting 07:03:47 (1717): No heartbeat from core client for 30 sec - exiting 07:03:48 (1717): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 07:17:34 (2786): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:17:36 (2786): No heartbeat from core client for 30 sec - exiting 07:17:37 (2786): No heartbeat from core client for 30 sec - exiting 07:17:38 (2786): No heartbeat from core client for 30 sec - exiting 07:17:39 (2786): No heartbeat from core client for 30 sec - exiting 07:17:40 (2786): No heartbeat from core client for 30 sec - exiting 07:17:41 (2786): No heartbeat from core client for 30 sec - exiting 07:17:42 (2786): No heartbeat from core client for 30 sec - exiting 07:17:43 (2786): No heartbeat from core client for 30 sec - exiting 07:17:44 (2786): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 07:23:01 (2820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:23:02 (2820): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048 Atmos Hold Restart file rename failed on atmos_restart.hold Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048 Atmos Hold Restart file rename failed on atmos_restart.hold Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048 Atmos Hold Restart file rename failed on atmos_restart.hold Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048 Atmos Hold Restart file rename failed on atmos_restart.hold Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048 Atmos Hold Restart file rename failed on atmos_restart.hold Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Sep 2013 00:41:18 | 1291356 | 15995321 | hadcm3n_82mm_1980_40_008461345_0 | 129,600 | 318,018 | 2.4538 |
08 Sep 2013 01:14:22 | 1291356 | 15995321 | hadcm3n_82mm_1980_40_008461345_0 | 103,680 | 257,424 | 2.4829 |
05 Sep 2013 18:51:57 | 1291356 | 15995321 | hadcm3n_82mm_1980_40_008461345_0 | 77,760 | 194,664 | 2.5034 |
04 Sep 2013 16:14:09 | 1291356 | 15995321 | hadcm3n_82mm_1980_40_008461345_0 | 51,840 | 129,709 | 2.5021 |
03 Sep 2013 13:22:23 | 1291356 | 15995321 | hadcm3n_82mm_1980_40_008461345_0 | 25,920 | 65,504 | 2.5272 |
©2024 cpdn.org