Name | wah2_eas25_h22g_201412_24_1021_012308807_0 |
Workunit | 12308807 |
Created | 24 Jul 2024, 11:36:37 UTC |
Sent | 30 Jul 2024, 3:34:12 UTC |
Report deadline | 7 Nov 2024, 3:34:12 UTC |
Received | 4 Aug 2024, 5:54:29 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1538229 |
Run time | 2 days 17 hours 42 min 14 sec |
CPU time | 1 days 23 hours 23 min 52 sec |
Validate state | Invalid |
Credit | 4,991.48 |
Device peak FLOPS | 4.98 GFLOPS |
Application version | Weather At Home 2 (wah2) (region independent) v8.32 windows_intelx86 |
Peak working set size | 340.21 MB |
Peak swap size | 307.85 MB |
Peak disk usage | 95.10 MB |
Stderr | <core_client_version>8.0.2</core_client_version> <![CDATA[ <stderr_txt> modelGetExecutables: check control files, strTemp0 & 1 : B:\/projects/climateprediction.net/wah2_eas25_h22g_201412_24_1021_012308807/jobs/xadae.namelists B:\/projects/climateprediction.net/wah2_eas25_h22g_201412_24_1021_012308807/jobs/xacxf.namelists modelGetExecutables: unzipping control files : strInput & strTmp wah2_eas25_h22g_201412_24_1021_012308807.zip wah2_eas25_h22g_201412_24_1021_012308807/jobs gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f global model: command string: "B:\/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_h22g_201412_24_1021_012308807 generic_phase1_spinup_eas25_global_aabaka_f ic19610316_16_N96 ALLclim_ancil_146months_OSTIA_sst_2004-12-01_2017-01-30 ALLclim_ancil_146months_OSTIA_ice_2004-12-01_2017-01-30 SO2DMS_N96_cmip6hist-ssp245_2009-2020 oxi.addfa ozone_cmip6hist-ssp245_N96_1979_2031 regional model: command string: "B:\/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_h22g_201412_24_1021_012308807 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. executeModelProcess: MonID=29744, GCM_PID=29408, RCM_PID=18024 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Queuing intermediate upload for CPDN/BOINC: cpdnout1.zip Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Queuing intermediate upload for CPDN/BOINC: cpdnout2.zip Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Queuing intermediate upload for CPDN/BOINC: cpdnout3.zip Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Queuing intermediate upload for CPDN/BOINC: cpdnout4.zip Suspended CPDN Monitor - Suspend request from BOINC... Queuing intermediate upload for CPDN/BOINC: cpdnout5.zip Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Queuing intermediate upload for CPDN/BOINC: cpdnout6.zip Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Detaching shared memory... Done. modelGetExecutables: check control files, strTemp0 & 1 : B:\/projects/climateprediction.net/wah2_eas25_h22g_201412_24_1021_012308807/jobs/xadae.namelists B:\/projects/climateprediction.net/wah2_eas25_h22g_201412_24_1021_012308807/jobs/xacxf.namelists gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka_f gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka_f global model: command string: "B:\/projects/climateprediction.net/wah2am3m2_um_8.32_windows_intelx86.exe" wah2_eas25_h22g_201412_24_1021_012308807 generic_phase1_spinup_eas25_global_aabaka_f ic19610316_16_N96 ALLclim_ancil_146months_OSTIA_sst_2004-12-01_2017-01-30 ALLclim_ancil_146months_OSTIA_ice_2004-12-01_2017-01-30 SO2DMS_N96_cmip6hist-ssp245_2009-2020 oxi.addfa ozone_cmip6hist-ssp245_N96_1979_2031 regional model: command string: "B:\/projects/climateprediction.net/wah2rm3m2t_um_8.32_windows_intelx86.exe" wah2_eas25_h22g_201412_24_1021_012308807 cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem. executeModelProcess: MonID=21676, GCM_PID=22644, RCM_PID=18804 07:52:21 (18804): called boinc_finish(193) Global Worker:: CPDN process is not running, exiting, bRetVal = T, checkPID = 18804, selfPID = 22644, iMonCtr = 2 Controller:: CPDN process is not running, exiting, bRetVal = T, checkPID = 22644, selfPID = 21676, iMonCtr = 1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... monitor:finished called ... tidying up. monitor:finished: Uploading out files... Queuing intermediate upload for CPDN/BOINC: cpdnout_out.zip Detaching shared memory... Done. monitor:finished: Closed output file : stdout_<>.txt modelResultFiles : Removing : wah2_eas25_h22g_201412_24_1021_012308807 in B:\/projects/climateprediction.net monitor:finished: handing over to boinc_finish(RetVal=0) 07:52:26 (21676): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_7.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_8.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_9.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_10.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_11.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_12.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_13.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_14.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_15.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_16.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_17.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_18.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_19.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_20.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_21.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_22.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_23.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_24.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_h22g_201412_24_1021_012308807_0_r1683226208_restart.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Aug 2024 10:24:05 | 1538229 | 22466480 | wah2_eas25_h22g_201412_24_1021_012308807_0 | 69,419 | 149,773 | 2.1575 |
03 Aug 2024 00:43:40 | 1538229 | 22466480 | wah2_eas25_h22g_201412_24_1021_012308807_0 | 57,899 | 126,821 | 2.1904 |
02 Aug 2024 15:25:39 | 1538229 | 22466480 | wah2_eas25_h22g_201412_24_1021_012308807_0 | 46,379 | 101,188 | 2.1818 |
02 Aug 2024 06:12:18 | 1538229 | 22466480 | wah2_eas25_h22g_201412_24_1021_012308807_0 | 34,859 | 75,962 | 2.1791 |
01 Aug 2024 20:57:58 | 1538229 | 22466480 | wah2_eas25_h22g_201412_24_1021_012308807_0 | 23,339 | 50,968 | 2.1838 |
01 Aug 2024 11:40:47 | 1538229 | 22466480 | wah2_eas25_h22g_201412_24_1021_012308807_0 | 11,819 | 25,549 | 2.1617 |
©2024 cpdn.org