Name | wah2_eas25_a22p_200211_25_994_012218127_1 |
Workunit | 12218127 |
Created | 25 Jun 2023, 1:37:21 UTC |
Sent | 25 Jun 2023, 1:57:48 UTC |
Report deadline | 6 Jul 2024, 7:17:48 UTC |
Received | 10 Aug 2023, 15:43:49 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1542065 |
Run time | 19 days 14 hours 32 min 58 sec |
CPU time | 3 days 15 hours 58 min 20 sec |
Validate state | Invalid |
Credit | 3,059.47 |
Device peak FLOPS | 1.00 GFLOPS |
Application version | Weather At Home 2 (wah2) v8.24 windows_intelx86 |
Peak working set size | 430.79 MB |
Peak swap size | 393.89 MB |
Peak disk usage | 880.23 MB |
Stderr | <core_client_version>7.22.2</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24132, selfPID=28300, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5080, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 10:21:00 (5080): called boinc_finish(0) Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13744, selfPID=13744, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16500, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13496, selfPID=13744, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=768, selfPID=768, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=768, selfPID=2412, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12728, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 10:40:35 (12728): called boinc_finish(0) Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=40328, selfPID=40328, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15084, selfPID=15084, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14016, selfPID=14016, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13944, selfPID=13944, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13944, selfPID=13656, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9960, selfPID=9960, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24260, selfPID=22120, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11320, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14708, selfPID=14708, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15940, selfPID=16368, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9024, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25084, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14152, selfPID=14152, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10696, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30012, selfPID=30012, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30012, selfPID=7032, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15472, selfPID=15472, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14156, selfPID=14156, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10180, selfPID=10180, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11036, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24696, selfPID=15100, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14636, selfPID=14636, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14636, selfPID=15576, iMonCtr=1 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8872, selfPID=8872, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8872, selfPID=14444, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13380, selfPID=14128, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13380, selfPID=13380, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13336, selfPID=13336, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14112, selfPID=14112, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15376, selfPID=15376, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13980, selfPID=14220, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13980, selfPID=13980, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15336, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 13:20:39 (15336): called boinc_finish(0) CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14472, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15048, selfPID=15048, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15048, selfPID=15304, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13972, selfPID=13972, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16136, selfPID=16136, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16136, selfPID=12816, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15036, selfPID=15228, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15036, selfPID=15036, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19376, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18508, selfPID=11336, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 02:03:42 (11336): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_8.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_9.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_10.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_11.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_12.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_13.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> 
<file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_14.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_15.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_16.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_17.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_18.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_19.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_20.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_21.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_22.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_23.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_24.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_25.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> 
<file_xfer_error> <file_name>wah2_eas25_a22p_200211_25_994_012218127_1_r973453553_restart.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Jul 2023 23:17:39 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 80,939 | 955,268 | 11.8023 |
26 Jul 2023 10:36:37 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 69,419 | 799,648 | 11.5192 |
20 Jul 2023 17:23:55 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 57,899 | 621,696 | 10.7376 |
20 Jul 2023 10:40:15 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 57,899 | 617,937 | 10.6727 |
10 Aug 2023 02:55:44 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 46,379 | 308,921 | 6.6608 |
13 Jul 2023 18:30:35 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 46,379 | 491,971 | 10.6076 |
09 Jul 2023 23:01:19 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 34,859 | 382,956 | 10.9859 |
08 Aug 2023 19:58:42 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 34,859 | 238,628 | 6.8455 |
05 Jul 2023 21:18:50 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 23,339 | 275,304 | 11.7959 |
06 Aug 2023 07:01:07 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 23,339 | 168,151 | 7.2047 |
04 Aug 2023 17:59:07 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 11,819 | 96,819 | 8.1918 |
02 Jul 2023 20:44:23 | 1542065 | 22328935 | wah2_eas25_a22p_200211_25_994_012218127_1 | 11,819 | 140,145 | 11.8576 |
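
The Average (sec/TS) column appears to be the CPU Time (sec) value divided by the Timestep value. The sketch below checks that assumption against a few rows copied from the table above; the function and variable names are illustrative only and are not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (80939, 955268, 11.8023),
    (57899, 621696, 10.7376),
    (46379, 308921, 6.6608),
    (11819, 96819, 8.1918),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return cpu_time_sec / timestep


def max_deviation(rows):
    """Largest absolute gap between the recomputed and the reported average."""
    return max(abs(seconds_per_timestep(cpu, ts) - avg) for ts, cpu, avg in rows)


if __name__ == "__main__":
    for ts, cpu, avg in TRICKLES:
        print(f"timestep={ts} recomputed={seconds_per_timestep(cpu, ts):.4f} reported={avg}")
```

For the rows checked, the recomputed values agree with the reported column to within rounding, which is consistent with the assumed definition but does not confirm how the server actually derives it.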
©2024 climateprediction.net