Name | wah2_eas25_a0sq_199211_25_994_012216472_2 |
Workunit | 12216472 |
Created | 25 Jun 2023, 1:44:01 UTC |
Sent | 25 Jun 2023, 1:57:48 UTC |
Report deadline | 6 Jul 2024, 7:17:48 UTC |
Received | 11 Aug 2023, 16:20:03 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1542065 |
Run time | 22 days 23 hours 57 min 54 sec |
CPU time | 19 days 10 hours 33 min 25 sec |
Validate state | Invalid |
Credit | 10,658.83 |
Device peak FLOPS | 1.00 GFLOPS |
Application version | Weather At Home 2 (wah2) v8.24 windows_intelx86 |
Peak working set size | 434.42 MB |
Peak swap size | 396.30 MB |
Peak disk usage | 1,831.52 MB |
Stderr | <core_client_version>7.22.2</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5648, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11272, selfPID=11272, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14492, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13460, selfPID=13696, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13460, selfPID=13460, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16136, selfPID=16136, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13608, selfPID=13608, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15332, selfPID=15332, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6840, selfPID=17160, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13668, selfPID=13808, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17508, selfPID=17508, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2464, selfPID=2464, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14440, selfPID=14440, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13912, selfPID=13912, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13832, selfPID=13832, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13832, selfPID=14288, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12308, selfPID=12308, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=356, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9072, selfPID=17700, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14596, selfPID=14596, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13860, selfPID=13860, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10028, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14040, selfPID=14252, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14040, selfPID=14040, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20328, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=34792, selfPID=4884, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15424, selfPID=15424, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12712, selfPID=12712, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12756, selfPID=12756, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23828, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN proCPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14784, selfPID=14784, iMonCtr=1 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15592, selfPID=15592, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15592, selfPID=15828, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14296, selfPID=14092, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14296, selfPID=14296, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4824, selfPID=4824, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14060, selfPID=14060, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13892, selfPID=14120, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13892, selfPID=13892, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11012, selfPID=11012, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11012, selfPID=7324, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16420, selfPID=16420, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14912, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15304, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15248, selfPID=15248, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14428, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15104, selfPID=15104, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14836, selfPID=14836, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13996, selfPID=13996, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13996, selfPID=14384, iMonCtr=1 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 CPDN Monitor - Quit request from BOINC... GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12780, selfPID=12780, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14712, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13652, selfPID=13652, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30856, selfPID=24128, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 12:04:18 (24128): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_15.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_16.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_17.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_18.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_19.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_20.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_21.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_22.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_23.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_24.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0sq_199211_25_994_012216472_2_r755307538_25.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Aug 2023 18:45:55 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 161,579 | 1,645,372 | 10.1831 |
09 Aug 2023 00:22:42 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 150,059 | 1,574,877 | 10.4951 |
07 Aug 2023 01:47:27 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 138,539 | 1,503,764 | 10.8544 |
06 Aug 2023 10:04:34 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 138,539 | 1,501,949 | 10.8413 |
04 Aug 2023 21:01:20 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 127,019 | 1,430,385 | 11.2612 |
03 Aug 2023 01:31:33 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 115,499 | 1,349,092 | 11.6806 |
29 Jul 2023 18:59:34 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 103,979 | 1,219,755 | 11.7308 |
26 Jul 2023 01:05:46 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 92,459 | 1,055,002 | 11.4105 |
20 Jul 2023 18:24:28 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 80,939 | 892,942 | 11.0323 |
20 Jul 2023 10:40:15 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 80,939 | 889,079 | 10.9846 |
14 Jul 2023 04:16:17 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 69,419 | 760,435 | 10.9543 |
11 Jul 2023 08:20:53 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 57,899 | 647,804 | 11.1885 |
06 Jul 2023 19:27:24 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 46,379 | 542,328 | 11.6934 |
03 Jul 2023 21:50:05 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 34,859 | 414,386 | 11.8875 |
01 Jul 2023 18:04:46 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 23,339 | 278,332 | 11.9256 |
28 Jun 2023 21:24:50 | 1542065 | 22328938 | wah2_eas25_a0sq_199211_25_994_012216472_2 | 11,819 | 164,088 | 13.8834 |
©2024 climateprediction.net