Name | wah2_eas25_a0j8_199011_25_994_012216130_1 |
Workunit | 12216130 |
Created | 25 Jun 2023, 1:46:22 UTC |
Sent | 25 Jun 2023, 1:57:48 UTC |
Report deadline | 6 Jul 2024, 7:17:48 UTC |
Received | 3 Aug 2023, 13:59:39 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1542065 |
Run time | 14 days 18 hours 24 min 1 sec |
CPU time | 11 days 15 hours 20 min 35 sec |
Validate state | Invalid |
Credit | 5,339.28 |
Device peak FLOPS | 1.00 GFLOPS |
Application version | Weather At Home 2 (wah2) v8.24 windows_intelx86 |
Peak working set size | 430.61 MB |
Peak swap size | 394.71 MB |
Peak disk usage | 874.21 MB |
Stderr | <core_client_version>7.22.2</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10404, selfPID=23444, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 04:19:49 (23444): called boinc_finish(0) CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12900, selfPID=12900, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13488, selfPID=13712, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4124, selfPID=4124, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12860, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 10:40:35 (12860): called boinc_finish(0) Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=32444, selfPID=32444, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13204, selfPID=13204, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14056, selfPID=14056, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13888, selfPID=13888, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13888, selfPID=14328, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9964, selfPID=9964, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24664, selfPID=22552, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14716, selfPID=14716, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15864, selfPID=16228, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14596, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14184, selfPID=14184, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31408, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=32556, selfPID=15212, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12168, selfPID=26352, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15504, selfPID=15724, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13784, selfPID=13784, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13348, selfPID=13348, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11908, selfPID=11908, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15408, selfPID=15408, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12752, selfPID=12752, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8844, selfPID=14340, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8844, selfPID=8844, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9592, selfPID=9592, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9592, selfPID=13736, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13492, selfPID=14164, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13492, selfPID=13492, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7660, selfPID=7660, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14180, selfPID=14180, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15404, selfPID=15404, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14012, selfPID=14260, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14012, selfPID=14012, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15352, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14496, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 09:49:59 (14496): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_8.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_9.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_10.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_11.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_12.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_13.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_14.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_15.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_16.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_17.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_18.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_19.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_20.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_21.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_22.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_23.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_24.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_25.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eas25_a0j8_199011_25_994_012216130_1_r1410772978_restart.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
02 Aug 2023 00:02:48 | 1542065 | 22328939 | wah2_eas25_a0j8_199011_25_994_012216130_1 | 80,939 | 956,106 | 11.8127 |
28 Jul 2023 05:41:43 | 1542065 | 22328939 | wah2_eas25_a0j8_199011_25_994_012216130_1 | 69,419 | 816,428 | 11.7609 |
21 Jul 2023 17:03:46 | 1542065 | 22328939 | wah2_eas25_a0j8_199011_25_994_012216130_1 | 57,899 | 636,627 | 10.9955 |
13 Jul 2023 20:32:02 | 1542065 | 22328939 | wah2_eas25_a0j8_199011_25_994_012216130_1 | 46,379 | 496,096 | 10.6966 |
11 Jul 2023 00:42:49 | 1542065 | 22328939 | wah2_eas25_a0j8_199011_25_994_012216130_1 | 34,859 | 388,995 | 11.1591 |
06 Jul 2023 02:36:21 | 1542065 | 22328939 | wah2_eas25_a0j8_199011_25_994_012216130_1 | 23,339 | 281,485 | 12.0607 |
03 Jul 2023 00:47:15 | 1542065 | 22328939 | wah2_eas25_a0j8_199011_25_994_012216130_1 | 11,819 | 146,138 | 12.3647 |
©2024 climateprediction.net