climateprediction.net home page
Task 16725058

Task 16725058

Name hadam3p_eu_na3j_2013_1_008807583_0
Workunit 8953561
Created 7 Jul 2014, 17:06:53 UTC
Sent 1 Aug 2014, 16:18:06 UTC
Report deadline 14 Jul 2015, 21:38:06 UTC
Received 3 Aug 2014, 16:30:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1298068
Run time 1 days 8 hours 25 min 2 sec
CPU time
Validate state Invalid
Credit 796.57
Device peak FLOPS 3.05 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
i686-apple-darwin
Stderr
<core_client_version>7.0.65</core_client_version>
<![CDATA[
<message>
process exited with code 25 (0x19, -231)
</message>
<stderr_txt>
17:13:30 (16945): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGSEGV: segmentation violation
23:07:40 (19732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:19:32 (22359): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:23:38 (32554): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:22:08 (36837): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:42:08 (68918): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:56:07 (84645): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1
forrtl: No space left on device
forrtl: severe (38): error during write, unit 0, file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadam3p_eu_na3j_2013_1_008807583/dataout/xaakg.err
Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  00365662  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0036412B  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  003289AD  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002DA7E5  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002D9F47  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0031F669  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0031C1CF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  000212FB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002B9D2C  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0000192B  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00001859  Unknown               Unknown  Unknown
Unknown            00000003  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=99354, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
zip I/O error: No space left on device

zip error: Could not create output file (../hadam3p_eu_na3j_2013_1_008807583_0_13.zip)
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=70124, selfPID=70124, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=82684, iMonCtr=2
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=86213, selfPID=86203, iMonCtr=1
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92308, selfPID=92309, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92353, selfPID=92354, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92435, selfPID=92436, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92476, selfPID=92477, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92556, selfPID=92557, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92826, selfPID=92827, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92952, selfPID=92953, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92998, selfPID=92999, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=93166, selfPID=93167, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=93212, selfPID=93213, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=93254, selfPID=93255, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=95943, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=99684, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=99902, selfPID=99896, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2371, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8290, iMonCtr=2
no start tag in app init data
no start tag in app init data
03:42:40 (9984): Can't parse init data file - running in standalone mode
no start tag in app init data
03:42:40 (9984): Can't parse init data file - running in standalone mode
Could not change to project directory 
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13094, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13297, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26151, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=29230, selfPID=29225, iMonCtr=1
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=63914, iMonCtr=2
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Aug 2014 01:20:52 1298068 16725058 hadam3p_eu_na3j_2013_1_008807583_0 46,176 109,478 2.3709
02 Aug 2014 17:14:25 1298068 16725058 hadam3p_eu_na3j_2013_1_008807583_0 34,656 81,431 2.3497
02 Aug 2014 09:13:21 1298068 16725058 hadam3p_eu_na3j_2013_1_008807583_0 23,136 53,859 2.3279
02 Aug 2014 00:12:08 1298068 16725058 hadam3p_eu_na3j_2013_1_008807583_0 11,616 27,292 2.3495


©2024 climateprediction.net