Task 15836316

Name hadcm3n_4m0h_2020_40_008390494_1
Workunit 8541353
Created 9 Jun 2013, 9:04:05 UTC
Sent 9 Jun 2013, 9:52:23 UTC
Report deadline 8 Sep 2013, 17:19:34 UTC
Received 18 Jul 2013, 14:05:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1170335
Run time 5 days 19 hours 57 min 34 sec
CPU time 5 days 12 hours 23 min 28 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 1.99 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
03:26:43 (10359): No heartbeat from core client for 30 sec - exiting
03:26:49 (10359): No heartbeat from core client for 30 sec - exiting
03:26:50 (10359): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:45:57 (12304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
21:28:08 (20040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:28:18 (20040): No heartbeat from core client for 30 sec - exiting
21:28:19 (20040): No heartbeat from core client for 30 sec - exiting
21:28:20 (20040): No heartbeat from core client for 30 sec - exiting
21:28:21 (20040): No heartbeat from core client for 30 sec - exiting
21:28:22 (20040): No heartbeat from core client for 30 sec - exiting
21:28:24 (20040): No heartbeat from core client for 30 sec - exiting
21:28:25 (20040): No heartbeat from core client for 30 sec - exiting
21:28:26 (20040): No heartbeat from core client for 30 sec - exiting
21:28:27 (20040): No heartbeat from core client for 30 sec - exiting
21:28:28 (20040): No heartbeat from core client for 30 sec - exiting
21:28:29 (20040): No heartbeat from core client for 30 sec - exiting
21:28:30 (20040): No heartbeat from core client for 30 sec - exiting
21:28:31 (20040): No heartbeat from core client for 30 sec - exiting
21:28:32 (20040): No heartbeat from core client for 30 sec - exiting
23:55:58 (20420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:30:43 (20992): No heartbeat from core client for 30 sec - exiting
00:30:47 (20992): No heartbeat from core client for 30 sec - exiting
00:30:48 (20992): No heartbeat from core client for 30 sec - exiting
00:30:49 (20992): No heartbeat from core client for 30 sec - exiting
00:30:50 (20992): No heartbeat from core client for 30 sec - exiting
00:30:51 (20992): No heartbeat from core client for 30 sec - exiting
00:30:52 (20992): No heartbeat from core client for 30 sec - exiting
00:30:53 (20992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Error: Input file: dataout/4m0hko.pjm7c10 is not a valid UM file.
Error converting file to netcdf: dataout/4m0hko.pjm7c10
Error: Input file: dataout/4m0hko.pim7c10 is not a valid UM file.
Error converting file to netcdf: dataout/4m0hko.pim7c10
Error: Input file: dataout/4m0hko.pfm7c10 is not a valid UM file.
Error converting file to netcdf: dataout/4m0hko.pfm7c10
Error: Input file: dataout/4m0hka.phm7c10 is not a valid UM file.
Error converting file to netcdf: dataout/4m0hka.phm7c10
Error: Input file: dataout/4m0hka.pgm7c10 is not a valid UM file.
Error converting file to netcdf: dataout/4m0hka.pgm7c10
Error: Input file: dataout/4m0hka.pem7c10 is not a valid UM file.
Error converting file to netcdf: dataout/4m0hka.pem7c10
Error: Input file: dataout/4m0hka.pdm7c10 is not a valid UM file.
Error converting file to netcdf: dataout/4m0hka.pdm7c10
00:52:27 (7765): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:52:29 (7765):01:13:53 (7896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:13:54 (7896): No heartbeat from core client for 30 sec - exiting
01:13:55 (7896): No heartbeat from core client for 30 sec - exiting
01:13:56 (7896): No heartbeat from core client for 30 sec - exiting
01:27:04 (8046): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:27:14 (8046): No heartbeat from core client for 30 sec - exiting
01:45:57 (8172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:57:14 (8283): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:57:15 (8283): No heartbeat from core client for 30 sec - exiting
01:57:16 (8283): No heartbeat from core client for 30 sec - exiting
01:57:17 (8283): No heartbeat from core client for 30 sec - exiting
01:57:18 (8283): No heartbeat from core client for 30 sec - exiting
02:08:41 (8404): No heartbeat from core client for 30 sec - exiting
02:08:44 (8404): No heartbeat from core client for 30 sec - exiting
02:08:45 (8404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:08:46 (8404): No heartbeat from core client for 30 sec - exiting
02:08:47 (8404): No heartbeat from core client for 30 sec - exiting
02:08:48 (8404): No heartbeat from core client for 30 sec - exiting
02:41:26 (8467): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:41:31 (8467): No heartbeat from core client for 30 sec - exiting
02:41:32 (8467): No heartbeat from core client for 30 sec - exiting
02:41:33 (8467): No heartbeat from core client for 30 sec - exiting
02:41:34 (8467): No heartbeat from core client for 30 sec - exiting
02:41:35 (8467): No heartbeat from core client for 30 sec - exiting
02:41:36 (8467): No heartbeat from core client for 30 sec - exiting
02:41:37 (8467): No heartbeat from core client for 30 sec - exiting
02:41:38 (8467): No heartbeat from core client for 30 sec - exiting
02:41:39 (8467): No heartbeat from core client for 30 sec - exiting
02:41:40 (8467): No heartbeat from core client for 30 sec - exiting
02:41:41 (8467): No heartbeat from core client for 30 sec - exiting
02:41:42 (8467): No heartbeat from core client for 30 sec - exiting
02:41:43 (8467): No heartbeat from core client for 30 sec - exiting
02:41:44 (8467): No heartbeat from core client for 30 sec - exiting
02:41:45 (8467): No heartbeat from core client for 30 sec - exiting
02:41:46 (8467): No heartbeat from core client for 30 sec - exiting
02:41:47 (8467): No heartbeat from core client for 30 sec - exiting
02:41:48 (8467): No heartbeat from core client for 30 sec - exiting
02:41:49 (8467): No heartbeat from core client for 30 sec - exiting
02:54:25 (8605): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:54:30 (8605): No heartbeat from core client for 30 sec - exiting
02:54:31 (8605): No heartbeat from core client for 30 sec - exiting
02:54:32 (8605): No heartbeat from core client for 30 sec - exiting
02:54:33 (8605): No heartbeat from core client for 30 sec - exiting
03:07:07 (8706): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:07:11 (8706): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (10 frames):
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77d2400]
[0xf77d2430]
/lib32/libc.so.6(gsignal+0x51)[0xf76537d1]
/lib32/libc.so.6(abort+0x182)[0xf7656c32]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf763fb56]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8784, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf778c400]
[0xf778c430]
/lib32/libc.so.6(gsignal+0x51)[0xf760d7d1]
/lib32/libc.so.6(abort+0x182)[0xf7610c32]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf75f9b56]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8784, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77af400]
[0xf77af430]
/lib32/libc.so.6(gsignal+0x51)[0xf76307d1]
/lib32/libc.so.6(abort+0x182)[0xf7633c32]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf761cb56]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8784, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77c5400]
[0xf77c5430]
/lib32/libc.so.6(gsignal+0x51)[0xf76467d1]
/lib32/libc.so.6(abort+0x182)[0xf7649c32]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf7632b56]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8784, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf773f400]
[0xf773f430]
/lib32/libc.so.6(gsignal+0x51)[0xf75c07d1]
/lib32/libc.so.6(abort+0x182)[0xf75c3c32]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf75acb56]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8784, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76f4400]
[0xf76f4430]
/lib32/libc.so.6(gsignal+0x51)[0xf75757d1]
/lib32/libc.so.6(abort+0x182)[0xf7578c32]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf7561b56]
/home/antonio/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8784, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
23 Jul 2013 17:14:25  1170335  15836316   hadcm3n_4m0h_2020_40_008390494_1  181,440   470,004         2.5904
23 Jul 2013 17:14:28  1170335  15836316   hadcm3n_4m0h_2020_40_008390494_1  155,520   402,416         2.5876
23 Jul 2013 17:14:27  1170335  15836316   hadcm3n_4m0h_2020_40_008390494_1  129,600   334,275         2.5793
23 Jul 2013 17:14:27  1170335  15836316   hadcm3n_4m0h_2020_40_008390494_1  103,680   266,395         2.5694
23 Jul 2013 17:14:27  1170335  15836316   hadcm3n_4m0h_2020_40_008390494_1  77,760    198,764         2.5561
23 Jul 2013 17:14:30  1170335  15836316   hadcm3n_4m0h_2020_40_008390494_1  51,840    133,899         2.5829
11 Jul 2013 23:41:34  1170335  15836316   hadcm3n_4m0h_2020_40_008390494_1  25,920    68,098          2.6272
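
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against the trickle rows listed above. The names TRICKLES and seconds_per_timestep are illustrative only and are not part of any BOINC or CPDN tooling.

```python
# Trickle rows copied from the table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (181_440, 470_004, 2.5904),
    (155_520, 402_416, 2.5876),
    (129_600, 334_275, 2.5793),
    (103_680, 266_395, 2.5694),
    (77_760, 198_764, 2.5561),
    (51_840, 133_899, 2.5829),
    (25_920, 68_098, 2.6272),
]


def seconds_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Assumed formula: cumulative CPU seconds / cumulative timesteps, 4 d.p."""
    return round(cpu_seconds / timestep, 4)


# Print the computed value next to the value reported in the table.
for timestep, cpu_seconds, reported in TRICKLES:
    computed = seconds_per_timestep(timestep, cpu_seconds)
    print(f"{timestep:>8,}  {cpu_seconds:>8,}  computed={computed:.4f}  reported={reported:.4f}")
```

Under this assumption the computed values agree with every reported row, which suggests the host held a steady rate of roughly 2.56 to 2.63 CPU seconds per timestep until the task failed.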

