Name | oifs_43r3_ps_0489_1988050100_123_957_12174133_0 |
Workunit | 12174133 |
Created | 21 Dec 2022, 14:02:45 UTC |
Sent | 21 Dec 2022, 14:36:13 UTC |
Report deadline | 20 Jan 2023, 14:36:13 UTC |
Received | 10 Jan 2024, 14:20:58 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 148 (0x00000094) Unknown error code |
Computer ID | 1512362 |
Run time | 1 hours 19 min 11 sec |
CPU time | 36 min 43 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 4.72 GFLOPS |
Application version | OpenIFS 43r3 Perturbed Surface v1.05 x86_64-pc-linux-gnu |
Peak working set size | 4,613.70 MB |
Peak swap size | 4,860.08 MB |
Peak disk usage | 425.32 MB |
Stderr | <core_client_version>7.20.5</core_client_version> <![CDATA[ <message> process exited with code 148 (0x94, -108)</message> <stderr_txt> rocess Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process double free or corruption (top) [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.681] [signal_drhook@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1538] Received signal#15 (SIGTERM) :: 3007MB (heap), 3562MB (maxrss), 0MB (maxstack), 0 (paging), nsigs = 1 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.747] [signal_drhook@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1542] Also activating Harakiri-alarm (SIGALRM=14) to expire after 500s elapsed to prevent hangs, nsigs = 1 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.747] [signal_drhook@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1544] Harakiri signal handler 'signal_harakiri' for signal#14 (SIGALRM) installed at 0x81f3c0 (old at (nil)) [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.747] [signal_drhook@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1617] Signal#15 was caused by unrecognized si_code [memaddr=0x1], nsigs = 1 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.774] [signal_drhook@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1686] Starting DrHook backtrace for signal#15, nsigs = 1 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.774] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3843] 3007 MB (maxheap), 3562 MB (maxrss), 0 MB (maxstack) [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : MASTER [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : CNT0 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : CNT1 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : CNT2 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : CNT3 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : CNT4 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : STEPO [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : SCAN2M [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : GP_MODEL_HEAP [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : GP_MODEL [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : EC_PHYS_DRV [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.964] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : >OMP-PHYSICS CLDPP T/S (1002) [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.965] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : EC_PHYS [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.965] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : CALLPAR [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.965] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : SURFRAD_LAYER [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084025:1704181225:82.965] [c_drhook_print_@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:3897] : SURFRAD [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084026:1704181226:83.965] [signal_drhook@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1734] DrHook backtrace done for signal#15, nsigs = 1 [EC_DRHOOK:beniopa:1:1:1557:1557] [20240102:084026:1704181226:83.965] [signal_drhook@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1785] Calling previous signal handler at 0x1ce8bb0 for signal#15, nsigs = 1 forrtl: error (78): process killed (SIGTERM) Image PC Routine Line Source oifs_43r3_model.e 0000000001CE916B Unknown Unknown Unknown oifs_43r3_model.e 000000000081FFF1 Unknown Unknown Unknown oifs_43r3_model.e 0000000001DC9090 Unknown Unknown Unknown oifs_43r3_model.e 00000000017AF3F0 surfrad_ctl_mod._ 1 surfrad_ctl_mod.F90 oifs_43r3_model.e 00000000015C72FD surfrad_ 279 surfrad.F90 oifs_43r3_model.e 0000000001065EEA surfrad_layer_ 119 surfrad_layer.F90 oifs_43r3_model.e 0000000000F2E549 callpar_ 675 callpar.F90 oifs_43r3_model.e 0000000000E4940F ec_phys_ 670 ec_phys.F90 oifs_43r3_model.e 0000000000E3610C ec_phys_drv_ 599 ec_phys_drv.F90 oifs_43r3_model.e 0000000000BFA3D3 gp_model_ 613 gp_model.F90 oifs_43r3_model.e 00000000012E9482 gp_model_heap_ 74 gp_model_heap.F90 oifs_43r3_model.e 0000000000BBC1CE scan2m_ 535 scan2m.F90 oifs_43r3_model.e 000000000057A664 stepo_ 327 stepo.F90 oifs_43r3_model.e 000000000055DEEC cnt4_ 1133 cnt4.F90 oifs_43r3_model.e 00000000005471C9 cnt3_ 267 cnt3.F90 oifs_43r3_model.e 0000000000546412 cnt2_ 88 cnt2.F90 oifs_43r3_model.e 0000000000545E28 cnt1_ 92 cnt1.F90 oifs_43r3_model.e 000000000040708D cnt0_ 146 cnt0.F90 oifs_43r3_model.e 000000000040220F MAIN__ 96 master.F90 oifs_43r3_model.e 00000000004021A2 Unknown Unknown Unknown oifs_43r3_model.e 0000000001DCA390 Unknown Unknown Unknown oifs_43r3_model.e 000000000040206E Unknown Unknown Unknown (argv0) ../../projects/climateprediction.net/oifs_43r3_ps_1.05_x86_64-pc-linux-gnu (argv1) start_date: 1988050100 (argv2) exptid: hq0f (argv3) unique_member_id: 0489 (argv4) batchid: 957 (argv5) wuid: 12174133 (argv6) fclen: 123 (argv7) app_name: oifs_43r3_ps (argv8) nthreads: 1 Working directory is: /backup/BOINC/slots/0 Project directory is: /backup/BOINC/projects/climateprediction.net/ app name: oifs_43r3_ps version: 1.05 Location of temp folder: /backup/BOINC/projects/climateprediction.net/oifs_43r3_ps_12174133 ..mkdir for temp folder for results failed Copying: /backup/BOINC/projects/climateprediction.net/oifs_43r3_ps_app_1.05_x86_64-pc-linux-gnu.zip to: /backup/BOINC/slots/0/oifs_43r3_ps_app_1.05_x86_64-pc-linux-gnu.zip Unzipping the app zip file: /backup/BOINC/slots/0/oifs_43r3_ps_app_1.05_x86_64-pc-linux-gnu.zip Copying the namelist files from: ../../projects/climateprediction.net/jf_83a7b4aa75d4552bcacbd0686fde06fc to: /backup/BOINC/slots/0/oifs_43r3_ps_0489_1988050100_123_957_12174133.zip Unzipping the namelist zip file: /backup/BOINC/slots/0/oifs_43r3_ps_0489_1988050100_123_957_12174133.zip ic_ancil_file: ic_ancil_12174133 ifsdata_file: ifsdata_12174133 climate_data_file: clim_data_12174133 horiz_resolution: 159 vert_resolution: 60 grid_type: l_2 upload_interval: 24 utstep: 3600 nfrres: restart dump frequency (steps) 24 Copying IC ancils from: ../../projects/climateprediction.net/jf_bcab1eabc91a1e674d45472712723776 to: /backup/BOINC/slots/0/ic_ancil_12174133.zip Unzipping the IC ancils zip file: /backup/BOINC/slots/0/ic_ancil_12174133.zip ..mkdir for ifsdata folder failed Copying the ifsdata_file from: ../../projects/climateprediction.net/jf_67d2e825c08482209cd1f13aee04281a to: /backup/BOINC/slots/0/ifsdata/ifsdata_12174133.zip Unzipping the ifsdata_zip file: /backup/BOINC/slots/0/ifsdata/ifsdata_12174133.zip ..mkdir for the climate data folder failed Copying the climate data file from: ../../projects/climateprediction.net/jf_f32c668cf7829426ffd94699b48b9a2e to: /backup/BOINC/slots/0/159l_2/clim_data_12174133.zip Unzipping the climate data zip file: /backup/BOINC/slots/0/159l_2/clim_data_12174133.zip Checking for progress XML file: /backup/BOINC/slots/0/progress_file_12174133.xml Opened progress file ok : /backup/BOINC/slots/0/progress_file_12174133.xml -- Model is restarting -- Adjusting last_iter, 0, to previous model restart step. Creating progress file: /backup/BOINC/slots/0/progress_file_12174133.xml last_cpu_time: 2116 upload_file_number: 0 last_iter: -1 last_upload: 0 model_completed: 0 total_length_of_simulation: 10627200 result_base_name: oifs_43r3_ps_0489_1988050100_123_957_12174133_0_r1381215316 The child process has been launched with process id: 1554 Executing the command: /backup/BOINC/slots/0/oifs_43r3_model.exe [EC_DRHOOK:hostname:myproc:omptid:pid:unixtid] [YYYYMMDD:HHMMSS:epoch:walltime] [function@file:lineno] -- Max OpenMP threads = 1 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2091] fp = 0x3129fe0 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2098] DR_HOOK_ALLOW_COREDUMP=-1 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2104] Hardlimit for core file is now 0 (0x0) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2122] DR_HOOK_PROFILE_PROC=-1 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2128] DR_HOOK_PROFILE_LIMIT=-10.000 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2194] DR_HOOK_RANDOM_MEMSTAT=0 (RAND_MAX=2147483647) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2205] DR_HOOK_HASHBITS=16 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2213] DR_HOOK_NCALLSTACK=0 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2221] DR_HOOK_HARAKIRI_TIMEOUT=500 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2228] DR_HOOK_TRAPFPE=1 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2235] DR_HOOK_TRAPFPE_INVALID=1 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2242] DR_HOOK_TRAPFPE_DIVBYZERO=1 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [process_options@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:2249] DR_HOOK_TRAPFPE_OVERFLOW=1 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [ignore_signals@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1227] DR_HOOK_IGNORE_SIGNALS=<undef> [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [restore_default_signals@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1178] DR_HOOK_RESTORE_DEFAULT_SIGNALS=<undef> [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1937] New signal handler 'signal_drhook' for signal#6 (SIGABRT) at 0x81f820 (old at 0x1ce8bb0) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1938] New signal handler 'signal_drhook' for signal#7 (SIGBUS) at 0x81f820 (old at (nil)) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1939] New signal handler 'signal_drhook' for signal#11 (SIGSEGV) at 0x81f820 (old at 0x1ce8bb0) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1944] New signal handler 'signal_drhook' for signal#16 (SIGSTKFLT) at 0x81f820 (old at (nil)) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [trapfpe_treatment@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1149] DR_HOOK enables SIGFPE-related floating point trapping since DRHOOK_TRAPFPE=1 [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1948] New signal handler 'signal_drhook' for signal#8 (SIGFPE) at 0x81f820 (old at 0x1ce8bb0) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1949] New signal handler 'signal_drhook' for signal#4 (SIGILL) at 0x81f820 (old at 0x1ce8bb0) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1951] New signal handler 'signal_drhook' for signal#5 (SIGTRAP) at 0x81f820 (old at (nil)) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1952] New signal handler 'signal_drhook' for signal#2 (SIGINT) at 0x81f820 (old at 0x1ce8bb0) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1960] New signal handler 'signal_drhook' for signal#3 (SIGQUIT) at 0x81f820 (old at 0x1ce8bb0) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1961] New signal handler 'signal_drhook' for signal#15 (SIGTERM) at 0x81f820 (old at 0x1ce8bb0) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1966] New signal handler 'signal_drhook' for signal#24 (SIGXCPU) at 0x81f820 (old at (nil)) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1968] New signal handler 'signal_drhook' for signal#25 (SIGXFSZ) at 0x81f820 (old at (nil)) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [signal_drhook_init@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1973] New signal handler 'signal_drhook' for signal#31 (SIGSYS) at 0x81f820 (old at (nil)) [EC_DRHOOK:beniopa:1:1:1554:1554] [20240102:085905:1704182345:0.000] [catch_signals@/home/glenn/github/jamie_oifs43r3.git/src/ifsaux/support/drhook.c:1111] DR_HOOK_CATCH_SIGNALS=<undef> Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process Suspend request received from the BOINC client, suspending the child process Resuming the child process </stderr_txt> ]]> |
No trickles! |
---|
©2024 cpdn.org