-
Notifications
You must be signed in to change notification settings - Fork 47
Description
Dear devs,
I'm experiencing a problem with running a simulation with indx=12, indy= 12
.
I'm using an HPC cluster with a plenty of resources, and for example indx=11, indy=11, indz=13
runs fine.As soon as I set for example indx=12, indy= 12, indz=11
it crashes with a memory segfault.
It does not create the output folders except for ELOG, and the last lines in elog-*
are:
new loop, nter= 1
Info: particles being passed further = 232215
Info: particles being passed further = 453895
Info: particles being passed further = 428535
The debugger traces issue till input_class.f03
:
source/simulation_class.f03:163
source/input_class.f03:104
and then shows mpi errors with ompi_bcast_f
, PMPI_Bcast
etc
This is very puzzling, since the same MPI layout works well with smaller simulations, and up-scaling it does not help (neither using different mpi implementations). Could be there any kind of buffer which overflows for such a slice size (4096 x 4096)?
I'm running out of testing options, so any help of advice is appreciated!