CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > Software User Forums > ANSYS > CFX

MEMERR error when running on cluster

Register Blogs Community New Posts Updated Threads Search

Like Tree1Likes
  • 1 Post By ghorrocks

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
Old   June 27, 2023, 14:50
Default MEMERR error when running on cluster
  #1
New Member
 
Aspen
Join Date: Nov 2022
Location: Canada
Posts: 11
Rep Power: 3
imagineaspen is on a distinguished road
Hi all,

I am trying to run a simulation on a Linux cluster but getting this error immediately after starting the simulation (also attached the output file):

Details of error:-
----------------
Error detected by routine PEEKI
CDANAM = /FLOW/GEOMETRY/KZnBcp
CRESLT = ADRS

Current Directory : /FLOW/PHYSICS

+--------------------------------------------------------------------+
ERROR #001100279 has occurred in subroutine ErrAction.
Message:
Stopped in routine MEMERR

An error has occurred in cfx5solve:
The ANSYS CFX partitioner exited with return code 1.


Does anyone know what this error means? It seems to have something to do with memory? I have tried increasing the topology estimate factor but I still get the same error. Also tried changing the partitioning method since it seems like the simulation fails during partitioning. The simulation is able to run on another computer, it is only when I run it on the cluster that this occurs.
Attached Files
File Type: txt outputfile.txt (43.3 KB, 2 views)
imagineaspen is offline   Reply With Quote

Old   June 27, 2023, 16:50
Default
  #2
Super Moderator
 
Glenn Horrocks
Join Date: Mar 2009
Location: Sydney, Australia
Posts: 17,703
Rep Power: 143
ghorrocks is just really niceghorrocks is just really niceghorrocks is just really niceghorrocks is just really nice
It is failing in the partitioning run, not the solver.

In these cases the first thing to try is to use a different partitioning algorithm. The default (METIS) does give memory problems occasionally. Try using recursive bisection or one of the other algorithms as they tend to have much less memory problems (but do not produce as good partitions in general).
__________________
Note: I do not answer CFD questions by PM. CFD questions should be posted on the forum.
ghorrocks is offline   Reply With Quote

Old   June 28, 2023, 13:58
Default
  #3
New Member
 
Aspen
Join Date: Nov 2022
Location: Canada
Posts: 11
Rep Power: 3
imagineaspen is on a distinguished road
Thank you for the advice Glenn. I've tried different partitioning algorithms but unfortunately I am still getting the same error.
imagineaspen is offline   Reply With Quote

Old   June 28, 2023, 18:58
Default
  #4
Super Moderator
 
Glenn Horrocks
Join Date: Mar 2009
Location: Sydney, Australia
Posts: 17,703
Rep Power: 143
ghorrocks is just really niceghorrocks is just really niceghorrocks is just really niceghorrocks is just really nice
Then it is possibel that you are running out of memory in the partitioner. Have a look at the memory settings in the partitioner, also try the large problem partitioner.

Alternately, you could partition this on a large computer with lots of memory and write the partitions to a partition file. Then you can use that partition file on this cluster so you do not have to do the partitioning at all on this cluster.
__________________
Note: I do not answer CFD questions by PM. CFD questions should be posted on the forum.
ghorrocks is offline   Reply With Quote

Old   June 30, 2023, 17:31
Default
  #5
Senior Member
 
Join Date: Jun 2009
Posts: 1,804
Rep Power: 32
Opaque will become famous soon enough
Quote:
Originally Posted by imagineaspen View Post
Hi all,

I am trying to run a simulation on a Linux cluster but getting this error immediately after starting the simulation (also attached the output file):

Details of error:-
----------------
Error detected by routine PEEKI
CDANAM = /FLOW/GEOMETRY/KZnBcp
CRESLT = ADRS

Current Directory : /FLOW/PHYSICS

+--------------------------------------------------------------------+
ERROR #001100279 has occurred in subroutine ErrAction.
Message:
Stopped in routine MEMERR

An error has occurred in cfx5solve:
The ANSYS CFX partitioner exited with return code 1.


Does anyone know what this error means? It seems to have something to do with memory? I have tried increasing the topology estimate factor but I still get the same error. Also tried changing the partitioning method since it seems like the simulation fails during partitioning. The simulation is able to run on another computer, it is only when I run it on the cluster that this occurs.
something is off with your physics-to-mesh connectivity. ADRS is the error. The array KZnBcp is being accessed beyond its bounds, either too low, or too high.
__________________
Note: I do not answer CFD questions by PM. CFD questions should be posted on the forum.
Opaque is offline   Reply With Quote

Old   July 4, 2023, 12:51
Default
  #6
New Member
 
Aspen
Join Date: Nov 2022
Location: Canada
Posts: 11
Rep Power: 3
imagineaspen is on a distinguished road
Quote:
Originally Posted by ghorrocks View Post
Then it is possibel that you are running out of memory in the partitioner. Have a look at the memory settings in the partitioner, also try the large problem partitioner.

Alternately, you could partition this on a large computer with lots of memory and write the partitions to a partition file. Then you can use that partition file on this cluster so you do not have to do the partitioning at all on this cluster.
Thanks Glenn. I tried modifying the partitioner memory settings but it all resulted in the same error. I also tried partitioning on a different computer and inputting that partitioning file to the cluster but then I get a memory error immediately when the solver starts. I think I will have to bring this issue my school’s technical support; the simulation is not large enough that memory should be an issue.

One thing I forgot to put in the initial post is that I ran a steady state simulation using the same mesh to generate an initial values file before running this simulation. The steady state simulation runs properly on the cluster, it is only the transient simulation that produces this error. My set-up includes a rotating domain which seems to be part of the issue; I was testing simpler geometries to see if I could replicate the error and I am able to run simulations with large numbers of elements but as soon as I have a rotating domain I start getting memory errors. I think this must be why the steady state simulation is able to run but the transient does not.
imagineaspen is offline   Reply With Quote

Old   July 4, 2023, 12:53
Default
  #7
New Member
 
Aspen
Join Date: Nov 2022
Location: Canada
Posts: 11
Rep Power: 3
imagineaspen is on a distinguished road
Quote:
Originally Posted by Opaque View Post
something is off with your physics-to-mesh connectivity. ADRS is the error. The array KZnBcp is being accessed beyond its bounds, either too low, or too high.
Thanks Opaque. Could you clarify what you mean by physics-to-mesh connectivity? Does this imply that it is a problem with my mesh itself or with my CFX-Pre set-up?
imagineaspen is offline   Reply With Quote

Old   July 4, 2023, 20:14
Default
  #8
Super Moderator
 
Glenn Horrocks
Join Date: Mar 2009
Location: Sydney, Australia
Posts: 17,703
Rep Power: 143
ghorrocks is just really niceghorrocks is just really niceghorrocks is just really niceghorrocks is just really nice
Opaque's comment is suggesting that some form of internal CFX error is occurring, and it is manifesting as a failure to find some data initially, but as a memory error later. I suspect this is correct, which would explain why my suggestions on the partitioner settings did not help.

In this case I would have a look at your setup and check it is OK. In particular look to see if you have missed something - is there an essential variable you have not defined; such as a material property or definition of the rotating domain.
Opaque likes this.
__________________
Note: I do not answer CFD questions by PM. CFD questions should be posted on the forum.
ghorrocks is offline   Reply With Quote

Old   July 6, 2023, 10:15
Default
  #9
New Member
 
Aspen
Join Date: Nov 2022
Location: Canada
Posts: 11
Rep Power: 3
imagineaspen is on a distinguished road
Thanks Glenn and Opaque. I will take another look at my set-up.
imagineaspen is offline   Reply With Quote

Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
pitchAxis gets reversed sign when running on HPC cluster GeertL OpenFOAM Running, Solving & CFD 0 August 17, 2020 06:19
Running OpenFoam in cluster chandra shekhar pant OpenFOAM Running, Solving & CFD 7 March 11, 2020 05:32
Problem with running customized solver parallel on cluster shinri1217 OpenFOAM Running, Solving & CFD 0 June 27, 2018 13:26
[Other] Basic questions about OpenFOAM cluster running and installing Fauster OpenFOAM Installation 0 May 25, 2018 15:00
Openfoam running error in parallel in Cluster sibo OpenFOAM 2 February 25, 2017 13:26


All times are GMT -4. The time now is 05:04.