Code hangs while writing volume solution
2 Attachment(s)
Hi,
SU^2 is hanging up while writing the volume solution using version 2.0.2. (The surface solution seems to be written OK) This happens at the end of the run or if a WRT_SOL_FREQ is specified. I don't have this problem with the ONERA M6 case in the TestCases, but I can't see any major differences related to output options in the configuration files. This case does have around 3 million elements, so that might be a factor. I've attached the configuration and output files. Thanks, Greg |
Hi Greg,
Thanks for the feedback - we are aware of the limitations in version 2.0.2. This was a developer release, and the volume output routines are still being updated. We will be releasing another developer version next week, and this version will be able to handle larger cases due to further improvements in the I/O routines. We are working to improve the parallel performance of the code in general (both memory & scalability), and we will continue to release these updates as we go over the next several months. When you get the chance, please try your cases with the updated versions. Cheers, Tom |
Restart File...
4 Attachment(s)
This may or may not be an associated problem, but I've noticed in a recent run that a restart_flow.dat wasn't created.
I had modified my config file so that WRITE_SOL_FREQ = 999999 and I could be sure that the flow solution wouldn't be written until the solution had converged (from a previous run I expected this around 7000 iterations). I set it up this way because I am running 2.0.2 and knew the flow solution process would take a while. Is there any relationship between WRITE_SOL_FREQ and the frequency/sequence of writing the restart file?? ...I didn't think there would be. On a subsequent run I changed my convergence criteria to force it to write a flow solution and the restart file looks to have been created at the end of the process. I can't tell for sure when it was generated, but the last I looked yesterday evening it had not been. Particulars: - about 6.3 million cells in my mesh - using 72 processors on a Cray XT5 See attachments for details: Run 1 - first run config and output files...NO RESTART FILE WAS CREATED Run 2 - second run config and output files...restart file was created Thanks! Dave PS: I do understand 2.0.2 is in development, but I haven't had any luck with 2.0 due to memory issues. Thanks for your work thus far! |
Follow-up
I've done a few subsequent runs and am finding the same thing; the restart file is not being generated until the run completes!
This could lead to time-wastage if something goes wrong with the run, would you let me know when the restart file should be generated, and how often it is updated? Thanks! Quote:
|
Hi David (& Greg),
As the output routines have been changing quite a bit recently, I would recommend trying the new developer release (V2.0.3) that will be posted later today. Note that further updates to the output routines are still in the works, but they should be more stable with this release. As for the restart files, they will indeed be generated according to the frequency specified with the WRT_SOL_FREQ option. Also, in V2.0.3, we have a new module named SU2_SOL which can generate a solution file given an SU2 mesh and restart file as input. This will be called automatically when using the parallel_computation.py script (only restarts are written which are merged into a solution at the end of the simulation). Lastly, if you want to completely turn off volume and surface solution writing during a simulation, please try the following options: WRT_VOL_SOL= NO WRT_SRF_SOL= NO which will turn off all volume and surface solution file writing while still writing the restart files. Thanks for trying SU2, and look for more updates to the output as we go. Hope this helps! Tom |
Ahhhh!
Thanks, I'm going to add those points to my "master config" file; it is good to be able to do preliminary runs without writing the flow/surface solutions.
I'm compiling 2.0.3 (rev1) now, and will follow-up here. (but if I forget to, no news is good news!) Cheers! Dave |
Volume Solution
I'm editing this comment as new data is available. I have gotten a flow.dat file written, so I've answered my own question: it'll only be written once a converged restart file is available.
The process from start to finish to write the flow.dat took between 2-8 hours (I can't figure out how to determine more accurately - I was asleep). My mesh is about 6.4 million cells, should it take that long?? ...I should add that this was using 12 processors. Original Comment: Quote:
...I don't understand your point above: Quote:
...I feel like I'm close, but missing something. Thanks! Dave |
Using 2.0.3:
I am also not getting any flow files created. The partition surface files are created, but no partition flow files. No combined surface flow file is created and then the SU2_SOL code hangs while it is writing the flow file (or that is what the output is indicating). |
Volume Solution
Ok, I really abusing electrons here, but didn't want to lose any of the points...
I re-ran a group of simulations and actually timed it from "Writing flow solution" to "Exit Success". It was an hour for my 1 degree AoA case. That still seems a bit long to me, is that unusual? (that is now the only question I'd like addressed, my questions below are Overcome By Events):) Quote:
|
Hi guys,
Can you please check the memory usage during file writing? We would like to know whether it is a memory issue, i.e. there is a leak or your machine is maxing out the memory and it is causing long delays in file writing, or if there is an actual issue in the new output routines. One of the motivations for using the new SU2_SOL module is that it has much less memory overhead than the solver. Thanks, Tom |
Hmmm...
Quote:
Dave PS: This was my output after the iterations were complete: Code:
------------------------- Exit Success (SU2_CFD) ------------------------ |
Hi Dave,
One more thing you can try... Another advantage of SU2_SOL is that new solution files can always be generated on the fly using just the restart and config files. This can be done on any machine with any number of cores. It is even handy for converting between different solution file types, if need be, without rerunning expensive simulations. For instance, if it seems like you are having memory problems on one machine due to a large mesh, you could move the restart and config files to a different machine with more GB per core (or possibly a regular workstation with a large amount of RAM) and simply run SU2_SOL (on any number of cores) to generate your solution file. Hope this helps! Tom |
Hi guys,
Just an update: with V2.0.5, we made major improvements to file IO, including adjustments to the algorithms for merging solution files. You should see much better performance for large cases. Cheers, Tom |
All times are GMT -4. The time now is 02:07. |