CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > General Forums > Main CFD Forum

precision issues in multi-scale simulations

Register Blogs Community New Posts Updated Threads Search

Like Tree7Likes
  • 1 Post By LuckyTran
  • 1 Post By andy_
  • 1 Post By LuckyTran
  • 1 Post By FMDenaro
  • 1 Post By andy_
  • 1 Post By LuckyTran
  • 1 Post By sbaffini

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
Old   February 15, 2022, 06:43
Default precision issues in multi-scale simulations
  #1
Senior Member
 
Sayan Bhattacharjee
Join Date: Mar 2020
Posts: 495
Rep Power: 8
aerosayan is on a distinguished road
Taking a rocket's combustion chamber as an example : the scale of different fluid phenomenons happening inside the chamber are orders of magnitude different from each other. The whole combustion chamber + nozzle itself can be 1 m long, while the combustion and viscous effects are extremely small.

My question is, how do we represent this whole domain with a standard 64 bit floating point number? IIRC, In some astrophysics codes the scientists just use 128 bit floating point numbers because the scale is too large. But can't we do anything better?

If we take a quadtree grid to represent the whole domain, we can subdivide as much as we want but the cells will be so small that the physical properties like position, cell area, cell volume, cell surface areas, etc. will be less than machine epsilon for a 64 bit floating point number.

Can't we logically scale the smaller cells up (and make them 100, 1000, 10000 times larger), so we can do our simulation and then "somehow" combine the data back to the real values?

I think we can do that, since electrons are significantly small and quantum chemistry codes scale them up so the simulation can use a 64 bit floating point number. I'm not 100% sure how they do it, but it involves some form of non-dimensionalization and different scaling factors with which they multiply their different parameters.


Can we do something like this in CFD too? I don't know if someone already did this.
aerosayan is offline   Reply With Quote

Old   February 15, 2022, 08:44
Default
  #2
Senior Member
 
Lucky
Join Date: Apr 2011
Location: Orlando, FL USA
Posts: 5,676
Rep Power: 66
LuckyTran has a spectacular aura aboutLuckyTran has a spectacular aura aboutLuckyTran has a spectacular aura about
Not withstanding that codes don't really have units and 1 m in any code is really just 1 with a flag that specifies the m, you can freely reinterpret 1 m as 1 banana or 1 monkey or 1 elephant.

Floating point numbers are s*b^e. You can shift the decimal point with e and even extreme cases shift the base as well. Binary is base 2 and hexadecimal base 16 but you could use just about any base. We use floating point numbers precisely for this reason. Numbers with fixed points are called fixed points.
aerosayan likes this.
LuckyTran is offline   Reply With Quote

Old   February 15, 2022, 09:44
Default
  #3
Senior Member
 
andy
Join Date: May 2009
Posts: 270
Rep Power: 18
andy_ is on a distinguished road
Quote:
Originally Posted by aerosayan View Post
Can we do something like this in CFD too?
Non-dimensionalisation of some form was normal for CFD codes in the first few decades when most codes used 32 bit floating point numbers rather than 64 bit due to strong memory constraints and to a lesser extent low computational speeds. With the relaxing of memory constraints and the growing width of memory buses 64 bit floating point numbers became the default and it tended to get less attention except for some special cases.

Getting a simulation code to perform well with low precision floating point numbers is typically a time consuming but fairly straightforward exercise. As well as non-dimensionalisation it can involve rewriting terms and possibly changing solution variables depending on what is leading to a difference of large numbers. If your code is not misbehaving for your current cases it may not be a worthwhile exercise but getting some idea of the approaches available possibly is for the future.
aerosayan likes this.
andy_ is offline   Reply With Quote

Old   February 15, 2022, 09:53
Default
  #4
Senior Member
 
Lucky
Join Date: Apr 2011
Location: Orlando, FL USA
Posts: 5,676
Rep Power: 66
LuckyTran has a spectacular aura aboutLuckyTran has a spectacular aura aboutLuckyTran has a spectacular aura about
Non-dimensionalization doesn't affect the range of scales you need to simulate. Your biggest scale is still your biggest and smallest is still your smallest and the ratio between them is the same.
aerosayan likes this.
LuckyTran is offline   Reply With Quote

Old   February 15, 2022, 12:15
Default
  #5
Senior Member
 
Sayan Bhattacharjee
Join Date: Mar 2020
Posts: 495
Rep Power: 8
aerosayan is on a distinguished road
Quote:
Originally Posted by LuckyTran View Post
Not withstanding that codes don't really have units and 1 m in any code is really just 1 with a flag that specifies the m, you can freely reinterpret 1 m as 1 banana or 1 monkey or 1 elephant.

Floating point numbers are s*b^e. You can shift the decimal point with e and even extreme cases shift the base as well. Binary is base 2 and hexadecimal base 16 but you could use just about any base. We use floating point numbers precisely for this reason. Numbers with fixed points are called fixed points.

The problem occurs when the simulation contains cells that are very large, and very small. How do we deal with it then?

If say, a few thousand cells' volume is less than the machine epsilon, I don't know if we can use the value in mathematical equations.

ex, if y is lower than machine epsilon,

x = y + z is probably going to be saturated to x = z or x = z + machine_epsilon

This might be a problem, right?

ex, if x,y,z represent point co-ordinates, and the equation saturates, then two points essentially start co-inciding with each other. That's a problem.

I don't have the answer to this.
I have to write some code and test this out.
aerosayan is offline   Reply With Quote

Old   February 15, 2022, 12:25
Default
  #6
Senior Member
 
Filippo Maria Denaro
Join Date: Jul 2010
Posts: 6,777
Rep Power: 71
FMDenaro has a spectacular aura aboutFMDenaro has a spectacular aura aboutFMDenaro has a spectacular aura about
Quote:
Originally Posted by aerosayan View Post
The problem occurs when the simulation contains cells that are very large, and very small. How do we deal with it then?

If say, a few thousand cells' volume is less than the machine epsilon, I don't know if we can use the value in mathematical equations.

ex, if y is lower than machine epsilon,

x = y + z is probably going to be saturated to x = z or x = z + machine_epsilon

This might be a problem, right?

ex, if x,y,z represent point co-ordinates, and the equation saturates, then two points essentially start co-inciding with each other. That's a problem.

I don't have the answer to this.
I have to write some code and test this out.





But are you talking about a case where a normalized max cell size is O(1) and the lowest size is less than 10^-16 ??
FMDenaro is offline   Reply With Quote

Old   February 15, 2022, 12:41
Default
  #7
Senior Member
 
Sayan Bhattacharjee
Join Date: Mar 2020
Posts: 495
Rep Power: 8
aerosayan is on a distinguished road
Quote:
Originally Posted by FMDenaro View Post
But are you talking about a case where a normalized max cell size is O(1) and the lowest size is less than 10^-16 ??

1/(2^64) is 5.42101086243e-20

That's a achievable if a quadtree is divided 64 times and the firrst cell is 1 unit in length. I hope my maths is correct.

Obviously the simulation won't work because the cell sizes are so different.

I gave a wrong example before and led to confusion. Instead of cell volumes, the issue will be present in the position co-ordinates.

If a point x1 is less than machine_epsilon distance from the point x2, then both of them will most likely coincide because we can't test for the difference or distance between them.

Such a situation might arise at the highest level of refinement. I'm guessing at level 51 or 52 of the quadtree where the root cell is 1 unit in length.

Not sure if this arises in normal CFD simulation cases, but astrophysics simulations and computational metallurgy simulations seem to span such large scales.
aerosayan is offline   Reply With Quote

Old   February 15, 2022, 12:47
Default
  #8
Senior Member
 
Filippo Maria Denaro
Join Date: Jul 2010
Posts: 6,777
Rep Power: 71
FMDenaro has a spectacular aura aboutFMDenaro has a spectacular aura aboutFMDenaro has a spectacular aura about
Quote:
Originally Posted by aerosayan View Post
1/(2^64) is 5.42101086243e-20

That's a achievable if a quadtree is divided 64 times and the firrst cell is 1 unit in length. I hope my maths is correct.

Obviously the simulation won't work because the cell sizes are so different.

I gave a wrong example before and led to confusion. Instead of cell volumes, the issue will be present in the position co-ordinates.

If a point x1 is less than machine_epsilon distance from the point x2, then both of them will most likely coincide because we can't test for the difference or distance between them.

Such a situation might arise at the highest level of refinement. I'm guessing at level 51 or 52 of the quadtree where the root cell is 1 unit in length.

Not sure if this arises in normal CFD simulation cases, but astrophysics simulations and computational metallurgy simulations seem to span such large scales.
To be honest, I don’t know specific flow problems where you have 20 order of magnitude between the largest and the lowest lenght scales! Actually, I wonder if you are talking about such small characteristic lenghts to be below the mean free path!
aerosayan likes this.
FMDenaro is offline   Reply With Quote

Old   February 15, 2022, 12:49
Default
  #9
Senior Member
 
andy
Join Date: May 2009
Posts: 270
Rep Power: 18
andy_ is on a distinguished road
Quote:
Originally Posted by aerosayan View Post
x = y + z is probably going to be saturated to x = z or x = z + machine_epsilon

This might be a problem, right?
Not normally since x is correct to machine accuracy. Problems tend to arise more from x = y - z when y and z are many orders of magnitude larger than x due to containing the same large value.

For your grid example if you stored cell dimensions rather than grid coordinates then you would avoid the issue (assuming I have understood the example).
aerosayan likes this.
andy_ is offline   Reply With Quote

Old   February 15, 2022, 12:54
Default
  #10
Senior Member
 
Sayan Bhattacharjee
Join Date: Mar 2020
Posts: 495
Rep Power: 8
aerosayan is on a distinguished road
Quote:
Originally Posted by FMDenaro View Post
To be honest, I don’t know specific flow problems where you have 20 order of magnitude between the largest and the lowest lenght scales! Actually, I wonder if you are talking about such small characteristic lenghts to be below the mean free path!
i guess i'm over-engineering again.
it won't probably be necessary for conventional cfd, but i might want to figure out if it's possible if the need arises.
aerosayan is offline   Reply With Quote

Old   February 15, 2022, 13:06
Default
  #11
Senior Member
 
Filippo Maria Denaro
Join Date: Jul 2010
Posts: 6,777
Rep Power: 71
FMDenaro has a spectacular aura aboutFMDenaro has a spectacular aura aboutFMDenaro has a spectacular aura about
Quote:
Originally Posted by aerosayan View Post
i guess i'm over-engineering again.
it won't probably be necessary for conventional cfd, but i might want to figure out if it's possible if the need arises.



Quadruple precision can be adopted but I think that would be more an issue in a specific field of study of round-off error propagation. For example in linear algebra computation. Something like that:
https://archive.siam.org/meetings/la...s/hhasegaw.pdf
FMDenaro is offline   Reply With Quote

Old   February 15, 2022, 13:50
Default
  #12
Senior Member
 
Lucky
Join Date: Apr 2011
Location: Orlando, FL USA
Posts: 5,676
Rep Power: 66
LuckyTran has a spectacular aura aboutLuckyTran has a spectacular aura aboutLuckyTran has a spectacular aura about
Quote:
Originally Posted by aerosayan View Post
The problem occurs when the simulation contains cells that are very large, and very small. How do we deal with it then?

If say, a few thousand cells' volume is less than the machine epsilon, I don't know if we can use the value in mathematical equations.

ex, if y is lower than machine epsilon,

x = y + z is probably going to be saturated to x = z or x = z + machine_epsilon

This might be a problem, right?

ex, if x,y,z represent point co-ordinates, and the equation saturates, then two points essentially start co-inciding with each other. That's a problem.

I don't have the answer to this.
I have to write some code and test this out.
Yes it will always be a problem as long as you have limited machine bits. The precision you have is limited by the number of bits you have and the range of precision that you need to cover is dictated by the largest and smallest numbers in you problem. You can always reallocate and optimize your precision for your problem. But then let's say tomorrow you want to solve a bigger problem with a greater range of scales, then you're in trouble again. In other words, let's say your machine can handle a simulation of the entire universe as it exists today. But then the universe expands. There's actually nothing you can do except go and find more memory. The only thing you can do is relax your precision constraint and do less precise arithmetic so you can at least cover the greater dynamic range but with less precision than before. That's the tradeoff. Using a larger base let's you cover a broad range of numbers but at the cost of machine_epsilon.

Quote:
Originally Posted by aerosayan View Post
1/(2^64)
If a point x1 is less than machine_epsilon distance from the point x2, then both of them will most likely coincide because we can't test for the difference or distance between them.
You could apply a gauge transformation and store and operate on only distance vectors between cells (the same way we do for gauge pressures) and then keep track of a reference location somewhere so that you can recover absolute x,y,z coordinates. This reduces the range of values you need to work with and allows you to stay precise. Now the reason we don't do this in CFD is that no CAD software works this way. CAD writes into very imprecise binary and ascii files of their raw x,y,z coordinates. Which is actually the first place where you'll encounter this problem: not in CFD, but in CAD. Good luck CAD'ing it in the first place.
aerosayan likes this.
LuckyTran is offline   Reply With Quote

Old   February 15, 2022, 16:44
Default
  #13
Senior Member
 
sbaffini's Avatar
 
Paolo Lampitella
Join Date: Mar 2009
Location: Italy
Posts: 2,152
Blog Entries: 29
Rep Power: 39
sbaffini will become famous soon enoughsbaffini will become famous soon enough
Send a message via Skype™ to sbaffini
I think you know that floating point numbers are such that, given a representable number x, the closest larger representable number x+dx is such that dx/x is around the precision of the given floating point (except close to the boundaries), so around 1e-15 for double precision.

So, the question is, can you give a practical example where, on earth, this range is not enough? We can put the earth in a box of side 1.3e7 meters. If centered in 0, it covers from -6.5e6 to 6.5e6 meters in each direction. At a coordinate of 6.5e6 the smallest representable dx with a double precision is approximately dx = 6.5e-9m, 6.5 nanometers!!!

Also, if you have x = 1e-9 and y = 1e-15, then z = x*y = 1e-24 without any problem at all.

So, in the end, as mentioned by LuckyTran, the problem is indeed with the CAD, as most of them uses single precision. In my experience, the CAD problem is not something you have to look at with paranoia, because 6 orders of magnitude are still a lot and you will certainly have a ring bell when dealing with something possibly troublesome.

EDIT: Of course, there will always be someone trying to set the origin of his system on the sun and pretending to work with millimiter sized objects on earth (spoiler alert, you can't with double precision), but we have Darwin Awards for them and they don't deserve using CFD.
aerosayan likes this.
sbaffini is offline   Reply With Quote

Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Issues with running simulations after recomposing and decomposing bobcieri OpenFOAM Running, Solving & CFD 1 June 3, 2020 09:32
Multi Region Mesh Issues - Star CCM+ Hammerson99 STAR-CCM+ 3 September 26, 2018 23:25
Modelled Length Scale in SAS-SST Simulation jego Main CFD Forum 0 January 16, 2016 06:45
Single precision better than double precision? 140raiders CFX 1 July 30, 2015 02:32
Compilation Order and Single Precision Issues gocarts OpenFOAM Bugs 1 October 14, 2009 17:19


All times are GMT -4. The time now is 05:15.