CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > General Forums > Hardware

EPYC 9004 with large L3 cache or 9005 for ~10M FEM mesh?

Register Blogs Members List Search Today's Posts Mark Forums Read

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
Old   October 9, 2025, 05:12
Question EPYC 9004 with large L3 cache or 9005 for ~10M FEM mesh?
  #1
New Member
 
Freewill1's Avatar
 
Join Date: Aug 2014
Posts: 19
Rep Power: 13
Freewill1 is on a distinguished road
Hi, all!

I am looking to buy an EPYC (9004 or 9005) workstation for CFD-like modelling, using a FEM mantle convection code, ASPECT (details: https://aspect-documentation.readthedocs.io/en/latest/), which supports MPI-based parallelism quite well.

My budget is 5,000–10,000 USD.

Here is my typical situation:
  1. 1000x1000 2-D FEM mesh (i.e., 1M mesh);
  2. 200x200x200 3-D FEM mesh (i.e., 8M mesh);
  3. Focus primarily on Stokes flow (or creeping flow, solving the simplified Navier-Stokes equations) or Rayleigh-Benard convection (i.e., thermal convection);
  4. Nonlinear iterations are required and can significantly slow the modeling due to nonlinear rheology of the mantle rock as it convects (20x time required compared with a linear, isoviscous case)

To maximize the performance to the fullest extent possible for the given budget, I'm not sure about the following:
  • Dual-socket vs. single-socket: Does a dual-socket system run twice as fast as a single-socket one? Is there a bottleneck in inter-socket communication for the 9004 or 9005 series CPUs? A dual-socket system costs roughly twice as much as a single-socket one, and I don't want to waste the budget.
  • How many cores per memory channel is most cost-effect? For both a dual-socket and a single-socket machine, how many CPU cores can 12 or 24 memory channels sustain—32, 64, or more? I remember, as a rule of thumb, each memory channel can saturate only 2–3 cores for the older 9001/9002/9003 EPYC CPUs (3200 MT/s, DDR4). Not sure whether the newer 9004 or 9005 can sustain more cores due to higher memory frequency (4800/6400 MT/s, DDR5) and other technological improvements.
  • 9004 with 3D V-Cache vs. 9005: Does a 9004 CPU with 4800 MT/s RAM support and a large 3D L3 cache (e.g., 768 MB for EPYC 9384X) perform better than a 9005 CPU with 6400 MT/s RAM support? My ~1M FEM mesh is not large and can be entirely stored in the L3 cache; would each run thus speed up?
Here are my tentative single-socket hardware configurations:

Option-1:
  1. CPU: EPYC 9384X, 32 core
  2. RAM: 12 × 16 GB = 192 GB, 4800 MT/s
  3. Motherboard: Supermicro H14SSL-NT (with 6400 MT/s RAM and 10 Gbps LAN supports)
  4. ...

Option-2:
  1. CPU: EPYC 9355, 32 core
  2. RAM: 12 × 16 GB = 192 GB, 6400 MT/s
  3. Motherboard: Supermicro H14SSL-NT (with 6400 MT/s RAM and 10 Gbps LAN supports)
  4. ...

Any suggestion for cost-effect configuration(s)?

Best,

Freewill
Attached Images
File Type: jpg 9004.jpg (129.8 KB, 15 views)
File Type: jpg 9005.jpg (133.0 KB, 7 views)

Last edited by Freewill1; October 9, 2025 at 22:09.
Freewill1 is offline   Reply With Quote

Old   November 24, 2025, 16:47
Default
  #2
Senior Member
 
Will Kernkamp
Join Date: Jun 2014
Posts: 388
Rep Power: 15
wkernkamp is on a distinguished road
I think both are good choices for CFD single socket configurations. With CFD it is all about bandwidth. However, the 3D cache reduces cache misses and, therefore, the need for bandwidth. My guess would be that the fourth gen X processor will outperform the 5th gen non-X even though the latter has 33% more bandwidth. To talk about cost effectiveness, I would need to know what the price is for each of these processors. (I am assuming about equal now).


If later on you want to increase your capacity with a dual socket motherboard. Your 5th generation selection is a P part, so can only run on single socket active config.
wkernkamp is offline   Reply With Quote

Old   December 4, 2025, 09:40
Default EPYC 9004 with large L3 cache or 9005 for ~10M FEM mesh?
  #3
New Member
 
Ava Reed
Join Date: Oct 2025
Location: USA
Posts: 2
Rep Power: 0
avareed is on a distinguished road
For a ~10 million element FEM mesh, both the EPYC 9004 with large L3 cache and the 9005 are strong options, but it depends on your workload. If your solver is highly cache-sensitive, the 9004 with 3D V-Cache can offer excellent performance. However, for memory-bound workloads that benefit from higher memory bandwidth, more cores, and newer architecture improvements, the 9005 is likely the better choice. For most large-scale FEM simulations, I’d lean toward the 9005 for a more balanced and future-proof setup.
avareed is offline   Reply With Quote

Reply

Tags
aspect, epyc, fem, l3 cache

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
How to perform larger deformations while mantaining good mesh with adjoint solver FormulaStudentFEB STAR-CCM+ 0 October 13, 2023 12:27
flow over cylinder in openFoam saeed jamshidi OpenFOAM Pre-Processing 3 August 11, 2023 16:16
[snappyHexMesh] SnappyHexMesh/splitMeshRegion : region1 in zone "-1" GuiMagyar OpenFOAM Meshing & Mesh Conversion 3 August 4, 2023 13:38
Gambit problems Althea FLUENT 22 January 4, 2017 04:19
Icemcfd 11: Loss of mesh from surface mesh option? Joe CFX 2 March 26, 2007 19:10


All times are GMT -4. The time now is 17:07.