What is the best cluster file system to go with OF

December 27, 2005, 10:36   #1
sampaio (diablo80@web.de)
Member
Join Date: Mar 2009
Posts: 93
I have to install a new cluster that has a lot of hard-disk space (120 GB) on the nodes.

I understand that common configurations use an NFS server on a head (or master) node and avoid using disk space on the nodes. I have heard that performance would drop if my nodes wrote to their own disks instead of to the NFS server on the master node. Is that the case?

Are there any efficient alternatives to NFS with all data storage concentrated on the master node?

What would be the ideal file system / architecture to install in a new cluster to go with OF? NFS concentrated on the master node, or some other distributed FS like PVFS? Or anything else?

(Sorry if it is a dumb question, I am just starting to learn how to install a cluster)

Thanks a lot,
luiz

December 30, 2005, 04:39   #2
Mattijs Janssens (mattijs)
Senior Member
Join Date: Mar 2009
Posts: 1,419
I would say it is the other way around. If all the nodes have to write their results to the master node, that can become a bottleneck, so dumping the results to local disks would be faster.

However, I find it a small price to pay for the convenience, so personally I always use NFS. Compare the time spent in I/O with the time spent doing calculations and see whether it is worthwhile.
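
One way to make that comparison concrete is to time a write of roughly the size of one result dump, first on the NFS mount and then on a node-local disk. The following is only a minimal sketch: the helper name `time_write`, the 64 MB default size, and the two example paths are assumptions for illustration, not anything OpenFOAM provides.

```python
import os
import tempfile
import time


def time_write(directory, n_bytes=64 * 1024 * 1024, chunk=1024 * 1024):
    """Write n_bytes to a temporary file in `directory`, ask the OS to
    flush it to storage, and return the elapsed wall-clock seconds."""
    block = b"\0" * chunk
    start = time.perf_counter()
    # The temporary file is deleted automatically when the block exits.
    with tempfile.NamedTemporaryFile(dir=directory) as f:
        written = 0
        while written < n_bytes:
            size = min(chunk, n_bytes - written)
            f.write(block[:size])
            written += size
        f.flush()
        os.fsync(f.fileno())  # do not stop the clock at the page cache
        elapsed = time.perf_counter() - start
    return elapsed


if __name__ == "__main__":
    # Hypothetical mount points: replace with your NFS share and local scratch.
    for path in ("/home/shared", "/tmp"):
        if os.path.isdir(path):
            print(path, f"{time_write(path):.2f} s")
```

Run it on a compute node while the other nodes are writing too, since the bottleneck described above only shows up under concurrent load. If the measured time is small compared with the wall-clock time of the timesteps between two writes, the convenience of NFS probably wins.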

January 3, 2006, 08:32   #3
Eugene de Villiers (eugene)
Senior Member
Join Date: Mar 2009
Posts: 725
NFS I/O will only become a serious bottleneck if you are:
1) Dumping every 10 or fewer timesteps, for example when making animations.
2) Doing aero-acoustic calculations, which require wall pressure data for every timestep.

To alleviate this you can run with node-distributed data. I think there is a section in the manual describing how to do this. It hasn't been tested for a few years, though.
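
The effect of the write interval can be estimated with simple arithmetic. This is a rough sketch only: the function name `io_share` and the timings used (2 s per timestep, 20 s per dump over NFS) are made-up assumptions, so substitute values measured on your own cluster.

```python
def io_share(step_seconds, write_seconds, write_interval):
    """Fraction of wall-clock time spent writing when results are
    dumped once every `write_interval` timesteps."""
    compute_seconds = step_seconds * write_interval
    return write_seconds / (compute_seconds + write_seconds)


if __name__ == "__main__":
    # Hypothetical timings: 2 s per timestep, 20 s per result dump.
    for interval in (1, 10, 100, 1000):
        share = io_share(2.0, 20.0, interval)
        print(f"write every {interval:4d} steps: {share:.1%} of time in I/O")
```

With these assumed numbers, I/O takes half of the run time when writing every 10 steps and under a tenth when writing every 100, which is why frequent dumps are the case where local disks pay off.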

January 27, 2011, 03:29   #4
Markus Rehm (markusrehm)
Senior Member
Join Date: Mar 2009
Location: Erlangen (Germany)
Posts: 184
Hello,

I am reactivating this thread because we are thinking of introducing a parallel file system on our cluster. The nodes run openSUSE, each have one or two disks, and are connected via Gigabit Ethernet.

At the moment I am reading the available articles on parallel file systems, but there are so many (PVFS, GFS, OCFS, Panasas, pNFS, XtreemFS, ...) that I am a little confused.

Our main interest is to have a global file system for all nodes and to increase throughput mainly for pre- and post-processing. Redundancy is also an issue.

What is your experience?

Regards, Markus.

March 17, 2011, 04:10   #5
Eugene de Villiers (eugene)
Senior Member
Join Date: Mar 2009
Posts: 725
My experience is that, unless you are a sucker for punishment, going with a pre-configured or commercial PFS is by far the least painful option. If you are using it for post-processing, I would recommend a dedicated storage node based on something like the Panasas system. That is expensive, but the several surveys of parallel file systems in the wild that I have seen have not filled me with excitement. The only one I have come across that really seems to tick all the boxes is the Google FS, but unfortunately they aren't sharing. Hadoop has similar functionality, but its API is entirely Java-based (i.e. not POSIX compliant).