
Cluster and PVFS2

#1  Charles (Guest), January 6, 2006, 07:46
A cluster has lots of unused distributed disk space on the nodes. Has anybody used PVFS2 to pool that space for storage? I'm looking for advice and warnings about pitfalls.


#2  Adrin Gharakhani (Guest), January 6, 2006, 16:21
Ideally, when you perform computations on a cluster, the data should be distributed as well (each node should dump its data locally). So if there is "waste", perhaps the program logic needs to change first!

Adrin Gharakhani

#3  Charles (Guest), January 6, 2006, 16:31
Well, that's not particularly relevant when the program runs fully in memory; in that case each node does not need to dump any data locally. But each cluster node has a fair amount of spare disk space, which would be a nice place to store output files.


#4  Adrin Gharakhani (Guest), January 6, 2006, 16:40
I don't know the type of problems you're running or the objective of the simulations, but data is dumped/stored regardless... If you simulate unsteady flow problems and want to examine the time evolution of various parameters, you end up requiring gigabytes to terabytes of disk space (even when the program itself is small and runs in memory), in which case there is not only no wasted disk space but a demand for extra disks... Anyway, I digress from your original question.

Adrin Gharakhani
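
[Editor's sketch] To put rough numbers on the gigabytes-to-terabytes point above, here is a back-of-the-envelope estimate in Python. The mesh size, variable count, and number of saved time steps are hypothetical values chosen only for illustration, and the function names are made up for this example; it assumes one 8-byte value per variable per cell and ignores file-format overhead.

```python
def snapshot_bytes(cells, variables, bytes_per_value=8):
    """Size of one saved time step: one value per variable per cell."""
    return cells * variables * bytes_per_value


def run_bytes(cells, variables, snapshots, bytes_per_value=8):
    """Total output of a run that saves `snapshots` time steps."""
    return snapshot_bytes(cells, variables, bytes_per_value) * snapshots


# Hypothetical unsteady case: 10 million cells, 5 flow variables,
# 2000 saved time steps, double precision.
total = run_bytes(cells=10_000_000, variables=5, snapshots=2000)
print(f"{total / 1e9:.0f} GB")  # prints "800 GB"
```

Even this modest hypothetical case lands close to a terabyte, which is the scale the post describes.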

#5  steve (Guest), January 18, 2006, 22:32
Parallel file systems generally work very well when you have one or more dedicated I/O servers to mount a big disk farm on many compute nodes. Doing what you want (combining disks from all over the place) will work, but it's very inefficient. Every compute node will also be tapped as an I/O server, with the result that your calculation will be interrupted randomly on different nodes and your scaling will fall off. It doesn't do I/O well, and it slows your calculation. Everyone I know who has tried it (with other systems, not PVFS) has given up.
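
[Editor's sketch] The scaling argument above can be illustrated with a toy model in Python. This is not a model of PVFS2 or of any real file system. It assumes a lock-step solver in which every node does the same amount of work per step and all nodes wait for the slowest one, and that each node is independently pulled away to serve I/O with some probability per step. The names `run_time`, `io_prob`, and `io_delay`, and all the numbers, are hypothetical.

```python
import random


def run_time(nodes, steps, compute=1.0, io_prob=0.0, io_delay=0.5, seed=0):
    """Wall time of a lock-step run: each step lasts as long as its slowest node."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(steps):
        # A node that is interrupted to serve I/O finishes its step late.
        total += max(
            compute + (io_delay if rng.random() < io_prob else 0.0)
            for _ in range(nodes)
        )
    return total


steps = 1000
ideal = run_time(nodes=1, steps=steps)  # no I/O interruptions at all
for n in (4, 16, 64):
    t = run_time(nodes=n, steps=steps, io_prob=0.05)
    print(f"{n:3d} nodes: efficiency {ideal / t:.2f}")
```

Under these assumptions the chance that at least one node is interrupted in a given step is 1 - (1 - io_prob)^nodes, so the more nodes double as I/O servers, the more often the whole calculation stalls, and efficiency drops as the cluster grows.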

All times are GMT -4.