Setup RDMA in Azure

October 4, 2016, 03:51
Setup RDMA in Azure
#1
New Member
 
Kees Kuijt
Join Date: Oct 2016
Posts: 3
Hi,

I tried to set up an RDMA cluster in Azure using 2 A8 machines with SuSE 12 (default templates from Azure). The network interfaces are available, but when I try to run a pingpong test between them, the machines are not able to see each other.

Both machines are in the same Resource Group, and over the Ethernet connection on eth0 the machines can ping each other.
Also, when I run some basic diagnostics with ibv_dev and other tools, everything looks OK and there is 1 port available. What I find weird is that the port is reported as connected via Ethernet and not via InfiniBand.

So my question is: what do I need to do to at least get a pingpong test running between the two machines?

March 20, 2017, 07:29
SLES 12 & RDMA on azure
#2
New Member
 
Adrian Rohner
Join Date: Mar 2017
Posts: 3
Hi Cheezum,

Make sure you use SLES 12 SP1 for HPC. The RDMA network still seems to have some bugs, but after a reboot of the node all devices are loaded correctly. In addition, Intel MPI is already installed on this template. You may follow these instructions:
https://docs.microsoft.com/en-us/azu...c-rdma-cluster

Good Luck &
best wishes
Adrian

March 20, 2017, 15:52
Solution found!
#3
New Member
 
Kees Kuijt
Join Date: Oct 2016
Posts: 3
Hi Adrian,

Thanks for your response.
I totally forgot about this thread.
In December I was in contact with some HPC guys from Microsoft, and one of them had a background with ANSYS.

Guess what ....

There's a bug in ANSYS 17.2 (which we were using at that time): hostnames longer than 12 characters don't work. It wasn't a problem in 16 and it isn't in 18.0.
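
For anyone who wants to check their nodes for this before launching a job, here is a minimal sketch. The function name and the constant are made up for illustration, the 12-character limit is simply the figure from the bug described above, and I am assuming the length that matters is that of the name exactly as you pass it to the solver.

```python
# Hypothetical helper: flags hostnames longer than the limit reported for ANSYS 17.2.
MAX_HOSTNAME_LEN = 12  # figure taken from this thread, not from official ANSYS documentation


def too_long_hostnames(hostnames, limit=MAX_HOSTNAME_LEN):
    """Return the hostnames whose length exceeds `limit` characters."""
    # Assumption: the name is checked exactly as given (no domain stripping).
    return [name for name in hostnames if len(name) > limit]
```

You could feed it the output of `socket.gethostname()` on each node, or the node names from your machine file, and rename any VM it reports.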

That was one part of it.
The other one was the distro:
I was already using SLES 12.1 with HPC and had no success with that distro.
Because the tech guys from Microsoft used examples with the CentOS 7.1 distro with HPC, I decided to give that a shot.
And voila... Out of the box, it works. The only extra step is setting up the variables in .bashrc so the MPI variables are loaded when a user logs in.
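
As a rough sketch of that .bashrc step: the helper below appends export lines to a file only if they are not already there. The function name is hypothetical, and the three Intel MPI settings are an assumption on my part (they are the values the Azure RDMA documentation listed for Intel MPI around that time, not something spelled out in this thread), so check them against your own image.

```python
from pathlib import Path

# Assumed values, not taken from this thread: Intel MPI settings that the
# Azure RDMA documentation of that period listed for A8/A9 instances.
MPI_EXPORTS = [
    "export I_MPI_FABRICS=shm:dapl",
    "export I_MPI_DAPL_PROVIDER=ofa-v2-ib0",
    "export I_MPI_DYNAMIC_CONNECTION=0",
]


def ensure_exports(bashrc, exports=MPI_EXPORTS):
    """Append any export lines missing from `bashrc`; return the lines added."""
    path = Path(bashrc)
    text = path.read_text() if path.exists() else ""
    existing = text.splitlines()
    missing = [line for line in exports if line not in existing]
    if missing:
        # Start on a fresh line if the file does not end with a newline.
        prefix = "" if (not text or text.endswith("\n")) else "\n"
        with path.open("a") as fh:
            fh.write(prefix + "\n".join(missing) + "\n")
    return missing
```

Calling `ensure_exports(Path.home() / ".bashrc")` for each user would do it; running it a second time adds nothing, so it is safe to put in a provisioning script.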

Regarding ANSYS, the guys from MS have some great basic scripts available on GitHub to set up a basic Linux-oriented cluster.
You can find them at https://github.com/tanewill/5clickTe...awANSYSCluster
All times are GMT -4.