TERATEC Forum, Workshop 6
Scalability and performance of a general-purpose fluid dynamics solver on a low-power cluster: new perspectives on a combined CUDA-ARM architecture

With the arrival of 64-bit ARM, new "real" host platforms, and HPC H2020 projects about to start, we had the opportunity to study and get hands-on experience with one of the first of these architectures in a real cluster connected via FDR InfiniBand and coupled with GPUs. To test its real capabilities in the engineering field, and because we are involved in many CFD and HPC projects, we performed a complete port of a CFD code whose computational engine runs on the GPU, in order to demonstrate scalability even on low-power CPUs. The CFD code is Sailfish, a free computational fluid dynamics solver based on the lattice Boltzmann method and optimized for modern multi-core systems, especially GPUs (graphics processing units).

The hardware solutions used are based on ARMv8 SoCs plus Nvidia GPUs, interconnected with an FDR InfiniBand fabric. In detail, the first machine has a single APM X-Gene 1 SoC with 8 ARMv8 cores at 2.4 GHz and 16 PCIe Gen3 lanes: 8 connected to the GPU and 8 to the InfiniBand card. A four-node cluster was used as the test environment. The second system is a single node based on the Cavium ThunderX SoC, with 48 ARMv8 cores. This article presents the work done and the performance data obtained.
© Ter@tec - All rights reserved