Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units
| Content Provider | Hyper Articles en Ligne (HAL) |
|---|---|
| Author | Obrecht, Christian; Kuznik, Frédéric; Tourancheau, Bernard; Roux, Jean-Jacques |
| Copyright Year | 2011 |
| Abstract | In this work, we investigate the global memory access mechanism on recent GPUs. For the purpose of this study, we created specific benchmark programs, which allowed us to explore the scheduling of global memory transactions. Thus, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate optimisation of regular data-parallel applications on GPUs. As an example, we finally describe our CUDA implementations of LBM flow solvers on which our model was able to estimate performance with less than 5% relative error. |
| Related Links | https://inria.hal.science/inria-00563159/file/obrecht11a.pdf |
| Volume Number | 6449 |
| Language | English |
| Publisher | HAL CCSD; Springer |
| Publisher Date | 2011-01-01 |
| Access Restriction | Open |
| Subject Keyword | GPU computing; CUDA; lattice Boltzmann method; CFD |
| Content Type | Text |
| Resource Type | Article |
| Subject | Physics and Astronomy; Computer Science |