Loading...
Please wait, while we are loading the content...
Similar Documents
Deploying genomics workflows on high performance computing (HPC) platforms: storage, memory, and compute considerations
| Content Provider | bioRxiv |
|---|---|
| Author | Marissa, E. Powers Mannthey, Keith Sebastian, Priyanka Adsule, Snehal Kiernan, Elizabeth Jonathan, T. Smith Way, Jessica Shifaw, Beri Roazen, David Narvaez, Paolo |
| Copyright Year | 2022 |
| Abstract | Abstract Next Generation Sequencing (NGS) workloads largely consist of pipelines of tasks with heterogeneous compute, memory, and storage requirements. Identifying the optimal system configuration has historically required expertise in both system architecture and bioinformatics. This paper outlines infrastructure recommendations for one commonly used genomics workload based on extensive benchmarking and profiling, along with recommendations on how to tune genomics workflows for high performance computing (HPC) infrastructure. The demonstrated methodology and learnings can be extended for other genomics workloads and for other infrastructures such as the cloud. |
| Related Links | https://www.biorxiv.org/content/biorxiv/early/2022/04/08/2022.04.05.485833.full.pdf |
| DOI | 10.1101/2022.04.05.485833 |
| Language | English |
| Publisher | Cold Spring Harbor Laboratory |
| Publisher Date | 2022-04-08 |
| Access Restriction | Open |
| Rights License | Creative Commons License (Attribution-NoDerivs 4.0 International), CC BY-ND 4.0 |
| Subject Keyword | Bioinformatics |
| Content Type | Text |
| Resource Type | Preprint |
| Subject | Biochemistry, Genetics and Molecular Biology |