Data Science

January 19, 2024

A catch-all term for analyzing datasets in pursuit of a specific goal. As “big data” has become an established concept over the past decade, practices have been developed for meaningfully exploring very large datasets computationally. While many parts of data science fall outside of HPC, many others rely on the massive computing power of HPC to carry out data analytics tasks. Depending on the workload, a data science job can be either MPI-based or embarrassingly parallel.
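To make the embarrassingly parallel case concrete, here is a minimal sketch in Python. It assumes a dataset that has already been split into independent chunks. The function names (`summarize`, `summarize_all`, `combine`) are hypothetical and chosen only for illustration. Threads are used only to keep the sketch self-contained; on an HPC system the same pattern would more typically be spread across separate processes or job-array tasks.

```python
from concurrent.futures import ThreadPoolExecutor
from statistics import mean


def summarize(chunk):
    """Summarize one chunk without looking at any other chunk."""
    return {"n": len(chunk), "mean": mean(chunk)}


def summarize_all(chunks, workers=4):
    # Each chunk is handled independently, with no communication
    # between tasks: this is what makes the workload
    # embarrassingly parallel.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(summarize, chunks))


def combine(summaries):
    # Only this final step needs the results of every task.
    total = sum(s["n"] for s in summaries)
    return sum(s["n"] * s["mean"] for s in summaries) / total


if __name__ == "__main__":
    chunks = [[1, 2, 3], [4, 5], [6]]
    summaries = summarize_all(chunks)
    print(combine(summaries))
```

By contrast, an MPI-style workload is one in which the tasks must exchange data with one another while the computation is still running, so it cannot be split into fully independent pieces like this.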