HPC Overview
The High Performance Computing cluster (HPC) at FSU is the core of our computational infrastructure. With more than 16,000 cores and over 600 compute nodes, the HPC provides a powerful and scalable computing platform for large, multithreaded, and distributed parallel computations.
The HPC is a tightly integrated system of uniform servers connected by a fast InfiniBand data network that is designed for long-running, compute-intensive programs. This uniformity and integration makes the system extremely well suited for processing workloads that would not scale on regular computers because of memory requirements or CPU limitations.
What is it used for?#
The HPC is used for long-running jobs that require a large amount of compute resources (CPUs and memory). In order to allow many users to run programs at the same time, HPC systems make use of batch, non-interactive jobs that are scheduled on a large but finite amount of resources.
In a batch system, users describe the workflow of their program and submit it to a queue (or Slurm Account/partition, as it is implemented at FSU), it runs independently of any user input until it finishes. Most jobs can be monitored but not interacted with.
Compute jobs on the HPC can operate in parallel using popular frameworks like MVAPICH or OpenMPI.
Many users write and/or compile their own software to run on the HPC, for which we provide a number of tools and libraries to support. Other users can run jobs using general-purpose applications, such as MATLAB or Python.
Visit our software catalog page to see a list of tools, libraries, and applications available on the HPC.
Who has access?#
Access to the HPC is available for all FSU faculty and students/staff with a faculty sponsor, as well as collaborators not affiliated with FSU. To obtain priority access to our resources, many faculty members have made investments in the HPC by purchasing dedicated resources.
How can my research group purchase dedicated resources on the HPC?#
Information about how to purchase dedicated queues and priority access for your research group is available in the ITS Service Catalog.
Popular technologies#
The HPC can run many types of jobs, which makes it appropriate for all areas of scientific research. Some popular platforms and technologies include the following:
- OpenMPI: OpenMPI is one of the several implementations of the Message Passing Interface (MPI) model of parallel computing for distributed systems. OpenMPI is an open-source implementation of this model with a wide range of powerful features
- Python: The HPC provides a robust implementation of Python, which is increasingly used in computational science. We provide support for a large number of Python utilities for compiling Python code and working with Python visually
- GPUs and CUDA: A portion of the HPC cluster provides GPU resources. CUDA is available for running GPU optimized code
- MATLAB: The HPC provides support for distributed MATLAB jobs. In addition, you can compile MATLAB to C and run that on the HPC for higher performance and fewer license restrictions
How to connect#
If you want to try the HPC, head over to our connecting and using page.