Old Dominion University, Norfolk, Virginia

HPC Computational Scientist (FP021A)

Information Technology Services (ITS) at Old Dominion University is accepting applications for the HPC Computational Scientist (full-time, 12-month) position. This position will provide expert-level support in the design, implementation, and support of large-scale high-performance computational and research computing cloud environments. The position will also work closely with faculty and researchers to help transition scientific and engineering problems to high-performance computing solutions, and will assist researchers with proposal preparation to secure funding from regional and national funding sources.

Required Education: Master's degree in computer science, engineering, math, or related technical discipline.

Required Qualifications: Considerable experience with the installation/setup, upgrades, configuration changes, and other requirements for implementing and maintaining HPC systems. Considerable experience with UNIX or Linux system administration in a medium-sized computing environment. In-depth knowledge of Red Hat or CentOS, including patch management and the use of package management tools. Experience working with MPI, OpenMP, CUDA, and OpenACC. Some experience with cluster management tools (such as BCM, ROCKS). Experience with job scheduling and resource management systems (such as PBS, SGE, and SLURM). Some experience with HPC network architectures such as InfiniBand. Some experience with network-attached storage. Some experience writing programs and server-side scripts that aid in the operation and automation of administrative tasks using various shells and scripting languages. Familiarity with scientific codes and libraries. Familiarity with using compilers (Intel, GCC, PGI) for Fortran and C programming languages. Considerable knowledge of HPC hardware and an understanding of modern computer architecture. Considerable knowledge of HPC OS distributions. Considerable knowledge of HPC network hardware and software stack (e.g. InfiniBand, Ethernet). Considerable knowledge of HPC resource management and job scheduling systems (e.g. SGE, SLURM, PBS, Maui). Considerable knowledge of system interoperability, storage, networks, servers, and other system type attachments. Considerable knowledge of parallel and distributed programming: MPI, OpenMP, OpenCL, threading libraries, RPC. Demonstrated ability to interact with professional faculty or research staff to evaluate needs, make recommendations, and provide technical consulting. Demonstrated ability to work independently and manage time efficiently.

Preferred Education: Doctorate degree in computer science, engineering, math, or related technical discipline.

Preferred Qualifications: Knowledge of distributed file systems such as Lustre, BeeGFS, GPFS, and PVFS. Ability to compile, build, install, and deploy open-source software and/or scientific applications. Considerable experience maintaining and monitoring the security of HPC systems and servers. Experience with network hardware such as InfiniBand. Experience with developing sustainable budget models for research computing. Experience with programming and scripting. Experience assisting users with code development. Experience in configuring and maintaining a code development environment. Experience with installing and tuning scientific application software. Experience with collaborating on research grants. Demonstrated skill and knowledge in designing and implementing large-scale HPC technology. Demonstrated skill developing and implementing innovative strategies for HPC expansion and lifecycle management. Knowledge of public and private cloud platforms such as AWS and Azure. Knowledge of containerized software deployment methods such as Docker and Singularity. Familiarity with scientific codes and libraries. Familiarity with using compilers (Intel, GCC, PGI) for Fortran and C programming languages.

Review Date: The initial application review date for this position is June 21, 2021. The position will remain open until filled.

