High Performance Computing Infrastructure Engineer

Location US-IL-Evanston
Requisition Post Information* : Posted Date 1 month ago(10/23/2023 2:13 PM)
Job ID
2023-34069
# of Openings
1
Job Category
Information Technology
Level 2
NFP Support

 

NFP ULRI+ULSE

At UL, we know why we come to work.

We have an exciting opportunity for a High-Performance Computing Infrastructure Engineer at UL Research Institutes and UL Standards & Engagement, based in our Evanston, IL office. The High-Performance Computing Infrastructure Engineer is responsible for designing, implementing, managing, and optimizing the High-Performance Computing (HPC) infrastructure for the UL Research Institutes and UL Standards and Engagement organizations. Will work closely with cross-functional teams to ensure the smooth operation and continuous improvement of our HPC environment, enabling researchers and data scientists to efficiently execute complex simulations, data analysis, and scientific computations.

 

Underwriters Laboratories

At UL Research Institutes and UL Standards & Engagement (UL), we wake up every day with a common purpose: to make the world a safer, more secure, and sustainable place. Science is in our DNA; we are endlessly curious and passionate about seeking and speaking the truth. We take delight in knowing that our work makes a meaningful contribution to society, and we are proud that our culture is centered on integrity, collaboration, inclusion, and excellence. UL stands at the forefront of technological advancement, and we are continually challenged to find new ways to foster innovation and positive change. Satisfying? Yes. Exciting? Absolutely!

What you’ll learn & achieve:

As the High-Performance Computing Infrastructure Engineer, you will play a key role in the rapid growth of UL Operations as you:

  • Design, deploy, configure, and maintain HPC clusters, including hardware, software, and network components to optimize system architecture for performance, scalability, and reliability in collaboration with stakeholders to define HPC system requirements and specifications.
  • Monitor system performance, troubleshoot issues, and implement solutions to minimize downtime. Perform regular updates, patches, and security measures to keep the infrastructure up to date. Manage and maintain HPC systems, ensuring high availability and reliability.
  • Design and manage high-performance data storage solutions to support large-scale simulations and data analysis. Implement data backup, replication, and archival strategies to ensure data integrity and availability.
  • Conduct benchmarking and performance testing to evaluate and improve the efficiency of HPC applications. Identify performance bottlenecks and implement optimization strategies to enhance system performance.
  • Ensure data integrity and availability in the event of hardware failures or other disruptions. Implement and oversee data backup, replication, and disaster recovery strategies.
  • Provide training and workshops to users on utilizing the HPC infrastructure effectively. Create comprehensive documentation for system configurations, procedures, and troubleshooting steps.
  • Implement security measures and follow best practices to safeguard HPC systems and data to ensure compliance with relevant data protection and security regulations.

What makes you a great fit:

While no one candidate will embody every quality, the successful candidate will bring many of the following professional competencies and personal attributes:

  • Proficiency in Linux/Unix system administration and scripting languages (e.g., Bash, Python).
  • Familiarity with parallel computing frameworks (e.g., MPI, OpenMP) and optimization techniques.
  • Strong understanding of networking principles, storage technologies, and system performance tuning.
  • Demonstrated experience with job scheduling and resource management software (e.g., Slurm, Torque/PBS).
  • Knowledge of high-speed interconnects (InfiniBand, Omni-Path) and GPU acceleration is a plus.
  • Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
  • Strong communication skills and the ability to collaborate effectively with cross-functional teams.

Professional education and experience requirements for the role include:

  • Bachelor’s degree in computer science, computer engineering, or equivalent combination of education and experience. Master's degree preferred.
  • Minimum 6 years of experience in designing, deploying, and managing HPC clusters and infrastructure.

What you’ll experience working at UL:

  • Mission: For UL, corporate and social responsibility isn’t new. Making the world a safer, more secure, and sustainable place has been our business model for the last 128 years and is deeply ingrained in everything we do.
  • People: Ask any UL employee what they love most about working here, and you’ll almost always hear, “the people.” Going beyond what is possible is the standard at UL. We’re able to deliver the best because we employ the best.
  • Interesting work: Every day is different for us here as we eagerly anticipate the next innovation that our customers create. We’re inspired to take on the challenge that will transform how people live, work and play. And as a global company, in many roles, you will get international experience working with colleagues around the world.
  • Grow & achieve: We learn, work, and grow together with targeted development, reward, and recognition programs as well as our very own UL University that offers extensive training programs for employees at all stages, including a technical training track for applicable roles.
  • Total Rewards: All employees at UL Research Institutes and UL Standards & Engagement are eligible for bonus compensation. We offer comprehensive medical, dental, vision, and life insurance plans. a generous 401k matching structure of up to 5% of eligible pay. Additionally, we invest an additional 4% into your retirement saving fund after your first year of continuous employment. Depending on your role, you can work with your manager on flexible working arrangements. We also provide employees with paid time off including vacation, holiday, sick and volunteer time off.

Learn More:

UL Research Institutes and UL Standards & Engagement are nonprofit organizations dedicated to advancing safety science research through the discovery and application of scientific knowledge. We conduct rigorous independent research and analyze safety data, convene experts worldwide to address risks, share knowledge through safety education and public outreach initiatives, and develop standards to guide safe commercialization of evolving technologies. We foster communities of safety, from grassroots initiatives for neighborhoods to summits of world leaders. Our organization employs collaborative and scientific approaches with partners and stakeholders to drive innovation and progress toward improving safety, security, and sustainability, ultimately enhancing societal well-being.

 

Our wholly owned subsidiary, UL Solutions, advances our shared public safety mission. We fund our work through grants, the licensing of standards documents and the business activities of UL Solutions, which conducts testing, verification, and certification, and provides training and advisory services, along with data-driven reporting and decision-making tools for customers around the world. To learn more, visit our websites UL.org and ULSE.org.

Options

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed