BoulderRecruiter Since 2001
the smart solution for Boulder jobs

Software Engineer, ML Compiler and Framework

Company: Fathom Radiant
Location: Boulder
Posted on: May 26, 2023

Job Description:

About us

We are searching for talented individuals who are driven to tackle the most ambitious goal of our time - building the computer hardware that enables the development of safe artificial general intelligence. See more at fathomradiant.co/aboutus

In our people, we above all value kindness, a scout mindset, a focus on improvement, and prioritising to get the right things done. We aim to help build one of the most transformative technologies in the world, with massive social and ethical implications. We think this makes representation even more important, and we are actively striving to have a range of diverse perspectives on our team.

This role

As a Software Engineer responsible for ML Compiler and Framework, you'll develop Fathom's runtime components and work closely with our Network Modeling Architect architect to optimize the performance of our high performance computing architecture. You'd collaborate closely with our partners and also maintain vendor-provided ML software frameworks.

This role is open to fully remote.

Areas of contribution:

  • Deploy and maintain vendor provided ML software frameworks
  • Develop runtime components of Fathoms ML software stack
  • Extend distributed systems collectives libraries for novel network architectures and integration with Fathom's cluster architecture and OpenFlow framework.
  • Augment vendor graph compilers to integrate with Fathom's ML cluster and optimally map ML models to utilize Fathom's high-bandwidth, low-latency interconnect.
  • Collaborate closely with our partners and customers to train large-scale neural networks on Fathom's computing fabric and benchmark Fathom's cluster.

    Requirements (necessary skills for this role):
    • MS or PhD in Computer Science or Electrical Engineering
    • Proficiency in modern C++ (17 preferred) and experience in C and Python
    • Experience in Machine Learning and Computer Architecture
    • Experience with distributed systems collectives such as NCCL or OpenMPI
    • Experience with CUDA programming, OpenMP/OpenCL programming or AI hardware accelerator kernel programming.
    • Experience with compiler optimizations such as loop optimizations, vectorization, and parallelization. Experience with open source projects such as MLIR, LLVM, IREE, XLA, TVM is preferred.
    • Solid foundation in algorithms and data structures.
    • Strong analytical, problem-solving, and communication skills.

      Nice-to-haves (we will prioritize candidates that also have these skills):
      • Experience in working with large ML/HPC workloads with distributed computing systems built with accelerators such as GPUs or TPUs.
      • Experience working with frameworks such as PyTorch and TensorFlow
      • Experience with runtime analysis and optimizations for latency, memory bandwidth, IO access, compute utilization analysis.

        $90,000 - $180,000 a year

        The salary range above is determined for Boulder, CO. The exact salary for remote roles depend on the location and the experience level of the candidate.

        For all roles, we target market salaries, with an additional benefits package. Our comprehensive benefits include startup equity, medical expenses coverage (including extra coverage for employees with a family).

Keywords: Fathom Radiant, Boulder , Software Engineer, ML Compiler and Framework, IT / Software / Systems , Boulder, Colorado

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category
within


Log In or Create An Account

Get the latest Colorado jobs by following @recnetCO on Twitter!

Boulder RSS job feeds