Distributed ML Engineer: vLLM & Kubernetes Inference

  • Red Hat
  • Boston, Massachusetts
  • 04/02/2026
Full time Information Technology Telecommunications Python

Job Description

A leading open-source solution provider in Boston seeks a Machine Learning Engineer focused on distributed vLLM infrastructure. Candidates will contribute to developing innovative solutions, maintaining distributed inference infrastructures, and optimizing performance within cloud-native environments. A strong proficiency in Python and Go, along with experience in Kubernetes, is essential. This role offers a salary range of $133,650 - $220,680 based on qualifications and experience, alongside comprehensive benefits.