5 days ago Be among the first 25 applicants
DescriptionDo you want to be part of the AI revolution? At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to AI hardware and software infrastructure. In order to deliver on that vision, we've created innovative software and hardware solutions that make it possible. AWS Neuron is the SDK that optimizes the performance of complex ML models executed on AWS Inferentia and Trainium, our custom chips designed to accelerate deep learning workloads.
This role is for a software engineer in the Compiler team for AWS Neuron. As part of this role, you will be responsible for building the next generation Neuron compiler which transforms ML models written in popular frameworks (e.g., PyTorch, TensorFlow, and JAX) to be deployed on AWS Inferentia and Trainium based servers in the Amazon cloud. You will solve hard compiler optimization problems to achieve optimum performance for a wide variety of ML model families, including massive scale large language models such as Llama and Deepseek, as well as stable diffusion, vision transformers, and multi model solutions. You will need an in depth understanding of how these models work inside out to make informed decisions that coax the compiler into generating optimal instruction implementations. You will leverage technical communications skills to partner with internal and external customers/stakeholders and will be involved in pre silicon design, bringing new products/features to market, ultimately making the Neuron compiler highly performant and easy to use.
Key job responsibilitiesAs you design and code solutions to help our team drive efficiencies in compiler architecture, you'll create optimization passes, build feature surface areas for AWS accelerators, implement analysis tools, and resolve compiler defects. You'll also engage in design discussions, code reviews, and maintain effective communication with internal teams and external communities in a startup like environment.
About the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and celebrate knowledge sharing and mentorship. Senior members enjoy one on one mentoring and thorough, but kind, code reviews. We care about career growth and strive to assign projects that help team members develop engineering expertise.
Basic QualificationsAmazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: working safely and cooperatively with other employees, supervisors, and staff; adhering to standards of excellence despite stressful conditions; communicating effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and following all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, please visit for more information.
Compensation reflects the cost of labor across several US geographic markets. Base pay ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on multiple factors including location, experience, and skills. For more information, please visit This position will remain posted until filled. Applicants should apply via our internal or external career site.