A leading technology company is seeking a Principal Solutions Architect to enable large clusters for AI and HPC workloads. This role involves leading technical discovery, shaping reference architectures, and partnering with teams to build value models. Candidates should have deep hands-on experience with GPU-based infrastructure, strong communication skills, and a Bachelor's degree in a related field. The position is based in San Jose, CA, offering competitive benefits and an inclusive work environment.
04/02/2026
Full time
This range is provided by AMD. Your actual pay will be based on your skills and experience - talk with your recruiter to learn more.

The Role
The AMD Datacenter GPU team is seeking an experienced Principal Solutions Architect to join our team focused on enabling large clusters for AI & HPC workloads.

The Person
The candidate will be a technical expert in datacenter infrastructure with deep knowledge of datacenter design, strong knowledge of compute (CPUs/GPUs), networking, and storage solutions, and experience partnering with customers to support RFP development. This role offers the opportunity to work at the cutting edge of AI & HPC infrastructure, solving complex technical challenges and helping customers implement transformative datacenter solutions at scale.

Key Responsibilities
- Lead customer technical discovery with data/ML, platform, and infrastructure stakeholders; map business goals to AI & HPC workloads and success metrics.
- Assess current system state (GPUs/accelerators, storage, fabric, security), identify gaps and risks, and define required POCs.
- Shape reference architectures for large AI & HPC clusters (rack design, GPU topology, RoCE/InfiniBand, NVMe/parallel FS) aligned to customer constraints (power, cooling, space). Create high-level designs.
- Partner with the business development and product teams to build ROI/TCO models (CapEx/OpEx, $/token, $/inference) and craft the value story.
- Support drafting of the technical sections of RFIs/RFPs; produce architecture diagrams, deployment plans, and implementation timelines.
- Partner with program & engineering teams to define POC success criteria, test plans, and exit reports.
- Collaborate with product management to foster product roadmap improvements.
- Network design for high-throughput GPU clusters (scale up / scale out / OOB), cabling.
- Storage architectures optimized for AI data pipelines.
- Datacenter layout strategies / power / cooling.
- Rack power delivery / mechanicals.

Required Experience
- Deep hands-on experience designing and implementing large-scale GPU-based infrastructure solutions, including datacenter network and storage architectures.
- Proven track record of creating technical documentation and reference architectures.
- Excellent communication skills with the ability to explain complex technical concepts.
- Experience working directly with customer technical teams.

Academic Credentials
Bachelor's degree or higher in Computer Science, Electrical Engineering, or a closely related field.

Location
San Jose, CA

This role is not eligible for visa sponsorship.

Benefits offered are described: AMD benefits at a glance.

Equal Employment Opportunity
AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's "Responsible AI Policy" is available here.

This posting is for an existing vacancy.
Lead Forward Deployed Software Engineer (2)
Join AMD as a Lead Forward Deployed Software Engineer and work on cutting-edge AI solutions that drive real business value for our partners.

About AMD
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Our culture of innovation, collaboration, and diversity pushes the limits of technology and enables bold ideas.

Role Overview
As a Forward Deployed Software Engineer you will be a technical partner for our most strategic clients, turning AMD's AI technology into tangible business value. You will work side by side with customers to prove out and deploy AI solutions on AMD GPUs, identify software optimization opportunities, and influence product roadmaps.

In This Role, You Will
- Work closely with strategic customers to understand their requirements and challenges, and identify opportunities for AMD hardware and software.
- Close gaps in the AMD software stack needed to support customer solutions.
- Work hands-on as a technical expert, developing side by side with customers to drive projects from proof of concept to production.
- Act as the voice of the customer, translating their needs and on-the-ground insights into actionable feedback that shapes AMD's AI roadmap.
- Thrive in unfamiliar territory with a high degree of autonomy, finding novel ways to apply AMD technology to real-world problems.

Key Qualifications
- Strong programming skills in C/C++ and Python.
- Experience with GPU kernel programming using CUDA, HIP or OpenCL.
- Proficiency in common ML performance analysis tools.
- Track record of client engagement, working directly with customers to solve ambiguous technical problems.
- Strong performance analysis and optimization skills for both CPU and GPU.
- Experience with containerization and orchestration technologies like Singularity, Docker, and/or Kubernetes.
- Expertise with modern AI/ML frameworks (e.g., PyTorch, TensorFlow, JAX).
- Experience with distributed training and inference frameworks.
- Experience with open source software development, including collaboration with community maintainers and submitting contributions.
- Experience with software engineering methodologies such as Agile, Scrum, Kanban.
- Excellent analytical and problem-solving skills.
- Ability to work independently and as part of a team.
- BS/MS/PhD in Computer Science or related field.

Preferred Experience
- Experience in compiler or ISA.
- Experience shipping software in an end-customer production environment.
- Experience implementing and optimizing communication primitives on GPU accelerators (NCCL/RCCL, OpenMP, MPI).
- Experience in all phases of software development, from requirement gathering to final release.
- Experience providing clear and timely communication of status and key project aspects to executive leadership.

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under applicable laws throughout all stages of the recruitment and selection process.
04/02/2026
Full time
Principal / Senior GPU Software Performance Engineer - Post Training

Base pay range
$226,400.00/yr - $339,600.00/yr

What you do at AMD
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

The Role
Drive the performance of post-training workloads on AMD Instinct GPUs. You'll work across kernels, distributed training, and framework integrations to deliver fast, stable, and reproducible training pipelines on ROCm.

The Person
The ideal candidate is passionate about software engineering and the craft of training performance. You lead sophisticated cross-stack issues (spanning data loaders, kernels, distributed training, and compilers) to clear resolution. You communicate crisply and collaborate effectively with framework, compiler, kernel, and model teams across AMD, driving measurable improvements with rigor, ownership, and reproducibility.

Key Responsibilities
- Lead performance for fine-tuning and RL training solutions on AMD GPUs.
- Improve throughput, memory efficiency, and stability across data, model, and optimizer steps.
- Optimize multi-GPU/multi-node training and communication patterns.
- Contribute efficient kernels/ops and targeted graph-level optimizations.
- Profile, diagnose, and resolve bottlenecks using standard tooling; prevent regressions in CI.
- Ship reproducible pipelines and documentation adopted by internal teams and external developers.
- Collaborate with framework, compiler, and model teams to land durable improvements.

Preferred Experience
- Proven GPU performance engineering for deep learning (ROCm/HIP, Triton, or similar).
- Hands-on with SFT, LoRA, and RL-based training at scale.
- Strong PyTorch experience (torch.distributed, FSDP/ZeRO or equivalent).
- Proficient in Python and C++; comfortable reading/writing kernels when needed.
- Experience with distributed systems and collective communication libraries.
- Track record of turning profiles into fixes, upstreaming changes, and documenting results.

Academic Credentials
B.S./M.S./Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.

Location
San Jose, CA preferred. Other US-based locations may be considered.

Benefits
Benefits offered are described: AMD benefits at a glance.

Equal Opportunity Employer
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
04/02/2026
Full time
A leading semiconductor company based in Santa Clara is seeking a Senior GPU Firmware Engineer to support GPU customers across various segments, including Cloud and HPC. The role involves collaboration with OEM partners and internal teams to ensure successful deployment of AMD's technology. Ideal candidates should possess strong skills in C/C++, embedded development, and system-level debugging, along with excellent communication skills.
04/02/2026
Full time
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

SENIOR GPU FIRMWARE ENGINEER

The Role
Join AMD's Datacenter firmware application team as a Firmware Application Engineer, supporting our GPU customers across Cloud, HPC, and OEM segments. In this customer-centric role, you will collaborate with external OEM partners, internal development and validation teams, and cross-functional stakeholders to bring next-generation server platforms powered by AMD's Instinct Accelerators to market, and ensure their successful deployment in customer data centers.

The Person
An ideal candidate should be familiar with embedded/firmware development, GPU driver/runtime, OS kernel internals, microcontroller fundamentals, hardware power/frequency controls, etc. The candidate should be comfortable performing quantitative analysis of workloads, pinpointing issues, and driving improvements together with the upper-layer stack to achieve the ultimate performance. You are a hands-on technical problem solver who thrives at the intersection of hardware and software. You enjoy collaborating directly with customers and internal engineering teams to turn complex system challenges into actionable solutions.

You'll Excel In This Role If You
- Are energized by customer engagement and technical troubleshooting.
- Have strong analytical instincts and a structured approach to problem solving.
- Communicate clearly and proactively across technical and non-technical audiences.
- Enjoy collaborating across hardware, firmware, and software disciplines.
- Bring curiosity, creativity, and persistence to complex engineering challenges.

Key Responsibilities
- Manage technical interaction with OEM/ODM Partners to enable deployment of AMD Instinct Accelerators in Partner systems.
- Work alongside hardware and upper software layers to co-optimize the whole AI software stack.
- Design and build tools for better collecting/presenting GPU performance details and correlating them to low-level hardware characteristics.
- Support Partners in the bring-up and validation of AMD Instinct GPUs in their systems; guide Partners on the use of AMD tools, qualification test methods, and analysis of test results.
- Lead the debug of Partner/Customer issues (firmware, HW, driver), working with a cross-functional team and driving the root cause investigation.
- Work with Partners on the development of manufacturing/screen tests to ensure reliability at scale.
- Understand Partner requirements and schedules, identify gaps in AMD's offering, and work with other stakeholders to close them.
- Author design guidelines, technical presentations, and training material.
- Provide recommendations to improve the customer experience with our SW and HW.

Preferred Experience
- Experience with firmware development.
- Experience with embedded software development.
- Experience with power management and control theory.
- Experience working on system-level reliability and resiliency features.
- Familiarity with OS kernel/driver internals.
- Familiarity with GPU architectures and runtimes.
- Familiarity with microcontroller fundamentals (caches, buses, memory controllers, DMA, etc.).
- Strong C/C++ programming skills.
- Strong knowledge of PC/server architecture and interfaces; experience with system-level debug.
- Strong system-level debugging skills with hands-on experience in system bring-up, HW debug, and performance optimization on various system architectures.
- Understanding of and experience working with Enterprise Linux environments (Ubuntu, CentOS/RHEL and SLES).
- Excellent oral and written communication skills to communicate technical results clearly and accurately.
- Experience with or knowledge of server firmware/BIOS settings, boot process, and server monitoring and management SW.
- Solid knowledge of Shell/BASH, C/C++, Python, or other frameworks.
- Experience with OpenCL, CUDA, or ROCm is a plus.

Preferred Academic Credentials
BS/MS (Computer Science, Computer Engineering, Electrical Engineering, or related equivalent).

Location
Santa Clara, CA

EEO Statement
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Overview
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.

The Role
The AI Models and Applications team at AMD is looking for a specialized senior manager who is passionate about enabling innovative and efficient Generative AI training and inferencing at scale. You will lead a core team of talented specialists and work with the latest Generative AI algorithms and software technology.

The Person
The ideal candidate has a deep technical understanding of the latest generative AI applications, such as large language models (LLMs), large multimodal models (LMMs), and image/video generation, and is passionate about developing efficient approaches to enable them on AMD devices. You have outstanding people leadership skills to lead, motivate, and guide your team in a fast-paced organization, and a strong ability to communicate effectively and work with different teams across AMD.

Why Join Us?
Exciting Opportunities: As a Senior Manager for AI Models Software Development, you will be at the forefront of innovation, working with the latest Gen AI models and algorithms. You will have the opportunity to shape the future of AI model training and inference optimizations across a variety of applications.
Talented Team: Join a team of highly skilled industry specialists who are passionate about pushing the boundaries of AI. Collaborate with like-minded professionals and learn from the best in the field.
Cutting-edge Technology: Work with state-of-the-art AI algorithms and software technology, enabling you to stay ahead of the curve and drive advancements in AI model development.
Impactful Work: Your contributions will directly impact the performance and efficiency of AI models, making a significant difference in various industries and applications.

Key Responsibilities
Lead a team that researches, designs, and implements novel methods for efficient Generative AI.
Propose and apply innovative techniques to support both training and inferencing, including innovative transformer architectures, parallelism strategies to train on large clusters, low-precision training, model quantization, and acceleration of algorithms on various AMD platforms, e.g., GPU/CPU/AIE.
Collaborate with software and hardware teams to co-optimize end-to-end (E2E) performance on current and future AMD solutions.
Work with open-source frameworks and communities (e.g., PyTorch, JAX, Hugging Face) to integrate AMD-optimized models and libraries and to publish training recipes.
Publish and promote your work within AMD and at external venues.

Preferred Experience
Strong technical expertise in Gen AI model training and inference, and familiarity with deep learning frameworks like PyTorch/JAX/vLLM/SGLang.
Strong technical expertise in algorithmic innovation towards efficient Gen AI applications for both training and inferencing.
Expertise or publications in one of the following areas preferred: efficient model architectures, optimized training, innovative parallelism strategies, low-precision training, model quantization.
Additional plus if publications include conferences such as NeurIPS, CVPR, ECCV/ICCV, ICML, ICLR, etc.
Experience productizing any of the following is also a plus: LLMs, large multimodal models, 3D world models, or agentic workflows.
Strong leadership and management skills.
Excellent written, verbal, and presentation skills, and the ability to coordinate internally and externally.
Several years of experience in AI, deep learning, and related software development, along with experience mentoring and managing high-performing teams.

Academic Credentials: Master's degree or above, with a major in CS, EE, Mathematics, or a related field.

Location: San Jose, CA, or Bellevue, WA. Hybrid options are available.

Benefits: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
04/02/2026
Full time
WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

The Role
Be part of AMD's analog/mixed-signal IP design team responsible for the design and development of next-generation IOs, high-speed memory (LPDDR5, DDR5, gDDR6, HBM2/HBM3, chip-to-chip, etc.) and chip-to-chip Gbps proprietary PHY IP solutions.

The Person
The ideal candidate has experience leading others in technical settings. You also have excellent communication, writing, and presentation skills.

Key Responsibilities
Define, review, and sign off on IP top-level and component-level specifications.
Design AMS component circuits and layouts.
Supervise pre-silicon layout and post-silicon characterization and debug.
Support product bring-up and debug, and sign off on test plans and characterization reports.
Interface with SOC teams, system HW/SW teams, and global manufacturing teams.

Preferred Experience
Experience in high-speed serial and/or parallel mixed-signal PHY/IO designs.
Strong fundamentals and knowledge of mixed-signal circuit architecture and design techniques for receiver/transmitter and PLL/DLL/clocking.
Hands-on design experience in multi-Gbps serial (PCIe, USB, etc.) PHY/IOs, parallel high-BW memory interface PHY/IOs (DDR4/DDR5, HBM2/HBM3, gDDR5/gDDR6, etc.), and chip-to-chip link PHY IPs such as UCIe.
Experience in mixed-signal design circuit blocks such as digital/analog DLLs, duty cycle correctors, clock and data recovery, and clock mixers.
Experience in low-power design techniques for high-speed/custom digital circuit design and analysis (e.g., CMOS/CML high-speed design for counters, dividers, etc.), including transistor-level timing sign-off.
Solid understanding of power, area, and performance trade-offs in mixed-signal IP design.
Design experience in FinFET advanced CMOS process nodes, 7nm and below, coupled with a solid understanding of transistor device performance and fundamentals.
Proficient in AMS design flows, tools, and methodologies.
Familiar with Cadence schematic capture, Virtuoso, Spectre and/or HSPICE circuit simulation tools.
Work with project managers, system architects, IC designers, and physical designers to guarantee quality, timely deliverables that meet the project's schedule and technical requirements.
Track record of successfully taking designs to production.
Excellent written and verbal communication skills; able to operate without direct supervision but also to work cross-functionally and across geographies as part of a multidisciplinary team in a dynamic, fast-paced environment.
Exhibit strong initiative and ownership of tasks and responsibilities. Seek help proactively, and share and pass on knowledge.

Academic Credentials
BS, MS or PhD in Electrical Engineering, Computer Engineering or related equivalent.

LOCATION: San Jose, California

This role is not eligible for visa sponsorship.

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's "Responsible AI Policy" is available here. This posting is for an existing vacancy.
04/02/2026
Full time
A leading technology company is seeking a Lead Forward Deployed Software Engineer in Santa Clara, California. You will work closely with strategic clients to deliver cutting-edge AI solutions using AMD technologies. The ideal candidate has strong programming skills in C/C++ and Python, and experience with GPU programming, machine learning frameworks, and client engagement. This role offers a unique opportunity to influence product roadmaps and drive innovation in AI solutions.
04/02/2026
Full time