Browse IT Jobs | IT Job Board

Glean San Francisco, California

About Glean
Founded in 2019, Glean is an innovative AI-powered knowledge management platform designed to help organizations quickly find, organize, and share information across their teams. By integrating seamlessly with tools like Google Drive, Slack, and Microsoft Teams, Glean ensures employees can access the right knowledge at the right time, boosting productivity and collaboration. The company's cutting-edge AI technology simplifies knowledge discovery, making it faster and more efficient for teams to leverage their collective intelligence.
Glean was born from Founder & CEO Arvind Jain's deep understanding of the challenges employees face in finding and understanding information at work. Seeing firsthand how fragmented knowledge and sprawling SaaS tools made it difficult to stay productive, he set out to build a better way - an AI-powered enterprise search platform that helps people quickly and intuitively access the information they need. Since then, Glean has evolved into the leading Work AI platform, combining enterprise-grade search, an AI assistant, and powerful application- and agent-building capabilities to fundamentally redefine how employees work.

The Role
The Agents Runtime team builds the low-latency, reliable, and secure foundation that powers Glean's AI agents and assistant experiences at scale. You'll design and operate core runtime services for multi-turn orchestration, tool calling, model routing, memory, streaming, and safety. You'll work across distributed systems, production observability, and ML infra integrations to deliver an experience that feels instant, accurate, and trustworthy - while optimizing cost and reliability.

You Will
Own impactful runtime problems end-to-end - from architecture and design to production launch and ongoing reliability.
Build and evolve core services for session lifecycle, streaming responses (e.g., gRPC/WebSockets), structured tool execution, memory/state, and policy/guardrails.
Design for performance, correctness, and cost: reduce p50/p95 latency, improve tail behavior, and optimize token/tool budgets.
Integrate with leading LLM providers (e.g., OpenAI, Anthropic, Google Gemini) and internal evaluation frameworks to improve quality and predictability.
Harden the platform with fault isolation, retries, timeouts, circuit-breaking, backpressure, and graceful degradation.
Instrument deep observability (tracing, metrics, logs) and create playbooks/SLOs for high availability and on-call excellence.
Collaborate closely with product, quality, and application teams to prioritize the most impactful roadmap investments.

You Are
3+ years of software engineering experience building production distributed systems or cloud-native applications.
BS/BA in Computer Science or related field, or equivalent practical experience.
Strong coding skills in at least one of: Python, Go, Java, or C++, with a focus on reliability, performance, and tests.
Product-minded: you prioritize customer impact, clear SLAs/SLOs, and pragmatic iteration.
Ownership-driven with a positive, proactive attitude; comfortable leading projects or learning from battle-tested engineers.
Experience operating services on Kubernetes and at least one major cloud (e.g., GCP, AWS, or Azure).
Familiarity with event/streaming systems (e.g., Pub/Sub, Kafka), caching (e.g., Redis), and data stores for low-latency paths.
Practical understanding of LLM/agents building blocks: tool/function calling, structured outputs, streaming, and model selection/routing.
Strong observability and debugging skills: tracing (e.g., OpenTelemetry), metrics, dashboards, and production forensics.
Background in one or more areas is a plus: policy/guardrails, multi-tenant isolation, rate-limiting, concurrency control, cost optimization.

Location
This role is hybrid (3-4 days a week in one of our SF Bay Area offices).

Compensation & Benefits
The standard base salary range for this position is $140,000 - $265,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits.
We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, a generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused.
We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race.

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Software Development

04/05/2026

Full time


Staff Software Engineer, AI Agentic Experience (Auth0)

Okta San Francisco, California

Get to know Okta
Okta is The World's Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.

The Auth0 Team
Auth0 is an easy to implement authentication and authorization platform designed by developers for developers. We make access to applications safe, secure, and seamless for the more than 100 million daily logins around the world. Our modern approach to identity enables this Tier Ø global service to deliver convenience, privacy, and security so customers can focus on innovation.

The Staff Software Engineer
At Okta, we're building the next generation of authentication for the GenAI era. We're looking for a Staff Software Engineer to join the AI DevEx team at Auth0, extending and complementing our Auth for GenAI offering by building the infrastructure, tooling, and developer experiences that empower both human developers and AI agents to build secure, intelligent applications.

What you will be doing
Design and build developer tooling that helps developers secure and manage infrastructure like MCP servers.
Build demo applications that showcase secure, identity powered AI use cases in real world environments.
Contribute to open source projects, both within Auth0 and across the broader AI + identity ecosystem.
Write and maintain high quality documentation including API references, quickstarts, and best practices for both developers and AI native tooling (e.g., llm.txt).
Drive integration with emerging AI frameworks by creating adapters, utilities, and interfaces for agent runtimes and orchestration layers.
Collaborate with design, product, and security teams to align on developer needs, roadmap direction, and compliance requirements.
Mentor and support other engineers, setting strong examples in code quality, testing practices, and architectural thinking.
Influence engineering standards by leading design discussions and contributing to team wide architectural decisions.
Ensure resilience and security of systems involved in agent to agent or model to service communication.

You might be a good fit if you have
Experience in software engineering with a proven track record in building tools, frameworks, or platforms for other developers.
Proficiency in JavaScript/TypeScript, Golang and/or Python, and the ability to move fluidly between front end and back end contexts.
Experience working with LLM APIs, agent runtimes, orchestration layers, or prompt pipelines.
Familiarity with authentication and authorization systems, especially standards like OAuth2, OIDC, and JWT.
Demonstrated experience leading architecture and design efforts for scalable, production grade systems.
Comfort contributing to and maintaining open source projects and engaging with developer communities.
A passion for documentation as part of the developer experience - not just writing code, but making it understandable and usable.
Ability to thrive in highly collaborative environments with cross functional stakeholders.

Technologies you may work with
Languages: JavaScript, TypeScript, Python
Frameworks: React, Next.js, FastAPI
AI Ecosystem: Model APIs, orchestration runtimes, prompt management systems, agent toolkits
Auth0 Stack: Token Vault, Async Authorization, Fine Grained Authorization (FGA)

Compensation
Annual base salary range for candidates located in the San Francisco Bay Area: $188,000-$282,000 USD. Okta also offers equity (where applicable), bonus, and benefits, including health, dental, and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies.

Benefits
Amazing benefits
Making social impact
Developing talent and fostering connection & community at Okta

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable laws. If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding, please use this Form to request an accommodation. Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at

04/04/2026

Full time


Software Engineer, Backend

Cleric San Francisco, California

Overview
Join us at Cleric. We're building an autonomous AI agent that investigates and resolves production incidents. Our agent combines LLMs with tools to understand systems, reason through problems, and take corrective actions - even for issues it hasn't seen before. Cleric is already running in production at high-scale companies across fintech, ride-sharing, and autonomous vehicles.

About The Role
We're hiring a Software Engineer, Backend to help build and scale our autonomous agent. This role is ideal for engineers who want to build, ship, and work on production AI systems in real-world environments. You'll work closely with senior engineers on agent reasoning, integrations, and runtime systems, contributing directly to real customer-facing functionality.

What You'll Do
Contribute to autonomous agent capabilities and tool execution workflows
Build and extend integrations with production systems
Support development of agent evaluation and observability tooling
Work with senior engineers to design, implement, and ship features
Learn how to build reliable AI systems operating in production environments

You have
1-2 years of professional software engineering experience
Strong fundamentals in software engineering and system design
Experience programming in Python (experience with other languages is a plus)
Interest in AI, ML, and agentic systems
A desire to learn quickly and work on real production problems

Nice to have
Exposure to AI/ML systems or LLM-based products
Experience with distributed systems or cloud infrastructure
Startup or fast-paced team experience

How We Work
Small teams, big impact
Radical candor in a positive environment
In-person collaboration
AI-first in everything we do

Interview process
Intro Call & Technical Screen
Software engineering session
System design
Bar raiser (product & engineering practices)

04/02/2026

Full time


Founding Full-Stack Engineer

Floot (YC S25) San Francisco, California

Overview
Founding Full-Stack Engineer role at Floot (YC S25). Base pay range is provided; your actual pay will be based on skills and experience - talk with your recruiter to learn more.

Base pay range
$120,000.00/yr - $250,000.00/yr

Responsibilities
As an engineer, you will work directly with the founders to build the product and shape our culture. You'll wear many hats and touch everything from AI systems and user-facing features to backend infrastructure.
Talk to users often to deeply understand how to improve Floot's AI and UX (you can expect to spend 10-20% of your time on support)
Ship full-stack features and build the foundational building blocks that users will use to build their apps seamlessly (e.g. storage, payments, auth, etc.)
Improve our AI runtime to handle edge cases and make our agentic system more reliable
Do prompt engineering and evals to improve AI output quality (for both function + design)
Build robust infrastructure for our end-to-end platform to power millions of production apps

Requirements
Strong full-stack experience with React, TypeScript, and Node.js: you've shipped production features end-to-end
Strong sense of ownership: you're autonomous and own features from start to finish
Fast but thoughtful: you ship quickly without cutting corners on things that matter
Strong sense for UX and product polish: you believe that craft matters

Seniority level: Entry level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Software Development

04/02/2026

Full time


Full Stack Engineer

Moss (YC F25) San Francisco, California

We're on a mission to make conversational AI agents think and respond at the speed of human conversation, everywhere they run. Today's retrieval stack wasn't built for real-time reasoning, so we're building Moss: the semantic search layer for the AI-agent era. Backed by YC, we're making every AI product feel instantaneous, deeply personalized, and actually useful in the real world.
We're looking for a senior Full-Stack Engineer (6+ years experience) who thrives on solving complex problems and will not wait for someone else to tell them what to do. You'll work directly with the founding team to architect core systems, build complex infrastructure, and shape the future of agentic experiences.

Must-Have Requirements
6+ years building and shipping production software in a product-focused environment.
Strong expertise in TypeScript and modern front-end frameworks (React / Next.js).
Experience designing and owning backend services (Node.js, Rust, Go, or similar).
Strong fundamentals in performance optimization, caching, and profiling.
Experience with cloud infrastructure (AWS, Azure, Cloudflare, or GCP) and observability.
Ability to move fast in ambiguous environments - high ownership and bias for action.

Nice-to-Have Skills
Familiarity with vector search, embedding pipelines, or real-time retrieval systems.
Experience building developer tools or production SDKs.
Knowledge of API versioning, usage metering, billing, and multi-tenant auth.
Contributions to open-source infra / web performance libraries.

Key Responsibilities
Architect and build core services that power Moss at scale.
Own end-to-end product surfaces: APIs, dashboards, developer experience, and tooling.
Optimize Moss for latency, resource efficiency, and reliability across runtime environments.
Work directly with customers and design partners to shape product direction.
Contribute to the Moss engineering culture with strong code quality, fast iteration, and rapid learning.

Equity: Meaningful stock options.
Location: San Francisco (preferred), flexibility for exceptional talent.
Work: In person collaboration as a small, high-trust, founder-led team.

Seniority level: Mid-Senior level
Employment type: Full-time

04/02/2026

Full time


Sr Machine Learning Manager, Intelligent System Experience

Apple Inc. San Francisco, California

San Francisco Bay Area, California, United States Machine Learning and AI The Apple Intelligence Platform team builds the foundational on-device software infrastructure that powers Apple Intelligence. We develop the APIs, platforms, and systems that enable breakthrough features like Writing Tools, Siri, Visual Intelligence, and Image Playground. Our work spans the full software stack - from low level inference engines and runtime optimization to high level platform APIs like the Foundation Models API. Our mission is to build production ready software platforms that deliver magical experiences to millions of users. We work closely with research teams to understand new ML techniques, then focus on the engineering challenges of building robust, scalable systems to ship them. Whether it's designing APIs for agentic workflows, optimizing inference pipelines, implementing efficient caching strategies for attention mechanisms, or creating infrastructure for search and retrieval, we're focused on the platform engineering and software architectures that make Apple Intelligence possible. We value strong software engineering fundamentals combined with deep understanding of ML systems and techniques. Our team members know how to build production platforms that are reliable, performant, and maintainable at scale, while also understanding the ML primitives - transformers, attention, KV caches, kernels - that power these systems. We're looking for a leader who can drive platform development, build strong engineering teams, and deliver the software infrastructure that powers Apple Intelligence features used by millions every day. Description As a Senior Machine Learning Manager on the Apple Intelligence Platform team, you will lead a team of engineers building critical platform infrastructure and APIs that power features used by millions of Apple customers daily. 
You'll be responsible for the technical execution and delivery of software systems that span the platform stack - from runtime engines and inference pipelines to developer facing APIs and service integrations. This role requires someone who deeply understands both ML fundamentals and platform engineering, able to make informed architectural decisions about how to build software systems that efficiently support modern ML techniques. You will work cross functionally with researchers, product teams, and platform engineers to build the production software systems needed to ship new capabilities. Success in this role means delivering robust, well architected platforms that leverage your understanding of ML to enable world class Apple Intelligence experiences. Responsibilities Lead and grow a team of platform engineers building ML infrastructure, APIs, and services for Apple Intelligence Build production software platforms to support new ML techniques, translating research innovations into shippable, scalable systems Ensure platforms meet Apple's standards for performance, reliability, scalability, privacy, and user experience Guide development of APIs and frameworks for agentic workflows and complex multi step ML systems Establish engineering excellence through software architecture standards, code quality practices, testing, and operational rigor Mentor engineers on platform engineering, ML systems implementation, API design, and production software best practices Partner with product and engineering leaders to align platform capabilities with feature requirements and business goals Minimum Qualifications 8+ years of experience in ML platform engineering, ML infrastructure, or related fields, with 3+ years in technical leadership or management roles Deep understanding of ML fundamentals including neural network architectures, transformers, attention mechanisms, and inference optimization Strong software engineering fundamentals with expertise in systems design, API 
architecture, and distributed systems Proven experience building and shipping production ML APIs, platforms, or infrastructure at scale Strong knowledge of ML software stacks and modeling primitives: KV caching, kernel methods, attention architectures, and efficient inference techniques Hands on experience with ML frameworks (PyTorch, TensorFlow, JAX), serving systems, and production ML deployment Understanding of modern agentic workflows, multi step reasoning systems, and API design for complex ML applications Track record of leading engineering teams and delivering complex ML platform projects from conception to production Strong architectural skills with experience making technical decisions for large scale, high performance ML systems Excellent communication and collaboration skills with ability to work across research, product, and engineering teams Preferred Qualifications BS, MS, or PhD in Computer Science, Machine Learning, or related field (or equivalent industry experience) At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $262,500 and $433,400, and your base pay will depend on your skills, qualifications, experience, and location. Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. 
Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits. Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program. Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant. Apple accepts applications to this posting on an ongoing basis.

04/02/2026

Full time


6 jobs found
