Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.

We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.

We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services.

If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About This Role:
The Principal Software Engineer for the Model LifeCycle team will play a crucial role in building a comprehensive managed platform for the entire application development lifecycle, with a specific focus on leveraging Machine Learning models, including Large Language Models (LLMs).

What You'll Be Working On:
Manage fine-tuning systems for large foundation models (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and cost-efficient scaling.
Implement and maintain end-to-end training pipelines for Large Language Models.
Build Reinforcement Fine-Tuning (RFT), distillation, and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling).
Dataset, model, and experiment management: versioning, lineage, evaluation, and reproducible fine-tuning at scale.

What You'll Bring to the Team:
Advanced degree in Computer Science, Engineering, or a related field.
8+ years of industry experience leading and driving impactful projects in the AI space.
Experience in Generative AI (Large Language Models, Multimodal).
Hands-on experience training, fine-tuning, and aligning LLMs using Reinforcement Learning and Reinforcement Fine-Tuning (RFT) techniques.
Proactive and collaborative approach with the ability to work autonomously.
Passion for building cutting-edge AI products and solving challenging technical problems.

Bonus Points:
Proficiency in Golang or Python for large-scale, production-level services, and in PyTorch.
Contributions to open-source AI projects such as vLLM or similar frameworks.
Performance optimizations on GPU systems and inference frameworks.

Benefits:
Competitive compensation
Restricted Stock Units
Paid time off & paid holidays
Comprehensive health, dental & vision insurance
Employer contributions to HSA account
Paid parental leave
Paid life insurance, short-term and long-term disability
Professional development & tuition reimbursement
Mental health & wellness support
Commuter benefits (parking & transit)
Cell phone stipend
401(k) Retirement plan with company match up to 4% of salary
Volunteer time off

Compensation Range:
Compensation will be paid in the range of $208,725 - $279,565, plus bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
04/24/2026
Full time
Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.

We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.

We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services.

If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About This Role:
The Senior Staff Software Engineer for the Model LifeCycle team will play a crucial role in building a comprehensive managed platform for the entire application development lifecycle, with a specific focus on leveraging Machine Learning models, including Large Language Models (LLMs).

What You'll Be Working On:
Manage fine-tuning systems for large foundation models (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and cost-efficient scaling.
Implement and maintain end-to-end training pipelines for Large Language Models.
Build Reinforcement Fine-Tuning (RFT), distillation, and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling).
Dataset, model, and experiment management: versioning, lineage, evaluation, and reproducible fine-tuning at scale.

What You'll Bring to the Team:
Advanced degree in Computer Science, Engineering, or a related field.
8+ years of industry experience leading and driving impactful projects in the AI space.
Experience in Generative AI (Large Language Models, Multimodal).
Hands-on experience training, fine-tuning, and aligning LLMs using Reinforcement Learning and Reinforcement Fine-Tuning (RFT) techniques.
Proactive and collaborative approach with the ability to work autonomously.
Passion for building cutting-edge AI products and solving challenging technical problems.

Bonus Points:
Proficiency in Golang or Python for large-scale, production-level services, and in PyTorch.
Contributions to open-source AI projects such as vLLM or similar frameworks.
Performance optimizations on GPU systems and inference frameworks.

Benefits:
Competitive compensation
Restricted Stock Units
Paid time off & paid holidays
Comprehensive health, dental & vision insurance
Employer contributions to HSA account
Paid parental leave
Paid life insurance, short-term and long-term disability
Professional development & tuition reimbursement
Mental health & wellness support
Commuter benefits (parking & transit)
Cell phone stipend
401(k) Retirement plan with company match up to 4% of salary
Volunteer time off

Compensation Range:
Compensation will be paid in the range of $237,600 - $318,240, plus bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
04/24/2026
Full time
Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.

We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.

We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services.

If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About This Role
We are seeking a Staff Software Engineer to architect, design, and develop the intelligence layer that controls how every GPU node in Crusoe's fleet gets assigned, monetized, and managed. You would be one of the first engineers on both the Virtual Pool Service and Capacity Management Intelligence systems, shaping implementation, making real design decisions, and building the foundational infrastructure that Crusoe's entire cloud platform depends on. You'll play a crucial role in delivering end-to-end use cases and workflows for a vertically integrated, AI-first cloud while driving key business revenue metrics at scale.

What You'll Be Working On
Building the Virtual Pool Service (VP Service), a physical infrastructure classification layer that serves as the single source of truth for every GPU node's state, pool membership, and transition history across Crusoe's fleet
Designing and implementing Capacity Management Intelligence (CMI), the automation layer that handles priority-descending allocation, forward availability forecasting, and automated node lifecycle transitions - replacing manual spreadsheet workflows with enforced, auditable, event-driven automation
Collaborating extensively across teams to architect and implement physical infrastructure management systems, availability platforms, and frameworks that meet end-to-end customer use cases
Championing reliability, scalability, and security of our systems, designing high-performing, highly available cloud architectures optimized for both performance and cost-effectiveness
Streamlining cloud deployment, configuration management, and operations using Go, gRPC, NATS event streaming, PostgreSQL (CNPG on Kubernetes), and NetBox as the physical source of truth
Mentoring fellow engineers and actively contributing to team growth in collaboration with engineering managers

What You'll Bring to the Team
A Bachelor's degree in Computer Science or Software Engineering
10+ years of relevant experience building and operating distributed systems at scale
Proven experience building reliable, scalable, and secure cloud platforms and running them in production
Strong distributed systems thinking with the ability to reason about consistency, failure modes, event ordering, and correctness invariants
Fluency in Go, Rust, Java, or C++; Go is our primary language, but strong engineers from other backgrounds ramp quickly
A collaborative, platform-minded approach to building robust systems and driving adoption across dev and ops teams
Ownership mentality with comfort owning a system end to end: design, implementation, testing, ops, and iteration
Good judgment under ambiguity, with the ability to drive open-ended technical decisions to resolution
Excellent communication and troubleshooting skills across cross-functional teams

Bonus Points
Hands-on experience deploying, managing, and troubleshooting Kubernetes clusters
Prior experience with event-driven architectures or message streaming systems (NATS, Kafka, Kinesis)
Experience with capacity planning, resource scheduling, or fleet management systems
Background in GPU compute, AI/ML platform infrastructure, or fast-paced startup environments
A passion for sustainability, clean energy, and building AI infrastructure that scales responsibly

Benefits
Industry-competitive pay
Restricted Stock Units in a fast-growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company-paid commuter benefit: $300 per month

Compensation
Compensation will be paid in the range of $209,000 - $253,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
04/24/2026
Full time
Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.

We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.

We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services.

If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About the Role:
Crusoe Cloud seeks a highly skilled and experienced Staff Software Engineer to lead the development and execution of our cutting-edge Software-Defined Networking strategy. You will play a pivotal role in driving innovation and performance improvements within our network infrastructure by leveraging advanced technologies such as XDP/eBPF, DPDK, SmartNICs, and DPUs/IPUs.

What You'll Be Working On:
Develop and execute the roadmap for Crusoe Cloud's Software-Defined Networking strategy.
Lead the engineering team through all phases of the software development lifecycle, including architecture decisions, design processes, design reviews, code reviews, and implementation tasks.
Collaborate closely with the network infrastructure organization to develop and deploy industry-leading networking solutions.
Lead the design, development, and support of Linux kernel and driver components, focusing on system architecture and optimization.
Drive the adoption and integration of kernel bypass technologies such as XDP/eBPF, AF_XDP, and DPDK.
Deeply understand and leverage network accelerators such as Mellanox/Nvidia SmartNICs (ConnectX-6/7), BlueField-3 DPUs, and Intel IPUs.
Collaborate with cross-functional teams across the organization to ensure successful project delivery and operational excellence.

What You'll Bring to the Team:
6+ years of proven experience in building and operating high-performance networking systems in a production environment.
Strong proficiency in system programming languages such as C, C++, and/or Rust.
Deep expertise in Linux systems internals, including kernel architecture, memory management, and device drivers.
In-depth knowledge of network programming principles and packet processing pipelines.
Hands-on experience with kernel bypass technologies like XDP/eBPF, AF_XDP, and DPDK.
Proven understanding of TCP/IP and networking accelerators such as Mellanox/Nvidia SmartNICs, BlueField-3 DPUs, and Intel IPUs.
Familiarity with technologies like SR-IOV, vDPA, and scalable functions.
Strong background in kernel or embedded development, with a focus on the Linux kernel.
Experience with Open vSwitch, OpenFlow, and Open Virtual Network (OVN) technologies.
Proven ability to effectively communicate and collaborate with both technical and non-technical stakeholders.
Demonstrated commitment to professional software engineering best practices, including coding standards, code reviews, source control management, testing, and operations.
A strong track record of contributions to the open-source community (e.g., Open vSwitch/OVS, Open Virtual Network/OVN, Multus, Cilium).

Bonus Points:
Advanced degree in Computer Science, Engineering, or a related field.
Proven leadership experience in a technical role.
Strong analytical and problem-solving skills.
Experience with cloud networking platforms (AWS, Azure, GCP) and virtualization technologies (VMware, KVM).

Benefits:
Competitive compensation
Restricted Stock Units
Paid time off & paid holidays
Comprehensive health, dental & vision insurance
Employer contributions to HSA account
Paid parental leave
Paid life insurance, short-term and long-term disability
Professional development & tuition reimbursement
Mental health & wellness support
Commuter benefits (parking & transit)
Cell phone stipend
401(k) Retirement plan with company match up to 4% of salary
Volunteer time off

Compensation Range:
Compensation will be paid in the range of $185,000 - $224,000, plus bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
04/24/2026
Full time
Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster. We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI. We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services. If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About the Role:
We are actively seeking an exceptional Senior Software Engineer for our cloud software team who will contribute to the operations of our cutting-edge infrastructure. Your expertise will be instrumental in designing and scaling our carbon-reducing operating model, as well as managing critical hardware, software, and network components. In this role, you will write and review code and contribute to proposals and architecture documents. You will evaluate tools and frameworks, carefully considering their impact on reliability, scalability, operational costs, and ease of adoption. Your expertise in orchestration and optimization will be instrumental in advancing our managed Kubernetes and AI training clusters, ensuring they lead the industry in reliability and performance.

What You'll Be Working On:
Contribute to the development of scalable and robust software solutions, closely aligning with the strategic objectives outlined in the Crusoe Cloud roadmap.
Work collaboratively with tech leads and engineers to create a dynamic environment where creativity and technical excellence are encouraged, leading to the development of cutting-edge cloud solutions.
Continuously stay abreast of the latest trends and techniques in cloud software, incorporating these insights to keep Crusoe's offerings innovative.
While you won't have formal management responsibilities, you will support the development of your peers by sharing knowledge and providing guidance in technical discussions.

What You'll Bring to the Team:
You have 5-7 years of experience working in software engineering, with strong experience in systems engineering.
You possess 2+ years of programming experience in Golang.
You have experience with Kubernetes, Linux engineering, and debugging.
You are skilled in infrastructure as code and familiar with systems-level challenges.
You have experience with Terraform and GCP (preferred).
You understand Argo, CI/CD, and automated testing pipelines.
You can build and manage Kubernetes operators and controllers, developing and maintaining essential components that ensure the reliability and efficiency of the Kubernetes environment.
You can develop scalable systems to compete with leading services like Google Kubernetes Engine (GKE) and Amazon Elastic Kubernetes Service (EKS).
You can oversee critical projects with broad impact, leading initiatives focused on networking, quality control, and automation to ensure optimal performance and reliability.
You can design and take ownership of system architecture, including CI/CD pipelines, while ensuring adherence to security standards.
You have excellent communication skills, both verbal and written.

Benefits:
Competitive compensation
Restricted Stock Units
Paid time off & paid holidays
Comprehensive health, dental & vision insurance
Employer contributions to HSA account
Paid parental leave
Paid life insurance, short-term and long-term disability
Professional development & tuition reimbursement
Mental health & wellness support
Commuter benefits (parking & transit)
Cell phone stipend
401(k) Retirement plan with company match up to 4% of salary
Volunteer time off

Compensation Range:
Compensation will be paid in the range of $180,000 - $210,000 + Bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
04/24/2026
Full time
Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster. We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI. We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services. If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About This Role:
The Principal Software Engineer for the Model LifeCycle team will play a crucial role in building a comprehensive managed platform for the entire application development lifecycle, with a specific focus on leveraging Machine Learning models, including Large Language Models (LLMs).

What You'll Be Working On:
Manage fine-tuning systems for large foundation models (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and cost-efficient scaling.
Implement and maintain end-to-end training pipelines for Large Language Models.
Extend fine-tuning and training pipelines with Reinforcement Fine-Tuning (RFT) and reinforcement learning.
Distillation and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling).
Dataset, model, and experiment management: versioning, lineage, evaluation, and reproducible fine-tuning at scale.

What You'll Bring to the Team:
Advanced degree in Computer Science, Engineering, or a related field.
8+ years of industry experience leading and driving impactful projects in the AI space.
Experience in Generative AI (Large Language Models, Multimodal).
Hands-on experience training, fine-tuning, and aligning LLMs using Reinforcement Learning and Reinforcement Fine-Tuning (RFT) techniques.
Proactive and collaborative approach with the ability to work autonomously.
Passion for building cutting-edge AI products and solving challenging technical problems.

Bonus Points:
Proficiency in Golang or Python for large-scale, production-level services, and in PyTorch.
Contributions to open-source AI projects such as vLLM or similar frameworks.
Performance optimizations on GPU systems and inference frameworks.

Benefits:
Competitive compensation
Restricted Stock Units
Paid time off & paid holidays
Comprehensive health, dental & vision insurance
Employer contributions to HSA account
Paid parental leave
Paid life insurance, short-term and long-term disability
Professional development & tuition reimbursement
Mental health & wellness support
Commuter benefits (parking & transit)
Cell phone stipend
401(k) Retirement plan with company match up to 4% of salary
Volunteer time off

Compensation Range:
Compensation will be paid in the range of $208,725 - $279,565 + Bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
04/24/2026
Full time
Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster. We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI. We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services. If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About This Role:
Crusoe Cloud is revolutionizing high-performance computing by offering sustainable, low-cost GPU compute power. As a Senior Cloud Support Engineer, you'll play a crucial role in empowering our customers to leverage this technology for groundbreaking advancements in fields like AI/ML, physics simulations, and computational biology. You will be the primary point of contact for technical support, ensuring our customers can seamlessly utilize Crusoe Cloud to achieve their goals. This role directly impacts Crusoe's mission by enabling our customers to accelerate their research and development, contributing to a more sustainable future. You will be involved in exciting projects, working with cutting-edge technologies and collaborating with a talented team to solve complex challenges. The ideal candidate is a highly motivated and experienced technical professional with a passion for customer success, a deep understanding of cloud technologies, and a commitment to Crusoe's values. This is a full-time position.

What You'll Be Working On:
Customer Support: Provide exceptional technical support to customers via Zendesk, meeting SLAs and maintaining high CSAT (95%+).
On-Call Rotation: Participate in a 24/7 on-call rotation to ensure timely resolution of critical issues.
Troubleshooting: Diagnose and resolve issues related to VMs, hardware failures, and scaling tests using CLI and internal tools.
Alert Triage and Maintenance: Manage alert triage, prepare for maintenance windows, and conduct node delivery testing.
Collaboration: Work closely with SRE, Networking, and Storage teams from initial triage to root cause analysis (RCA) delivery.
Global Teamwork: Adhere to global team collaboration and handoff processes for ticketing and on-call procedures.
Knowledge Sharing: Develop onboarding/training materials, knowledge base documentation, and standard operating procedures (SOPs).

What You'll Bring to the Team:
Education/Experience: Bachelor's degree in IT, Computer Science, Engineering, or a related field, or 4+ years of equivalent technical experience.
Linux Proficiency: Strong command-line interface (CLI) skills in Linux environments.
Version Control: Proficiency with Git for code management and collaboration.
Customer Support Experience: 5+ years of experience in a customer support role, ideally within cloud, storage, or networking environments.
Cloud Technologies: Experience with container orchestration (e.g., Kubernetes), workload management (e.g., Slurm, Terraform), and monitoring tools (e.g., Grafana).
Public Cloud Knowledge: Familiarity with other public cloud platforms (e.g., AWS, Azure, GCP).
Communication Skills: Excellent communication and customer service skills, including the ability to prioritize competing escalations.
HPC Knowledge: Understanding of HPC technologies such as InfiniBand, RDMA, RoCE, and Software Defined Networking (SDN).

Bonus Points:
Certifications: CKA, CKAD, CKS, KCNA, AWS Machine Learning - Specialty, Data Analytics - Specialty, Solutions Architect - Professional, Developer - Associate, NVIDIA AI Infrastructure and Operations, Generative AI and LLMs, Generative AI Multi-modal, InfiniBand, Linux Foundation IT Associate, System Administrator.
Cloud Expertise: Deep understanding of specific cloud platforms and services.
Automation Skills: Experience with automation tools and scripting languages.
Problem-Solving Abilities: Demonstrated ability to analyze complex technical issues and develop effective solutions.
Collaboration and Mentorship: Proven ability to mentor, train, and onboard colleagues.
Passion for Sustainability: A strong interest in contributing to a more sustainable future through technology.

Benefits:
Competitive compensation
Restricted Stock Units
Paid time off & paid holidays
Comprehensive health, dental & vision insurance
Employer contributions to HSA account
Paid parental leave
Paid life insurance, short-term and long-term disability
Professional development & tuition reimbursement
Mental health & wellness support
Commuter benefits (parking & transit)
Cell phone stipend
401(k) Retirement plan with company match up to 4% of salary
Volunteer time off

Compensation:
Compensation will be paid between $125,000 and $151,000 + Bonus. Restricted Stock Units are included in all offers. Salary will be determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
04/24/2026
Full time
Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack - from electrons to tokens - to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.

We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that - with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.

We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved - people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services. If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About the Role:
As a Senior Staff Cloud Support Engineer, you are a technical authority within Crusoe Cloud and a force multiplier across Customer Experience, SRE, Networking, Fleet, and Product teams. You operate beyond ticket resolution. You design reliability guardrails, influence architecture decisions, mentor senior engineers, and directly protect revenue by preventing large-scale incidents. You bring deep expertise in Linux systems, Kubernetes, networking, and AI/ML infrastructure, and apply that knowledge with strong customer focus. You are comfortable operating in ambiguity, leading incident response, and shaping how Crusoe scales high-performance AI infrastructure globally.

What You'll Be Working On:

Technical Leadership & Escalations
Serve as the highest-level escalation point for complex P1/P0 incidents.
Lead cross-functional root cause investigations involving compute, networking (IB/RDMA/RoCE), storage, and orchestration layers.
Partner with SRE and Software teams (Storage, Networking, Compute, Kubernetes) to design systemic fixes rather than recurring workarounds.

Reliability Architecture
Design and improve node validation, burn-in processes, performance baselining, and release readiness.
Influence Kubernetes architecture, workload orchestration (Slurm, Terraform), and AI/ML cluster stability.
Reduce MTTR and incident recurrence through structural improvements.

AI/ML Infrastructure Expertise
Troubleshoot NCCL, IB, and GPU driver/firmware issues, as well as distributed training failures.
Support complex AI workloads (training + inference) with performance tuning and observability improvements.

Customer-Facing Authority
Act as senior technical advisor during high-risk customer incidents.
Deliver executive-ready RCAs with clarity and confidence.
Drive trust through transparency and technical depth.

Mentorship & Standards
Mentor P3/P4 engineers.
Define SOPs and technical standards for support excellence.
Partner with Enablement to raise the technical bar across the organization.

What You Bring to the Team:
8+ years of experience in SRE, DevOps, HPC, or Cloud Infrastructure roles.
Advanced Linux systems expertise.
Deep Kubernetes operational experience (CKA-level or higher).
Strong networking knowledge: InfiniBand, RDMA, RoCE, SDN.
Experience supporting AI/ML workloads at scale (GPU clusters).
Proven track record of resolving multi-layer, distributed system failures.
Strong customer communication and executive-facing presence.

Benefits:
Competitive compensation
Restricted Stock Units
Paid time off & paid holidays
Comprehensive health, dental & vision insurance
Employer contributions to HSA account
Paid parental leave
Paid life insurance, short-term and long-term disability
Professional development & tuition reimbursement
Mental health & wellness support
Commuter benefits (parking & transit)
Cell phone stipend
401(k) Retirement plan with company match up to 4% of salary
Volunteer time off

Compensation Range:
Compensation will be paid in the range of $180,000 - $220,000 + Bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
04/24/2026
Full time