Job Description
Infrastructure Engineer (Systems & Storage) We are seeking a versatile Infrastructure Engineer to act as the guardian of our data integrity and physical backbone. This role manages the "Persistence Layer" of our global healthcare-related suite ensuring our high-performance storage and network stacks are resilient, secure, and ready for massive scale. You are a technical "Generalist" who is equally comfortable in a terminal as you are in a data center. You will own the health of our NetApp and Qumulo storage clusters and maintain the network security boundaries that keep our patient data safe. Key ResponsibilitiesNetwork & Security: Maintain Fortigate firewalls and Cisco network infrastructure, ensuring high availability and secure VLAN management. Storage Management: Administer high-performance NetApp clusters and high-capacity Qumulo systems. Data Center Operations: Manage the physical health of our East Coast data centers, including racking, cabling, and hardware lifecycle management. Collaboration: part of a global team, working closely with off-shore infrastructure engineers. Willing to initiate discussions with development teams early in the cycle. Incident Response: Participate in a modernized, data-driven incident response process to ensure 24/7 stability for our production environments. What You BringStorage Depth: 2+ years of experience with enterprise storage (specifically NetApp/ONTAP) including volume management and snapshots. Systems Generalist: A solid foundation in Linux (RHEL, OL8) and VMware virtualization. Networking Fundamentals: Comfortable managing firewall rules and switching in a complex, multi-site environment. Operational Discipline: Experience working within highly regulated frameworks like HIPAA, SOC2, or ARC-AMPE. Mobility: Ability to travel to our East Coast data center locations for hands-on hardware. Preferred Skills & MindsetCloud Infrastructure Management: Experience with Oracle OCI is a plus. Automation-First: Experience with (or a strong desire to master) Ansible, Terraform, and Python is a plus. Monitoring & Logs: Experience with modern observability tools like Datadog APM and CloudStrike Falcon is a plus. Problem Solver: You don't just fix the ticket; you find a way to automate the fix so the ticket never returns. PI