Site Reliability Engineer (SRE) / Software Engineer (SWE)
Location: Mountain View, CA
Required Qualifications
- Proficiency in Go and/or Kotlin programming languages preferred.
- Experience with Google Cloud Platform (GCP) services and architecture is a must.
- Strong understanding of infrastructure as code (IaC) principles, particularly with Terraform.
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
Key Responsibilities
- Monitor and maintain the health of production applications.
- Respond to system alerts and logs to ensure high availability and performance.
- Analyze, troubleshoot, and resolve code issues in Go and Kotlin.
- Collaborate with the development team to implement fixes and improvements.
- Design, implement, and manage infrastructure using Terraform.
- Set up and maintain monitoring, logging, and alerting systems to proactively identify and address problems.
- Work closely with cross functional teams to ensure seamless integration and deployment of applications.
- Participate in on call rotations and provide support as needed.
Experience
- Previous experience in a DevOps or Site Reliability Engineering role with a focus on cloud environments.
- Demonstrated ability to troubleshoot complex systems and code issues.
Seniority level
Mid Senior level
Employment type
Contract
Job function
Engineering and Information Technology
Industries
IT Services and IT Consulting