Senior Site Reliability Engineer in Denver · Austin Fraser
Banner Default Image


About the job

This is a remote, US-based role.

This role will address two of our main products: Kubernetes Advisory and ClusterOps.

Through Kubernetes Advisory, we write infrastructure automation, migrate to high-availability solutions, and craft Kubernetes-based microservice architectures for our clients. You'll work directly with clients to ensure their goals are met through automation, analysis, and infrastructure configuration.

Via ClusterOps, we help companies implement Kubernetes quickly and effectively on top of AWS or Google Cloud Platform. You'll manage globally load-balanced installations, as well as smaller, single-region affairs. If the client elects to subscribe to our pager service, we provide 24/7/365 pager rotation for infrastructure outages (only on the great infrastructures we build).

No matter the product, we take a collaborative approach with our clients - we believe that we provide amazing service and our clients get the best value when we work closely together. We integrate task tracking (TargetProcess), chat (Slack), and source control (Github) into our own workflow through careful and clever use of automation, giving our clients the ability to contact us like they would with an internal DevOps team but without an operational burden on us.

You'll work with technologies such as Terraform and Kubernetes on top of AWS, Google Cloud Platform, and Azure. You will setup and automate high availability application clusters for technologies like Ruby-on-Rails, Django, Node, and Elixir (to name a few). You'll engineer management, orchestration, monitoring, and alerting for fleets of instances. If you've ever wanted to work at a scale that few companies do, you'll find the right challenge here at Fairwinds!

We all work from home (US remote hours only). We know that the whole world has turned remote as a result of a massive pandemic complicating all of our lives, but we have always been and will continue to be "Remote First". We have a minimum vacation policy and unlimited time off, and require healthy work-life balance. We pay 100% of individual and family health insurance premiums.

We're hiring across experience bands from mid to senior levels.

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.


  • Perform DevOps-focused consulting work for diverse clients
  • Build and maintain large-scale Kubernetes deployments
  • Create and maintain system architecture, design, and automation
  • Pair with other SRE/SAs, mentor junior staff
  • Release and maintain open-source software and projects
  • Author blog posts and participate in the community by going to meetups, conferences, etc. as a Fairwinds representative
  • Manage availability and performance problems for clients; automate resolution to prevent re-occurrence
  • Limited participation in 24/7 pager as necessary