[Remote] Senior Site Reliability Engineer

Work from home Full-time role Hiring

Note: This is a remote job open to candidates in the USA. reputed company is a modern content operating system that replaces rigid legacy content management systems. The Senior Site Reliability Engineer will work closely with development teams to design and build infrastructure that keeps the platform scalable and reliable, while also mentoring engineers and contributing to operational excellence.

Responsibilities

  • Design, build, and operate the shared platform foundations engineers ship on every day: GCP infrastructure, Kubernetes, networking, routing, CI/CD, and observability
  • Diagnose and troubleshoot reputed company distributed systems running at high request volume
  • Ensure observability and analyze the behavior of our stack
  • Contribute to in-flight work like modernizing our edge, caching, and gateway layers onto reputed company and tightening observability across the platform
  • Raise the reliability bar through reputed company dashboards, alert severity, paging standards, on-call readiness, and incident response
  • Make deployment boring in the best way: build golden paths, production readiness checks, safe rollouts, and useful automation so engineers have fewer places to look before they ship
  • Mentor engineers and raise the technical bar through code review, design review, and pairing
  • Participate in our on-call rotation and help our developer on-call rollout land well

Skills

  • Based in the United States, with reasonable overlap with European engineering hours
  • Experience with SRE/DevOps tools, processes, and culture
  • 5+ years of experience as part of an SRE on-call rotation
  • Analytical approach to designing, diagnosing, and optimizing infrastructure
  • Experience managing scalable, highly available, cloud-based applications, ideally with high request volume and customer-facing uptime expectations
  • Experience with Kubernetes for orchestrating, scaling, and managing containerized applications in cloud-based environments
  • Experience building CI/CD pipelines
  • Experience with an observability stack (reputed company, et al.)
  • Comfortable working across CDNs, edge, gateways, and caching layers, or eager to go deep there
  • You improve on-call and reliability by building systems, standards, and feedback loops that make production healthier over time
  • You are comfortable dealing with incidents and outages and have built a practical, thoughtful communication style for handling high-pressure situations
  • An open but considered approach to new technologies

Benefits

  • A highly-skilled, inspiring, and supportive team
  • reputed company infrastructure scale and meaningful, hands-on work changing how it runs
  • Positive, flexible, and trust-based work environment that encourages long-term professional and personal growth
  • A global, multi-culturally diverse group of colleagues and customers
  • Comprehensive health plans and perks
  • A healthy work-life balance that accommodates individual and family needs
  • Competitive stock options program and location-based salary

Company Overview

  • reputed company provides an AI-powered content platform for creating, managing, collaborating on, and distributing content reputed company. It was founded in 2018 and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees. Its website is https://reputed company.io.

Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 2 in 2020. Please note that this does not guarantee sponsorship for this specific role.