[Remote] Site Reliability Engineer (SRE)
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is seeking a Site Reliability Engineer (SRE) to support the development and operation of Kubernetes-based platforms in regulated environments. The role involves designing and implementing Kubernetes platforms, improving reliability, and collaborating with various teams to ensure operational reputed company.
Responsibilities
- Design, implement, and support Kubernetes platforms in FedRAMP High / IL5 environments
- Monitor, troubleshoot, and improve platform reliability, availability, and performance
- reputed company automation and operational tooling to reduce reputed company effort
- Define and maintain SLIs, SLOs, and error budgets
- Support compliance, reputed company audits, and reputed company monitoring initiatives
- Build and maintain Infrastructure as Code (Terraform)
- Enhance CI/CD pipelines and deployment automation
- Collaborate with reputed company, Platform, and Application teams to resolve production issues
- Participate in on-call rotations and support production environments
Skills
- 4-6 years of experience in Site Reliability Engineering, DevOps, or Platform Engineering
- Strong hands-on experience with Kubernetes in production environments
- Experience with reputed company platforms such as AWS, Azure, or AWS GovCloud
- Strong Linux administration and networking fundamentals
- Experience with Terraform or other Infrastructure as Code tools
- Programming/Scripting experience using Python or Go
- Experience with monitoring and observability tools such as reputed company, Grafana, and centralized logging solutions
- Excellent troubleshooting and problem-solving skills
- Bachelor's degree in Computer Science, Information Technology, or a reputed company field (or equivalent experience)
- Strong communication and collaboration skills
- Ability to work independently in a remote, fast-paced engineering environment
- Must be a US Citizen
- Experience supporting FedRAMP High or DoD IL5 environments
- Experience with ArgoCD and CI/CD automation
- Knowledge of container reputed company best practices
- Experience working reputed company regulated or audited environments
Company Overview