Published on www.allthetopbananas.com 06 Mar 2025
We are hiring multiple Site Reliability Engineers (SREs) to join our growing team. The SREs will work closely with the DevOps team to implement standardized tools and practices to ensure high reliability and scalability of our systems. Responsibilities: Maintain and enhance the reliability, availability, and performance of large-scale systems. Follow established DevOps guidelines and standards for tool development and system management. Develop automation scripts for monitoring, alerting, and incident response. Collaborate with the DevOps team to improve infrastructure and platform tools (e.g. spug.cc). Design and implement CI/CD pipelines using GitLab for application and infrastructure deployment. Manage containerized environments using Kubernetes. Monitor and analyze system metrics to optimize performance and efficiency. Implement disaster recovery and high-availability strategies to ensure system resilience. Requirements: 3-8 years of experience in SRE or DevOps roles. Proficiency in Infrastructure as Code (IaC) using Terraform. Hands-on experience with CI/CD pipelines in GitLab. Proficiency in scripting languages like Python and Bash. Familiarity with cloud platforms such as AWS technology like EC2, KMS, VPC. Strong problem-solving and collaboration skills. Seniority level
Mid-Senior level Employment type
Full-time Job function
Referrals increase your chances of interviewing at OSL by 2x.
#J-18808-Ljbffr