In this role, you will be responsible for:
Technical Leadership: Provide technical guidance and mentorship to the DevOps engineering team, overseeing daily operations and long-term strategic initiatives.
Cloud Architecture: Design, implement, and manage highly available, scalable, and secure cloud infrastructure on Amazon Web Services (AWS), adhering to industry best practices.
Container Orchestration: Oversee the management, optimization, and security of Kubernetes clusters and containerized environments.
Infrastructure as Code (IaC): Lead the adoption of IaC principles, directing the development and maintenance of configurations using Terraform/Tofu and Terragrunt.
System Reliability: Establish and refine Site Reliability Engineering (SRE) and monitoring methodologies, using tools like Prometheus to ensure system health, performance, and reliability.
Automation & CI/CD: Drive the continuous improvement of CI/CD pipelines to enhance automation, efficiency, and deployment speed.
Collaboration: Work closely with software development and operations teams to integrate DevOps principles throughout the software development lifecycle (SDLC).
Technical Leadership: Provide technical guidance and mentorship to the DevOps engineering team, overseeing daily operations and long-term strategic initiatives.
Cloud Architecture: Design, implement, and manage highly available, scalable, and secure cloud infrastructure on Amazon Web Services (AWS), adhering to industry best practices.
Container Orchestration: Oversee the management, optimization, and security of Kubernetes clusters and containerized environments.
Infrastructure as Code (IaC): Lead the adoption of IaC principles, directing the development and maintenance of configurations using Terraform/Tofu and Terragrunt.
System Reliability: Establish and refine Site Reliability Engineering (SRE) and monitoring methodologies, using tools like Prometheus to ensure system health, performance, and reliability.
Automation & CI/CD: Drive the continuous improvement of CI/CD pipelines to enhance automation, efficiency, and deployment speed.
Collaboration: Work closely with software development and operations teams to integrate DevOps principles throughout the software development lifecycle (SDLC).
Requirements:
A minimum of five years of experience in a DevOps, SRE, or a related technical leadership role.
Demonstrated expertise in containerization and orchestration, with extensive hands-on experience managing Kubernetes in production environments.
Proven proficiency in designing and deploying both microservice and monolithic architectures on Amazon Web Services (AWS).
Advanced skills in Infrastructure as Code (IaC) with Terraform/Tofu and Terragrunt.
Comprehensive systems administration experience across both Linux and Windows environments.
Strong command of Site Reliability Engineering (SRE) principles and monitoring stacks, with a particular emphasis on Prometheus.
In-depth knowledge of CI/CD pipeline implementation and automation best practices.
Exceptional analytical, problem-solving, and communication abilities
Preferred Qualifications:
Professional certifications such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or related credentials.Hands-on experience with comprehensive monitoring and logging solutions and various vendor tools.
Familiarity with managed data services such as Mongo Atlas, Confluent Kafka, and Elastic Cloud.
Deep understanding of cloud security best practices, particularly for AWS and containerized workloads.
Proficiency with networking principles and web server technologies, including Nginx and IIS.
Strong understanding of leveraging AI and AI-powered IDEs to enhance development and operational efficiency.
A minimum of five years of experience in a DevOps, SRE, or a related technical leadership role.
Demonstrated expertise in containerization and orchestration, with extensive hands-on experience managing Kubernetes in production environments.
Proven proficiency in designing and deploying both microservice and monolithic architectures on Amazon Web Services (AWS).
Advanced skills in Infrastructure as Code (IaC) with Terraform/Tofu and Terragrunt.
Comprehensive systems administration experience across both Linux and Windows environments.
Strong command of Site Reliability Engineering (SRE) principles and monitoring stacks, with a particular emphasis on Prometheus.
In-depth knowledge of CI/CD pipeline implementation and automation best practices.
Exceptional analytical, problem-solving, and communication abilities
Preferred Qualifications:
Professional certifications such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or related credentials.Hands-on experience with comprehensive monitoring and logging solutions and various vendor tools.
Familiarity with managed data services such as Mongo Atlas, Confluent Kafka, and Elastic Cloud.
Deep understanding of cloud security best practices, particularly for AWS and containerized workloads.
Proficiency with networking principles and web server technologies, including Nginx and IIS.
Strong understanding of leveraging AI and AI-powered IDEs to enhance development and operational efficiency.
This position is open to all candidates.







