Site Reliability Engineer-Lead

Crelio Health

Posted: 11 months ago

Company Website
https://cutshort.io/jo...
Position type
full time
Job source
Cutshort
Category
programming
Remote
No
Salary
---
Job location
Pune
About

Job Summary:

We are seeking a Senior DevOps & SRE Engineer to join our team and help us build, deploy, and maintain our infrastructure and applications. The ideal candidate will have experience working in a fast-paced environment and a strong background in DevOps and Site Reliability Engineering (SRE). You will be responsible for ensuring the reliability, scalability, and security of our applications and infrastructure.

 

Responsibilities:

  • Build and maintain our CI/CD pipeline and deployment automation tools
  • Design and implement monitoring and alerting systems to ensure the health of our applications and infrastructure
  • Work closely with development teams to ensure that code is deployed in a reliable and scalable manner
  • Participate in on-call rotations to provide 24/7 support for our production systems
  • Develop and maintain disaster recovery plans and processes
  • Continuously improve our infrastructure and processes to ensure scalability, reliability, and security
  • Mentor and provide technical leadership to junior team members
  • Keep up-to-date with industry best practices and emerging technologies in DevOps and SRE

Requirements:

  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • 5+ years of experience in DevOps or SRE
  • Strong programming skills in at least one of the following languages: Python, Go, Ruby, or Java
  • Experience with infrastructure as code tools such as Terraform or CloudFormation
  • Experience with containerization technologies such as Docker and Kubernetes
  • Strong understanding of networking concepts such as TCP/IP, DNS, and load balancing
  • Experience with monitoring and logging tools such as Prometheus, Grafana, and ELK stack
  • Excellent problem-solving skills and the ability to troubleshoot complex issues in a fast-paced environment
  • Strong communication and collaboration skills with both technical and non-technical stakeholders

Preferred Qualifications:

  • Experience with cloud providers such as AWS or Azure
  • Experience with building and maintaining large-scale distributed systems
  • Experience with database technologies such as MySQL, PostgreSQL, or MongoDB
  • Experience with automation tools such as Ansible or Chef
  • Experience with Agile development methodologies such as Scrum or Kanban

If you are passionate about DevOps and SRE and have the skills and experience we are looking for, we encourage you to apply for this exciting opportunity.

Skills:- SRE, Reliability engineering, Site reliability, and Site reliability engineer

Subscribe to our daily job alerts

Sign up for our newsletter to stay up to date with new jobs posted on Profilehunt

Please confirm your email address once you subscribe.