This post is over 30 days old. The position may no longer be available

Site Reliability Engineering Lead

Eloquent Info Solution Private Limited , Bangalore · workindia.in · Full-time employment · Programming

Build and Lead a team of SREs ensuring that production applications are stable and reliable.

Be directly responsible for uptime.

Own end-to-end availability and performance of key services and build automation to prevent problem recurrence.

Automate current manual infrastructure management and alerts handling processes via Kubernetes, Terraform, CI/CD pipelines etc.

Assist in the roll-out and deployment of new product features and installations.

Find scalability bottlenecks and areas for performance improvements.


Work closely with technical leads to ensure that platforms are designed with scale and operability in mind

Help SREs in your team to grow and develop their careers through mentorship and performance management.

Requirements

5+ years of technical experience in Site Reliability Engineering

3+ years of experience as a people manager in an Engineering or Operations capacity.

Strong Linux administration skills with an emphasis on shell scripting.

Expertise with AWS platform.

Expertise with Jenkins.

Experience with infrastructure monitoring platforms and Application Performance Management (APM) systems (New Relic).

Experience with Configuration management tools (Puppet, Chef, Ansible).

Experience with CI/CD pipeline configuration, deployment, and support.

Experience making hiring decisions for SRE/DevOps teams.

Hands-on technical experience with supporting multi-tenant applications is required.

Apply for this position

Login with Google or GitHub to see instructions on how to apply. Your identity will not be revealed to the employer.

It is NOT OK for recruiters, HR consultants, and other intermediaries to contact this employer