Site Reliability Engineer 3 (Remote - India)

Jobgether
India
On-site
Full-time
Posted 13 days ago

Job Description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Site Reliability Engineer 3 in India.

As a Site Reliability Engineer 3, you will play a critical role in maintaining the reliability, scalability, and performance of cloud-based systems. You will lead initiatives to automate processes, monitor infrastructure, and improve operational efficiency while collaborating closely with software engineering teams. This role involves supporting production environments, managing incidents, and implementing long-term solutions to ensure high availability. You will also contribute to capacity planning, security best practices, and system improvements, all within a fast-paced, globally distributed environment. Ideal candidates thrive in problem-solving, mentoring, and designing solutions for complex, large-scale systems.

Accountabilities:

  • Provide production support and participate in an on-call rotation to ensure system uptime and reliability.
  • Manage customer and internal tickets, performing troubleshooting and corrective actions.
  • Develop and maintain automation scripts and tools to reduce manual processes and improve operational efficiency.
  • Monitor system performance and respond to alerts, ensuring high availability and optimal system performance.
  • Lead incident management, including root cause analysis and implementation of preventative measures.
  • Collaborate with engineering teams to support deployments, architecture design, and system improvements.
  • Create and maintain documentation for processes, troubleshooting, and operational guidelines.
  • Assist in capacity planning to support growth and scalability.
  • Implement and adhere to security best practices to protect systems and data.

Requirements

  • 5+ years of experience in site reliability engineering, DevOps, or software engineering, managing large-scale, high-availability systems.
  • Bachelor’s or Master’s degree in Computer Science, Information Technology, or equivalent practical experience.
  • Proficiency in Linux/Unix systems, networking, and cloud services (AWS, Azure, Google Cloud).
  • Strong experience with scripting languages such as Python, Bash, or Ruby, and programming languages like Go, Java, or C++.
  • Hands-on experience with containerization (Docker, Kubernetes) and infrastructure as code (Terraform, CloudFormation).
  • Knowledge of database management (SQL, NoSQL), load balancing, and distributed systems.
  • Expertise with monitoring/logging tools (Prometheus, Grafana, Splunk), configuration management (Ansible, Chef, Puppet), and CI/CD pipelines.
  • Excellent analytical, problem-solving, and communication skills.
  • Ability to lead and mentor team members and manage cross-functional initiatives.
  • Relevant certifications (AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or similar) are a plus.

Disclaimer: Real Jobs From Anywhere is an independent platform dedicated to providing information about job openings. We are not affiliated with, nor do we represent, any company, agency, or agent mentioned in the job listings. Please refer to our Terms of Services for further details.