Conviction HR
Site Reliability Engineer - Distributed Systems
Job Location
hyderabad, India
Job Description
Job Title : Site Reliability Engineer (SRE) - : Hyderabad (5 Days WFO) (Hyderabad Candidates are only preferred.) IMMEDIATE : Conviction HR Type : Contract-to-Hire (C2H) Job Description : ConvictionHR is seeking a talented Site Reliability Engineer (SRE) to join our growing team. This Contract-to-Hire position is perfect for an individual who is passionate about improving system reliability and performance while collaborating closely with both development and operations teams. Business Requirements : - 10 Yrs exp in Linux, Windows, VMWare with strong skills in Infrastructure as Code using Java, Python, Unix Shell, Powershell or equivalent. - Hands on experience in Jenkins, DevSecOps Frameworks, Terraform, Ansible, Chef or Puppet will be a plus. - Strong experience in building and deploying CI/CD pipelines for complex distributed software is required. - Good working knowledge on Containers using Docker or Podman, Kubernetes is a plus. - Experience in one or more programing languages - Java, Python, Unix Shell, Powershell or equivalent - Experience and expertise in distributed systems is a must - Experience in building software and systems to manage platform infrastructure and application - Providing primary operational support and engineering for multiple large scale distributed software applications Key Responsibilities : - Design, implement, and maintain scalable and reliable infrastructure and services. - Monitor system performance and reliability, ensuring high availability and quick recovery from incidents. - Collaborate with development teams to improve application performance through automation and best practices. - Develop and maintain incident response plans, conducting post-mortem analyses to prevent future issues. - Implement monitoring and alerting solutions to proactively identify and resolve issues. - Participate in the on-call rotation to provide support for production systems. - Advocate for a culture of reliability, operational excellence, and continuous improvement. Qualifications : - Bachelor's degree in Computer Science, Information Technology, or a related field. - 3 years of experience in a Site Reliability Engineering or related role. - Strong understanding of cloud platforms (e.g., AWS, Azure, Google Cloud) and container orchestration (e.g., Kubernetes, Docker). - Proficiency in scripting and automation tools (e.g., Python, Bash, Terraform). - Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack). - Familiarity with CI/CD pipelines and DevOps methodologies. - Excellent problem-solving skills and a proactive mindset. - Strong communication skills and ability to work collaboratively with diverse teams. What We Offer : - Competitive salary and comprehensive benefits. - Opportunities for professional growth and development. - A collaborative, innovative, and inclusive work environment. (ref:hirist.tech)
Location: hyderabad, IN
Posted Date: 10/9/2024
Location: hyderabad, IN
Posted Date: 10/9/2024
Contact Information
Contact | Human Resources Conviction HR |
---|