About the company

Gemini is a regulated cryptocurrency exchange, wallet, and custodian that makes it simple and secure to buy bitcoin, ether, and other cryptocurrencies.

Job Summary

Responsibilities:

📍Lead and manage a team of Site Reliability Engineers, fostering a culture of collaboration, innovation, and operational excellence. 📍Develop and execute the SRE team's strategic goals, objectives, and roadmap in alignment with the overall business objectives. 📍Oversee the design, implementation, and maintenance of highly available and scalable production systems. 📍Drive continuous improvement initiatives by identifying areas for enhancement and implementing best practices, automation, and process improvements. 📍Collaborate with cross-functional teams and Departments to ensure smooth integration of applications and systems. 📍Define and enforce Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure system reliability and uptime. 📍Monitor system performance, troubleshoot issues, and ensure timely incident response, root cause analysis, and problem resolution. 📍Implement effective monitoring, logging, and alerting systems to proactively identify and mitigate potential issues. 📍Stay up-to-date with industry trends, emerging technologies, and best practices related to SRE and DevOps, and apply them to improve operational efficiency.

Minimum Qualifications:

📍Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience). 📍Proven experience as a Site Reliability Engineer or similar role, with at least 5 years of hands-on experience in managing production systems. 📍Strong expertise in the listed technologies: Ansible, Concourse CI, Jenkins, Github Actions, EKS (Kubernetes), Linux Administration. 📍Demonstrated experience in leading and managing a team of technical professionals. 📍Solid understanding of SRE principles, including reliability, scalability, availability, and performance. 📍Proficient in scripting and automation (e.g., Python, Bash, or similar). 📍Experience with infrastructure-as-code (IaC) tools, configuration management, and CI/CD pipelines. 📍Knowledge of cloud platforms (e.g., AWS, Azure, or Google Cloud) and containerization technologies (e.g., Docker). 📍Excellent problem-solving skills and the ability to thrive in a fast-paced, dynamic environment. 📍Strong communication and leadership skills, with the ability to collaborate effectively with both technical and non-technical stakeholders.

Preferred Qualifications:

📍Relevant certifications, such as Certified Kubernetes Administrator (CKA) or AWS Certified DevOps Engineer. 📍Experience with monitoring and observability tools (e.g., Datadog, New Relic, Prometheus, Grafana, ELK Stack). 📍Familiarity with agile methodologies and experience working in an Agile/Scrum environment.

Senior Manager Production Engineering

About the company

Job Summary

Responsibilities:

Minimum Qualifications:

Preferred Qualifications:

Salaries for similar jobs:

Similar jobs