About the company
Gemini is a regulated cryptocurrency exchange, wallet, and custodian that makes it simple and secure to buy bitcoin, ether, and other cryptocurrencies.
Job Summary
Responsibilities:
📍Running on-going performance evaluations and improvements for Gemini systems 📍Creating âProduction-ready Scorecardsâ to evaluate the health of systems pre-launch 📍Implementing and teaching monitoring, alerting and automated resolution best practices 📍Defining SLIs, SLOs with Engineering teams 📍Educating and guiding Engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue/green deployments etc. 📍Building operational tooling and automations
Qualifications:
📍2+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale 📍Good knowledge for various cloud technology providers like AWS, GCP, or Azure 📍Experience in a code-first environment, developing automated solutions to solve support and operational issues 📍Experience working with containerization such as Nomad, EKS (k8s), Docker, etc. 📍Experience working with Configuration Management such as Ansible, Chef, Puppet 📍Experience writing scripts or cli tools that help increase Developer Productivity 📍Experience working with Engineering teams to implement best-practice technical solutions 📍Experience working in a code-drive, automation-first public cloud infrastructure (Terraform)