About the company
We combine the power of Web3 and creativity to build experiences that connect people from all corners of the globe.
Job Summary
Responsibilities:
š Identify, propose and execute improvements to performance and scalability bottlenecks in our current systems/infrastructure on AWS šMeasure systems' health, scalability and performance metrics and identify areas of improvement šUtilize your knowledge of code to solve broad operational challenges within the Limit Breaks Infrastructure and Platform šWork with the wider engineering team to identify how we can provide the most production-like environment for running both manual and automated testing šDefine SLOs, SLIs, monitoring, alerting and incident response practices
Qualifications:
š 5+ years experience in SRE, Dev Ops or Systems engineering šStrong background in kubernetes šExtensive experience in Terraform and Ansible šCI/CD and automation experience šStrong background in AWS šAbility to participate in an on-call rotation šEffective communication skills to be able to clearly explain your reasoning and thought process for anything you propose šExcellent collaboration skills to be able to work closely with product engineers and product owners to understand their context and co-design appropriate solutions which balance feature velocity with site reliability šImplementation of in-house monitoring and observability infrastructure šImplementation of Elastic Search stack or equivalent solutions for capturing logs from all environments šWorking with InfoSec to implement various tools to monitor and protect the environment in real-time



