About the company
Impossible Cloud represents the spirit of innovation and determination. Our cutting-edge cloud solutions help bridging the gap between web3 technology and mainstream B2B cloud use cases. We are eliminating frictions currently slowing web3 mass adoption and deliver key benefits like increased speed and security while optimizing costs. Impossible Cloud was founded by serial entrepreneurs who formerly built multiple unicorns. With our top team of passionate people who have joined Impossible Cloud from several different countries, we are continuously researching and pushing the boundaries of distributed technologies. Impossible Cloud is backed by an all-star team of internationally renowned venture capital companies, and we are part of the Protocol Labs Network.
Job Summary
What you will do
šMaintain and Expand the Observability Stack: Ensure the reliability, availability, and scalability of our observability stack leveraging Grafana. šImplement Monitoring Solutions: Develop and implement monitoring solutions to meet both engineering and business needs, enhancing our ability to detect and diagnose issues proactively. šOptimize Performance and Reduce Costs: Continuously improve the performance and efficiency of the observability tools to handle large-scale, distributed environments at a low cost. šCollaborate with Teams: Work closely with engineering and operations teams to understand their requirements and provide effective observability solutions. šDocumentation and Training: Create and maintain documentation for observability tools and processes, and assist in training team members to effectively use these tools. šTroubleshoot and Resolve Issues: Investigate and resolve issues related to observability, ensuring minimal disruption to the business and maintaining system integrity.
What you bring to the table
šKubernetes: Hands-on experience deploying and managing components in a Kubernetes environment. Also experience with tools like Helm, AgoCD and Git will greatly benefit you. šFirst experience with Grafana Stack: First experience with Grafana and its components, i.e. Mimir, Loki, Tempo, and/or Alloy is a plus. šTechnical Skills: Good understanding of monitoring and observability principles in highly distributed systems (on-prem). šProblem-Solving Abilities: Excellent troubleshooting and problem-solving skills. šCommunication Skills: Strong written and verbal communication skills, with the ability to document and convey technical information clearly.