About the company
Our mission is to bring blockchain to a billion people. The Alchemy Platform is a world class developer platform designed to make building on the blockchain easy. We've built leading infrastructure in the space, powering over $105 billion in transactions for tens of millions of users in 99% of countries worldwide. The Alchemy team draws from decades of deep expertise in massively scalable infrastructure, AI, and blockchain from leadership roles at leading companies and universities like Google, Microsoft, Facebook, Stanford, and MIT. Alchemy recently raised a Series C1 at a $10.2B valuation led by Lightspeed and Silver Lake. Previously, Alchemy raised from a16z, Coatue, Addition, Stanford University, Coinbase, the Chairman of Google, Charles Schwab, and the founders and executives of leading organizations. Alchemy powers the top blockchain companies globally and has been featured in TechCrunch, Forbes, Bloomberg, and elsewhere.
Job Summary
What You'll Do:
šSet high standards for Reliability at Alchemy šDevelop and own company wide Reliability best practices like SLO definition, incident management, postmortem reviews, launch readiness reviews, change management šArchitect production infrastructure and tools that encourage and enforces high reliability šInspire the broader engineering organization to ensure Reliability is a first class citizen in the products we build šCollaborate, partner, advice, review and mentor engineering teams on Reliability topics like high reliability architecture, observability, safe change management šImprove critical infrastructure and systems that are used to operate infrastructure at scale (i.e. compute, networking, deployment, observability, code tooling/libraries etc.) šDevelop and own best practices for managing production infrastructure: provisioning, application scaling, configuration management, capacity planning, monitoring, etc. šDevelop and own best practices for developer processes: CI/CD, dev and staging environments, etc. šProvide input into long-term platform requirements and operational guidelines with a focus on reliability šContinuously raise our standard of engineering excellence by implementing best practices for coding, testing, and deployment šBuild and maintain documentation around process and workflows
What We're Looking For:
š5+ years of experience as an Infrastructure Engineer focused on Reliability (e.g., Site Reliability Engineer, Production Engineer, Platform Engineer) šExperience leading and driving company wide reliability efforts and engineering initiatives šExperience with observability best practices and tooling like Prometheus, Grafana and Datadog šExperience designing and operating large-scale, multi-region production systems šExperience working with AWS or other cloud infrastructures
The future of finance is here ā whether youāre interested in blockchain, cryptocurrency, or remote web3 jobs, thereās a perfect role waiting for you.