About the company
About OKX
OKX is a leading crypto trading app and Web3 ecosystem. Trusted by more than 20 million global customers in over 180 international markets, OKX is known as the fastest and most reliable crypto trading app of choice for investors and professional traders globally. Our Singapore office is a Product and Engineering hub, and we are in the process of expanding our teams in Singapore to support the continued growth of our global business. We build and maintain a core trading platform with millions of daily active users. Design, Product, and Engineering teams work cross-functionally to identify customer needs and ship high-quality new features through fast iterations.
Job Summary
What You’ll Be Doing
📍Effectively optimize existing runtime environments (KVM, Docker, K8S, JVM, etc.) to ensure efficient resource utilization and stable service operation.
📍Deeply understand the architecture and principles of middleware (Kafka, Spring Cloud, Nacos, Apollo, Kong Gateway, etc.), ensuring high performance and availability.
📍Ensure stability and optimize big data platforms (Alibaba Cloud DataWorks, AWS EMR, AWS Databricks, Spark, Flink) and data warehouses (MaxCompute, Hologres, Hive, ClickHouse, StarRocks, etc.).
📍Comprehend network architecture and security, providing guidance on infrastructure stability based on network architecture and security layers, ensuring secure, stable, and efficient network communications.
📍Lead chaos engineering exercises, coordinating with business units to validate system robustness and recovery capabilities through simulated failure scenarios.
📍Participate in rapid response and troubleshooting of system failures, continuously optimize monitoring strategies to reduce system downtime and ensure service continuity and stability.
📍Drive infrastructure automation and intelligence to improve SRE work efficiency and quality.
📍Collaborate closely with development teams, providing technical support and advice on infrastructure to jointly promote continuous product improvement and innovation.
What We Look For In You
📍Bachelor's degree or above in Computer Science or related field, with 8+ years of experience in large-scale internet or cloud computing platform development/SRE/operations.
📍In-depth understanding of big data platforms, data warehouses, middleware, runtime environments, and network technology principles and architectures, with rich practical experience and troubleshooting skills.
📍Proficient in Linux system management and optimization, familiar with scripting languages such as Shell/Python, able to write automation tools and scripts.