About the company
VMLY&R COMMERCE is WPP’s newest end-to-end Creative Commerce Company. Built on the commerce expertise of Geometry and scaled through VMLY&R’s connected brand promise, we help modern brands grow by unifying marketing strategies around commerce. We believe the space of commerce holds the most untapped creative potential to grow brands and people. Living Commerce™ is our proprietary approach and platform that guides how we work. By understanding how, when and why people buy, we deliver the most engaging and culturally relevant creative commerce experiences across Retail, Experience, Design and Innovation. We do this as part of a network of 13,000 specialists across 80 countries, everywhere life intersects with commerce – this is end-to-end. We're committed to building a united global network of creators and thinkers, regardless of their age, race, skin color, religion, disability, sexual orientation, gender identity or expression. With over 12,000 employees across the globe, our influence is made stronger by applying our collective experiences to tell stories that not only sell products and experiences, but also guarantee positive representations of all communities.
Job Summary
What you'll do
📍We are seeking a talented and experienced platform engineer with a focus on observability to join our team. As a platform engineer, you will be responsible for designing, implementing, and maintaining the cloud-based platform that powers our applications and services. You will play a critical role in ensuring that our platform is reliable, scalable, and performant, and that we have the necessary tools and processes in place to monitor and debug issues in real time.
Responsibilities:
📍Design, implement, and maintain our cloud-based platform, ensuring high availability, scalability, and performance
📍Develop and maintain monitoring, alerting, and logging systems to ensure timely detection and resolution of issues
📍Collaborate with cross-functional teams to implement best practices for observability, including instrumentation, tracing, and metrics
📍Build and maintain dashboards and visualizations to provide real-time visibility into the health of our platform and services
📍Continuously improve our observability practices by staying up-to-date with the latest technologies and industry trends
📍Participate in on-call rotation to respond to incidents and ensure system uptime
📍Document and communicate platform architecture, processes, and best practices to internal teams and stakeholders
Requirements:
📍Bachelor's degree in Computer Science, Engineering, or related field
📍5+ years of experience in platform engineering, with a focus on observability
📍Strong experience with cloud infrastructure and technologies, particularly AWS
📍Proficiency in one or more programming languages such as Python, Java, or Go
📍Experience with observability tools such as Prometheus, Grafana, and Elasticsearch
📍Experience with distributed tracing tools such as OpenTelemetry or Jaeger
📍Familiarity with containerization technologies such as Docker and Kubernetes
📍Experience with Crossplane technology for managing infrastructure as code
📍Experience with Backstage for managing internal software development
📍Excellent analytical and problem-solving skills
📍Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams