About the company
Kraken, the trusted and secure digital asset exchange, is on a mission to accelerate the adoption of cryptocurrency so that you and the rest of the world can achieve financial freedom and inclusion. Our 2,350+ Krakenites are a world-class team ranging from the crypto-curious to industry experts, united by our desire to discover and unlock the potential of crypto and blockchain technology. As a fully remote company, we already have Krakenites in 70+ countries (speaking 50+ languages). We're one of the most diverse organizations on the planet and this remains key to our values. We continue to lead the industry with new product advancements like Kraken NFT, on- and off-chain staking and instant bitcoin transfers via the Lightning Network.
Job Summary
The opportunity
šContribute to developing AI agents for task automation and orchestration. šAssist in designing systems for intelligent task planning and human-AI collaboration. šSupport the implementation of workflows for efficient task routing across AI models and human operators. šBuild and maintain monitoring systems to track agent performance and workflow efficiency. šAI/ML pipelines for model training, evaluation, and monitoring in production. šCollaborate with SREs and cross-functional teams to ensure optimal AI platform performance. šWork with senior engineers to identify and address AI/ML opportunities for improving scalability and efficiency in operations. šStay informed on AI/ML advancements, contributing to team knowledge and fostering innovation in the crypto industry.
Skills you should HODL
šExperience building, fine-tuning, and deploying ML systems in production , including LLMs and RAG systems. šStrong applied ML fundamentals and proficiency in AI algorithms. šHands-on experience with post-deployment tasks such as monitoring, rollouts, and re-tuning. šProficiency with workflow orchestration tools like Prefect, Airflow, or Argo Workflows. šFamiliarity with deploying web services using container orchestration tools such as Kubernetes (K8s). šExperience with ML lifecycle tools (e.g., MLFlow, Kubeflow) and inference servers like Triton, TGI, or vLLM. šProven ability to build reliable CI/CD pipelines for services and model deployments. šStrong Python programming skills with expertise in software engineering principles to build scalable systems. šStrong problem-solving mindset and effective communication skills for cross-team collaboration.