Create New Account
Sign up to continue searching for suitable jobs in Web 3.0

OR
Terms of Use
Already have an account?

Log In to Your Account
Log in to continue searching for suitable jobs in Web 3.0

OR
Don’t have an account?
Binance
AI Evaluation Specialist
at Binance
about 18 hours ago | 34 views | Be the first one to apply

AI Evaluation Specialist

Full-time
Hong Kong, Asia

About the company

The Binance Exchange is a leading cryptocurrency exchange founded in 2017 in Hong Kong. It features a strong focus on altcoin trading. Binance offers crypto-to-crypto trading in more than 600 cryptocurrencies and virtual tokens, including Bitcoin (BTC), Ether (ETH), Litecoin (LTC), Dogecoin (DOGE), and its own token Binance Coin (BNB).

Job Summary

Responsibilities:

📍Participate in the entire software development lifecycle, encompassing all stages from requirements analysis to test planning, execution, defect tracking, through to product release and maintenance. 📍Go to person in relation to A.I Agents evaluation and continuously monitoring. 📍Create comprehensive and effective test strategies and hands-on testing to ensure the accuracy, reliability, and performance of AI and data applications . 📍Root cause analysis of test failures and product issues in an effective manner, and drive optimization for future enhancements. 📍Design and develop internal tools leveraging AI technology to improve engineering and testing work efficiency.

Requirements:

📍Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Data Science, or a related field. 📍Strong understanding of Large Language Models (LLMs), autonomous AI agents, and their system architectures. 📍Experience with AI evaluation methodologies, including offline benchmarking, online monitoring, and hybrid human-AI evaluation approaches. 📍Familiarity with software engineering best practices such as Test-Driven Development (TDD), Behavior-Driven Development (BDD), and their limitations in AI contexts. 📍Proficiency in designing adaptive, lifecycle-spanning evaluation frameworks that incorporate both quantitative and qualitative metrics. 📍Experience with evaluation tools and frameworks (e.g., Opik,LangSmith) is a plus.

The crypto industry is evolving rapidly, offering new opportunities in blockchain, web3, and remote crypto roles — don’t miss your chance to be part of it.

Salaries for similar jobs:

Similar jobs

3 days ago | 68 views | Be the first one to apply
Full-time
China
8 days ago | 58 views | Be the first one to apply
Full-time
Europe
$85,000 To $120,000 per year
10 days ago | 62 views | 1 applications
Full-time
United States, North America
$160,000 To $210,000 per year
11 days ago | 54 views | Be the first one to apply
Full-time
Germany, Europe
11 days ago | 50 views | Be the first one to apply
Full-time
Asia