AI Evaluation Tools

The simplest way to find the best AI tools!

Openlayer
Web
ai monitoring

An automated AI evaluation and monitoring platform for building, testing, and deploying AI systems.

Samba1 Turbo
Web
ai evaluation

Samba1 Turbo lets developers evaluate expert models through inference services.

Vectorview
Web
ai evaluation

Custom evaluations of AI capabilities for safety, risk, and performance benchmarking.

Vocalize
Web
speech evaluation

An AI evaluation suite focused on enhancing human-computer conversations through speech recognition tests.

RagaAI
Freemium
Web
ai evaluation

Automated platform for evaluating and optimizing multi-agent AI applications.

Elevora
Web
candidate screening

AI tool for conducting automated candidate screening interviews quickly and efficiently.

Beauty.AI
Web
photo evaluation

An international beauty contest evaluated by artificial intelligence.

Patronus AI
Web
model evaluation

Innovative tool for evaluating LLMs and detecting AI-generated risks in real-world scenarios.

EvalAI
Web
evaluate ai

An open-source platform for evaluating and comparing ML and AI algorithms through hosted challenges.

Scale AI
Web
data management

Accelerate the development of AI applications with comprehensive data solutions.

EvalsOne
Web
evaluate applications

A comprehensive evaluation platform for optimizing generative AI applications.

HoneyHive
Freemium
Web
evaluation and monitoring
Hiring (3 jobs)

An AI evaluation and observability platform that helps developers ensure reliable AI products.

Flow AI
Web
ai evaluation

Advanced evaluation tools for LLM applications to enhance AI product performance.

Spellforge
Web
quality evaluation

AI quality gatekeeper for existing release pipelines with dynamic testing of Custom GPTs.