Evaluation Tools

The simplest way to find the best AI tools!

Free
Freemium
Web
Android
iOS
Featured
New
Verified
Hiring
NSFW
Objective, Inc favicon
Objective, Inc
Freemium
Web
search api

AI-native search engine optimizing user queries for web and mobile applications.

Confident AI favicon
Confident AI
Freemium
Web
evaluation

Open source evaluation infrastructure for LLMs to enhance their performance and reliability.

Inductor favicon
Inductor
Freemium
Web
llm evaluation

Prototype, evaluate, and observe LLM applications with Inductor.

QuAC favicon
QuAC
Web
question answering

Dataset for modeling information seeking dialog through question answering in context.

HEAR Benchmark favicon
HEAR Benchmark
Web
audio evaluation

A benchmark for evaluating audio representations across diverse tasks in speech, music, and environmental sound.

BEIR favicon
BEIR
Free
Web
model evaluation

A Heterogeneous Benchmark for evaluating Information Retrieval models across diverse datasets.

PyTorch-Ignite favicon
PyTorch-Ignite
Web
neural training

A high-level library for training and evaluating neural networks in PyTorch.

Algomax favicon
Algomax
Free
Web
model evaluation

Streamlined evaluation for LLM & RAG models with insights into qualitative metrics.

Laminar favicon
Laminar
Freemium
Web
app development

An orchestration engine for building and deploying LLM applications with a visual debugger.

Prompt Mixer favicon
Prompt Mixer
Freemium
Web
prompt engineering

A collaborative tool for creating, testing, and evaluating AI prompts and chains for enhanced productivity.

BenchLLM favicon
BenchLLM
Web
evaluate ai

Evaluate LLM-powered applications efficiently with BenchLLM's flexible testing and reporting tools.

functime favicon
functime
Web
forecasting

Time-series machine learning at scale.

XAgent favicon
XAgent
Web
task solving

An autonomous LLM agent for complex task solving.