AI-native search engine optimizing user queries for web and mobile applications.
Open-source evaluation infrastructure for improving the performance and reliability of LLMs.
Prototype, evaluate, and observe LLM applications with Inductor.
A dataset for modeling information-seeking dialog through question answering in context.
A benchmark for evaluating audio representations across diverse tasks in speech, music, and environmental sound.
A heterogeneous benchmark for evaluating information retrieval models across diverse datasets.
A high-level library for training and evaluating neural networks in PyTorch (see the sketch after this list for the boilerplate such libraries abstract away).
Streamlined evaluation for LLM & RAG models with insights into qualitative metrics.
An orchestration engine for building and deploying LLM applications with a visual debugger.
A collaborative tool for creating, testing, and evaluating AI prompts and chains to boost productivity.
Evaluate LLM-powered applications efficiently with BenchLLM's flexible testing and reporting tools.
Time-series machine learning at scale.
An autonomous LLM agent for solving complex tasks.
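For context on the "high-level library for training and evaluating neural networks in PyTorch" entry above, here is a minimal plain-PyTorch train/eval loop showing the boilerplate such libraries typically abstract away. The model, data, and hyperparameters are illustrative placeholders, not taken from any tool in this list.

# A minimal plain-PyTorch train/eval loop -- the boilerplate that a
# high-level training library typically wraps. Everything here is a
# toy placeholder for illustration.
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

# Toy dataset: 256 random 10-feature samples with binary labels.
X = torch.randn(256, 10)
y = torch.randint(0, 2, (256,))
loader = DataLoader(TensorDataset(X, y), batch_size=32, shuffle=True)

model = nn.Sequential(nn.Linear(10, 16), nn.ReLU(), nn.Linear(16, 2))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(3):
    # Training pass: forward, loss, backward, parameter update.
    model.train()
    for xb, yb in loader:
        optimizer.zero_grad()
        loss = loss_fn(model(xb), yb)
        loss.backward()
        optimizer.step()

    # Evaluation pass: accuracy computed without gradient tracking.
    model.eval()
    correct = 0
    with torch.no_grad():
        for xb, yb in loader:
            correct += (model(xb).argmax(dim=1) == yb).sum().item()
    print(f"epoch {epoch}: accuracy {correct / len(X):.2%}")

High-level libraries collapse the two loops above into a handful of calls (fit/evaluate plus callbacks or handlers), which is the productivity gain these catalog entries advertise.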