The simplest way to find the best AI tools!
A benchmark suite for evaluating large language models focusing on Vietnamese language understanding.
A benchmark for evaluating audio representations across diverse tasks in speech, music, and environmental sound.
A machine learning benchmark focusing on optimizing data selection for model training.
A benchmark for measuring and reducing malicious use of LLMs through unlearning methods.
A Heterogeneous Benchmark for evaluating Information Retrieval models across diverse datasets.
A scalable and extensible federated learning engine and benchmark.
A benchmark for physical reasoning with 2D puzzles.
Cosine's Genie is a cutting-edge AI software engineering model with exceptional coding abilities.