LLM Battleground

Credits:

Credits Remaining:

Time Until Reset:

Please wait as your Gladiators fight...

(This can take up to 5 minutes.)

You can navigate away from this page. The results will be available once the battle is complete.

You can use the LLM Battleground anonymously, but if you log in, you get more credits and the results of all your past battles will be saved for you to review at any time. This includes the ability to run battles in the background.

Past Battles

About LLM Battleground

LLM Battleground is your ultimate platform for evaluating and comparing large language models (LLMs). Developed by Striker Consulting, this app pits AI models against each other to help you find the best one for your unique needs. Powered by a sophisticated network of AI agents collaborating in real-time, the Battleground ensures you get the most insightful and tailored results possible. There are six agents working together:

The Orchestrator

The charismatic ringmaster who oversees the entire process, ensuring all agents work together seamlessly to deliver the ultimate battle experience.

The Battle Master

The energetic arena official who keeps the battles fair, exciting, and action-packed. This agent dispatches queries to multiple LLMs and gathers the responses. They preside over every clash with flair.

The Archivist

The meticulous librarian who keeps track of all background information like model pricing, using RAG and ensuring no detail is ever lost in the heat of battle.

The Accountant

The serious yet friendly number-cruncher who ensures every calculation is handled perfectly.

The Hacker

The quirky tech wizard who generates the code for the evaluation function. They add a touch of rebellious fun to the mix.

The Evaluator

The high-tech cyberpunk expert who measures, evaluates, and analyzes every move with pinpoint precision.

Service Unavailable

LLM Battleground

LLM Battleground

Enter Your Prompt

Choose Your Gladiators:

LLM Battleground

Prompt Details

Model Responses

Metrics Comparison

Evaluation Function

Past Battles

About LLM Battleground

The Orchestrator

The Battle Master

The Archivist

The Accountant

The Hacker

The Evaluator

Metric