Getting Started

Set up PromptForward and run your first prompt test in minutes.

Key Concepts

Before diving in, let's understand the core components of PromptForward:

Prompts

Prompts are the instructions you give to AI models. In PromptForward, prompts include system messages, user messages, and variables. Each time you save changes, a new version is created, allowing you to track and compare improvements over time.
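The `{{variable_name}}` substitution described here can be sketched in a few lines. This is an illustrative implementation, not PromptForward's actual rendering code; the function name and template are made up for the example.

```python
import re

def render_prompt(template: str, variables: dict) -> str:
    """Substitute {{variable_name}} placeholders with provided values."""
    def replace(match):
        name = match.group(1)
        if name not in variables:
            raise KeyError(f"missing variable: {name}")
        return str(variables[name])
    return re.sub(r"\{\{\s*(\w+)\s*\}\}", replace, template)

system_message = "You are a support agent for {{product}}. Answer in {{language}}."
print(render_prompt(system_message, {"product": "PromptForward", "language": "English"}))
# → You are a support agent for PromptForward. Answer in English.
```

Raising on a missing variable (rather than leaving the placeholder in place) makes template typos fail loudly at test time instead of silently reaching the model.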

LLM Providers

LLM Providers are your connections to AI services like OpenAI, Anthropic, Groq, or AWS Bedrock. You configure providers with your API credentials so PromptForward can execute prompts using various models.

Datasets

Datasets are collections of test cases. Each row contains input data (like user questions or chat messages) and optionally expected outputs. Datasets let you systematically test your prompts against many scenarios at once.
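A dataset is essentially tabular data of this shape. The sketch below shows a hypothetical two-row dataset as CSV, the upload format mentioned in the Quick Start; the column names `input` and `expected_output` are illustrative, not a required schema.

```python
import csv
import io

# A hypothetical dataset: each row pairs input data with an optional expected output.
raw = """input,expected_output
"What is your refund policy?","Refunds are available within 30 days."
"How do I reset my password?","Use the 'Forgot password' link on the login page."
"""

rows = list(csv.DictReader(io.StringIO(raw)))
print(len(rows))         # 2 test cases
print(rows[0]["input"])  # What is your refund policy?
```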

Evaluators

Evaluators automatically score AI responses. System evaluators use exact matching, contains matching, or regex. Judge LLM evaluators use another AI model to assess quality based on custom criteria like tone, accuracy, or completeness.
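The three system-evaluator modes can be sketched as follows. Function name and signature are illustrative, not PromptForward's actual evaluator API.

```python
import re

def evaluate(response: str, expected: str, mode: str = "exact") -> bool:
    """Score a response against an expectation using one of three matching modes."""
    if mode == "exact":
        return response.strip() == expected.strip()
    if mode == "contains":
        return expected.lower() in response.lower()
    if mode == "regex":
        return re.search(expected, response) is not None
    raise ValueError(f"unknown mode: {mode}")

print(evaluate("Refunds take 30 days.", "30 days", mode="contains"))  # True
print(evaluate("Order #12345 shipped", r"#\d{5}", mode="regex"))      # True
```

Judge LLM evaluators replace this deterministic check with another model call that grades the response against written criteria, so they are better suited to fuzzy qualities like tone or completeness.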

Test Runs

Test Runs execute your prompt against every item in a dataset, applying evaluators to each response. The detailed results show success rates and individual scores, helping you identify where a prompt needs improvement.
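Conceptually, a test run is a loop: execute the prompt for every dataset row, score each response, and aggregate. The sketch below is a simplified model of that flow, with `call_model` standing in for a real provider call; none of these names come from PromptForward itself.

```python
# `call_model` is a stub standing in for an actual LLM provider call.
def call_model(prompt: str) -> str:
    return "Refunds are available within 30 days."  # canned response for the sketch

def run_test(dataset, evaluator):
    """Run every dataset row through the model and return the success rate."""
    results = [evaluator(call_model(row["input"]), row["expected"]) for row in dataset]
    return sum(results) / len(results)

dataset = [
    {"input": "What is your refund policy?", "expected": "30 days"},
    {"input": "How long do refunds take?", "expected": "30 days"},
    {"input": "Do you ship overseas?", "expected": "worldwide"},
]
rate = run_test(dataset, lambda resp, exp: exp in resp)
print(f"success rate: {rate:.0%}")  # success rate: 67%
```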

API Keys

API Keys enable your applications to access PromptForward prompts programmatically. You can set granular permissions to control which prompts and providers each key can access.

Quick Start

1. Configure an LLM Provider

Before testing prompts, connect to an AI service:

  1. Navigate to LLM Providers in the sidebar
  2. Click + New Provider
  3. Enter a name (e.g., "Production OpenAI")
  4. Select your provider (OpenAI, Anthropic, Groq, or AWS Bedrock)
  5. Enter your API credentials
  6. Click Test Connection to verify
  7. Click Create Provider
[Screenshot: LLM Provider creation modal]

2. Create Your First Prompt

  1. Navigate to Prompts
  2. Click + New Prompt
  3. Enter a name and description
  4. Click Create Prompt
  5. Go to the Playground tab
  6. Select your provider and model
  7. Edit the system message with your instructions
  8. Use {{variable_name}} for dynamic inputs
  9. Click Run to test
  10. Click Save to create version 1
[Screenshot: Prompt playground with test output]

3. Create a Dataset

  1. Navigate to Datasets
  2. Click + New Dataset
  3. Choose to use an example, upload a CSV, or start empty
  4. Enter a dataset name
  5. Click Create Dataset
[Screenshot: Dataset creation modal with example preview]

4. Run a Test

  1. Navigate to Test Runs
  2. Click + New Test
  3. Select your provider, model, prompt, and dataset
  4. Add evaluators to assess responses
  5. Toggle Quick Run for a 20-item test
  6. Click Run Test
[Screenshot: Test run results detail page]

5. Integrate with API

  1. Set your prompt version as Live in the Versions tab
  2. Navigate to API Keys
  3. Click + New API Key
  4. Add permissions for your prompt and provider
  5. Copy the API key and use it in your application
[Screenshot: API key detail page with permissions]
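The steps above can be sketched from the application side. The endpoint URL and payload shape below are assumptions for illustration (check the API reference for the real ones); the point is the key-handling pattern: read the key from the environment rather than hard-coding it, and send it as a bearer token.

```python
import json
import os
import urllib.request

# Hypothetical key name and endpoint -- replace with your actual values.
API_KEY = os.environ.get("PROMPTFORWARD_API_KEY", "pf_example_key")

req = urllib.request.Request(
    "https://example.com/api/v1/prompts/my-prompt/execute",  # hypothetical URL
    data=json.dumps({"variables": {"product": "PromptForward"}}).encode(),
    headers={
        "Authorization": f"Bearer {API_KEY}",
        "Content-Type": "application/json",
    },
    method="POST",
)
# The request is built but not sent here; urllib normalizes header casing.
print(req.get_header("Content-type"))  # application/json
```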

Next Steps

Explore the feature documentation to learn more: