Overview

Preclinical can test any OpenAI-compatible chat completions API. This includes:
  • OpenAI’s official API
  • Azure OpenAI
  • Local models (vLLM, Ollama, etc.)
  • Any API following the OpenAI chat completions format

Configuration

interface OpenAIConfig {
  provider: "openai";
  config: {
    api_key: string;           // API key for authentication
    base_url: string;          // API base URL
    model: string;             // Model to use

    // Optional
    temperature?: number;      // Default: 0.7
    max_tokens?: number;       // Default: 1024
    system_prompt?: string;    // Override agent's system prompt
    timeout_ms?: number;       // Default: 60000
  };
}

Setup Steps

Step 1: Identify Your Endpoint

Determine your API endpoint:
Provider       Base URL
OpenAI         https://api.openai.com/v1
Azure OpenAI   https://{resource}.openai.azure.com/openai/deployments/{deployment}
Local/Custom   Your server URL
Step 2: Get API Key

Obtain an API key from your provider.
Step 3: Add Integration

Add the integration with your endpoint details:

{
  "name": "My OpenAI Agent",
  "provider": "openai",
  "config": {
    "api_key": "sk-xxxxx",
    "base_url": "https://api.openai.com/v1",
    "model": "gpt-4o"
  }
}

Provider Examples

OpenAI:
{
  "provider": "openai",
  "config": {
    "api_key": "sk-xxxxx",
    "base_url": "https://api.openai.com/v1",
    "model": "gpt-4o"
  }
}
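
Azure OpenAI and local servers use the same shape; only the URL and model change. The resource, deployment, and model names below are placeholders for your own values (Ollama's OpenAI-compatible endpoint listens on port 11434 by default and accepts any placeholder API key).

Azure OpenAI:

{
  "provider": "openai",
  "config": {
    "api_key": "your-azure-key",
    "base_url": "https://{resource}.openai.azure.com/openai/deployments/{deployment}",
    "model": "{deployment}"
  }
}

Local (Ollama):

{
  "provider": "openai",
  "config": {
    "api_key": "ollama",
    "base_url": "http://localhost:11434/v1",
    "model": "llama3"
  }
}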

How It Works

┌─────────────────┐     ┌─────────────────┐     ┌─────────────────┐
│   Preclinical   │────▶│   /chat/        │────▶│    Your Model   │
│   (Pen Tester)  │◀────│   completions   │◀────│                 │
└─────────────────┘     └─────────────────┘     └─────────────────┘
  1. Preclinical sends a chat completion request
  2. Your endpoint processes the request
  3. Response is captured
  4. Process repeats for configured turns
  5. Full conversation is graded
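
The loop above can be sketched in Python. This is illustrative only: `call_model` and `run_conversation` are hypothetical names, the attacker prompts and grading are Preclinical internals, and a real `call_model` would POST the OpenAI-format payload to `{base_url}/chat/completions`.

```python
def call_model(messages):
    """Stand-in for a POST to {base_url}/chat/completions.
    A real implementation would send the payload and return
    choices[0].message.content from the response."""
    return f"[model reply to: {messages[-1]['content']}]"

def run_conversation(system_prompt, attacker_turns):
    messages = [{"role": "system", "content": system_prompt}]
    for turn in attacker_turns:
        messages.append({"role": "user", "content": turn})        # 1. send request
        reply = call_model(messages)                              # 2-3. endpoint responds, reply captured
        messages.append({"role": "assistant", "content": reply})  # 4. repeat for each configured turn
    return messages                                               # 5. full transcript is graded

transcript = run_conversation(
    "You are a healthcare assistant.",
    ["I have chest pain...", "It's getting worse..."],
)
print(len(transcript))  # system + 2 user + 2 assistant = 5
```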

API Format

Preclinical uses the standard OpenAI chat completions format:
{
  "model": "gpt-4o",
  "messages": [
    {"role": "system", "content": "You are a healthcare assistant..."},
    {"role": "user", "content": "I have chest pain..."},
    {"role": "assistant", "content": "I understand..."},
    {"role": "user", "content": "It's getting worse..."}
  ],
  "temperature": 0.7,
  "max_tokens": 1024
}
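
In a non-streaming chat completions response, the assistant's reply lives at `choices[0].message.content`. A minimal sketch of reading it (the response values below are made up; only the structure matters):

```python
import json

# Illustrative response in the standard chat completions shape.
raw = json.dumps({
    "id": "chatcmpl-123",
    "object": "chat.completion",
    "model": "gpt-4o",
    "choices": [{
        "index": 0,
        "message": {"role": "assistant", "content": "I understand..."},
        "finish_reason": "stop",
    }],
})

response = json.loads(raw)
reply = response["choices"][0]["message"]["content"]
print(reply)  # I understand...
```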

Features

Universal Compatibility

Works with any OpenAI-compatible endpoint

System Prompt Override

Optionally override the model’s system prompt

Configurable Parameters

Control temperature, max tokens, and more

Streaming Support

Handles streaming and non-streaming responses

Testing Custom Agents

If you have a chat-based healthcare agent, you can test it by:
  1. Exposing it via an OpenAI-compatible endpoint
  2. Configuring the system prompt to match your agent’s personality
  3. Running Preclinical tests against it
{
  "provider": "openai",
  "config": {
    "api_key": "your-api-key",
    "base_url": "https://your-agent-api.com/v1",
    "model": "healthcare-assistant-v2",
    "system_prompt": "You are a medical triage assistant..."
  }
}
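
A minimal sketch of step 1: a handler that turns an OpenAI-format request body into an OpenAI-format response. `triage_agent` is a hypothetical stand-in for your own agent logic; in practice you would serve this handler at POST /v1/chat/completions in your web framework of choice.

```python
import time
import uuid

def triage_agent(messages):
    """Hypothetical agent logic -- replace with your own."""
    last_user = next(m["content"] for m in reversed(messages) if m["role"] == "user")
    return f"Triage response to: {last_user}"

def chat_completions_handler(body):
    """Map an OpenAI-format request body to an OpenAI-format response,
    so Preclinical can talk to the agent like any other endpoint."""
    reply = triage_agent(body["messages"])
    return {
        "id": f"chatcmpl-{uuid.uuid4().hex[:12]}",
        "object": "chat.completion",
        "created": int(time.time()),
        "model": body.get("model", "healthcare-assistant-v2"),
        "choices": [{
            "index": 0,
            "message": {"role": "assistant", "content": reply},
            "finish_reason": "stop",
        }],
    }

resp = chat_completions_handler({
    "model": "healthcare-assistant-v2",
    "messages": [{"role": "user", "content": "I have chest pain..."}],
})
```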

Error Handling

Common Errors

Error              Cause                    Resolution
401 Unauthorized   Invalid API key          Check API key
404 Not Found      Invalid model/endpoint   Verify base URL and model name
429 Rate Limit     Too many requests        Automatic retry with backoff
500 Server Error   Provider error           Automatic retry

Rate Limits

Preclinical handles rate limits automatically:
Request → 429 → Wait (Retry-After or exponential backoff) → Retry
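
The retry behavior can be sketched as follows. Function names, retry counts, and delays are illustrative assumptions, not Preclinical's actual internals:

```python
import time

class RateLimitError(Exception):
    """Stand-in for a 429 response; retry_after mirrors the Retry-After header."""
    def __init__(self, retry_after=None):
        self.retry_after = retry_after

def request_with_retry(send, max_retries=5, base_delay=1.0):
    """Retry on 429, honoring Retry-After when present,
    otherwise exponential backoff (1s, 2s, 4s, ...)."""
    for attempt in range(max_retries):
        try:
            return send()
        except RateLimitError as e:
            delay = e.retry_after if e.retry_after is not None else base_delay * (2 ** attempt)
            time.sleep(delay)
    return send()  # final attempt; let any error propagate

# Fake endpoint that rate-limits twice, then succeeds.
calls = {"n": 0}
def flaky_send():
    calls["n"] += 1
    if calls["n"] < 3:
        raise RateLimitError(retry_after=0)
    return "ok"

result = request_with_retry(flaky_send)
print(result)  # ok
```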

Troubleshooting

Connection issues:
  • Verify the base URL is correct and accessible
  • Check for trailing slashes (the base URL should not end with one)
  • Ensure the server is running (for local models)

Model errors:
  • Verify the model name matches exactly
  • For Azure, use the deployment name as the model
  • Check that the model is available on your tier

Authentication errors:
  • Verify the API key is correct
  • Check that the key has the required permissions
  • For Azure, ensure the key is for the correct resource