There are two main parts of each AI bot: the persona and the background.

AI Persona

Overview: The AI’s name, appearance, voice, and personality

  • Name - The AI’s name
  • Appearance - Either the AI’s profile picture or live avatar option
  • Voice - The voice model the AI uses (see the drop down menu at the top of each provider’s tab to see price estimates)
  • Personality - The AI’s personality type, style of speaking, confidence, professionalism, assertiveness, etc.

Do not include any background or situational context in the Persona. That belongs in the Background.

AI Background

Definition: A general field that lets you input the AI’s relevant situation, background, pains, needs, etc.

Components: There are only two fields for the AI background:

  • Title - The AI’s role and company
  • Situation/context - Any relevant context to the situation. Common data included here for sales scenarios is:
    • Pain points/Specific needs
    • Budget/income level
    • Decision criteria
    • Decision makers
    • Competition knowledge

🛠️ Configuration Options Overview

Replay offers three core configuration options for delivering agent experiences. Each option serves different goals depending on user preference, performance needs, and visual requirements.

ConfigurationBest ForKey Features
Default VoiceWhen you need a flexible, high-quality voice experience with screen sharing but no video.
  • Choose your own TTS provider
  • Screen sharing compatible
  • Low bandwidth usage
Real-Time APILive, real-time conversations where speed and responsiveness are critical.
  • Lowest latency for instant responses
  • Supports live video feeds (screen/webcam)
  • Fixed provider for optimal speed
Video AvatarEngaging users with a visual, lip-synced avatar for a more personal touch.
  • Animated, talking avatar
  • Stackable with other options
  • Customizable appearance

🔊 1. Default Voice Option

Overview: This is the most flexible option for users who want to deliver a high-quality voice experience without video.

Key Features:

  • Provider Choice: Users can select their preferred TTS provider based on quality, latency, and price.
  • Customizable: Supports a wide range of providers, from fast/premium to slower/more cost-effective.
  • Screen Sharing: Fully compatible with screen sharing for guided walkthroughs and demos.
  • No Video: Does not support real-time webcam video, keeping bandwidth and complexity low.

Best For: When users need spoken responses paired with screen interaction, without the need for a visual avatar or webcam feed.


⚡ 2. Real-Time API Option

Overview: A low-latency, high-speed configuration for real-time experiences that require responsiveness and visual interaction.

Key Features:

  • Instant Response: Prioritizes speed for seamless, natural conversations.
  • Live Video Feed Support: Enables agents to view the user’s screen or webcam in real-time, if permissions are granted.
  • Fixed Provider: Not customizable like the default option—comes with a fixed voice/interaction pipeline.

Best For: Live support scenarios, debugging, or any real-time collaborative sessions where fast back-and-forth is critical.


🧑‍💻 3. Video Avatar Option

Overview: This configuration introduces a visually engaging avatar that lip-syncs with the agent’s speech.

Key Features:

  • Lip-Synced Visuals: Adds an animated, talking avatar to make the experience more immersive and personal.
  • Stackable: Can be added on top of either the Default Voice or Real-Time API configurations.
  • Customizable Appearance: Users can often customize the look and style of the avatar to match brand or tone.

Best For: Use cases where visual presence increases engagement—like product walkthroughs, onboarding, or virtual assistants.


Creating Variety

You can mix and match personas and backgrounds to create diverse training roleplays:

Different Difficulty Levels

  • Easy: “Always say yes”
  • Medium: “Quiet, slightly stubborn, but able to be convinced”
  • Hard: “Be rude, interrupt the user, etc.”

Different Buyer Types

  • Technical buyer: Focuses on specifications and integration
  • Financial buyer: Focuses on ROI and cost savings
  • End-user buyer: Focuses on ease of use and features