XBot
  • Overview
    • Introduction
    • How xBot Works
    • Target Audience
    • Key Benefits of xBot
    • Core Concepts of xBot
  • Quick Start
    • Quick Setup
    • Getting Started
      • Zalo Channel
      • Azure Bot Framework
      • FaceBook Channel
      • Team Channel
      • Webchat Channel
      • Email Channel
    • Basic Configuration
    • First AI Flow Setup
    • Initial Testing and Go Live
  • Features
    • Using xBot to Handle End-User Queries
    • Communication Channels
      • Zalo OA
      • Facebook
      • Teams
      • WebChat
      • Email
    • Understanding the Message Handling Flow
    • Understanding AI Bots in xBot
    • Configuring Dispatch Rules in xBot
    • User Functions and Permissions
      • Custom Roles and Permissions
      • Auditing and Monitoring User Activities
    • Cross-Platform Message Type Compatibility
    • AI Flow
      • Core Concepts
      • AI Services
        • Knowledge Base Agent
        • AI Agent
        • AI Proxy Agent
      • Knowledge Base
      • Functions
      • Evaluation Metrics
        • Essential Information
        • Basic Metrics
        • Extra Metrics
  • Integration Guide
    • Integrates with multiple channels
      • API reference
        • Webhook
          • ZaloPushToXBot
          • AzbotPushToXBot
        • Webchat
          • InitForClient
  • References
    • Industry-Specific Use Cases
      • Media and Entertainment
      • Wholesale
      • Transportation and Logistics
      • Manufacturing
      • Energy and Utilities
      • Real Estate
      • Agriculture
      • Travel and Hospitality
      • Healthcare and Wellness
      • Retail and E-Commerce
      • Public Administration
      • Legal
      • Training
      • Education
      • xBot Use Case: Insurance
      • Securities -Use Case
      • Banking - Use Case
      • xBot Use Case: Finance
Powered by GitBook
On this page
  • Step 1 - Gathering Essential Information:
  • Step 2 - Statistical Evaluation:
  1. Features
  2. AI Flow

Evaluation Metrics

Creating a fundamental LLM application may be not too complicated, the challenge lies in its ongoing maintenance and continuous enhancement. This features was created to provide an overview of performance in whole LLM build-in system.

Based on reliable sources, valuable research topics, etc., this approach suggests crucial metrics over time, providing effective insights in order to evaluate system performance.

In general, there are two main steps that an evaluator needs to take to assess whether the use of AI is feasible.

Step 1 - Gathering Essential Information:

Relying on the metrics used for evaluation, the key information have to collect before processing next step.

As a best practice, all information used in the flow should be collected and stored.

Step 2 - Statistical Evaluation:

Each metric needs a different group metadata, depending on how much metadata we have, the corresponding metrics will be generated.

PreviousFunctionsNextEssential Information

Last updated 9 months ago