Agents User Guide

Last updated: September 25, 2025

Conversational agents

Note: Conversational agents are currently available in public preview.

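About UiPath conversational agents

Conversational agents are a new class of UiPath agents designed to hold dynamic, multi-turn, real-time dialogues with users. Unlike autonomous agents, which respond to a single prompt, conversational agents interpret and react to a continuous stream of user messages. They manage conversation context, tool execution, human escalation, and memory to deliver a richer, more adaptive automation experience. Think of them as intelligent digital assistants that understand context and handle ambiguity naturally.

Conversational agents are well suited to scenarios that require:

Ongoing clarification or back-and-forth exchanges
Personalized guidance based on user intent
Seamless fallback to a human when confidence is low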

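Table 1. Key differences from autonomous agents
Feature	Conversational agent	Autonomous agent
Interaction model	Multi-turn, back-and-forth dialogue	Single-turn task execution based on an initial prompt
Primary use case	Real-time user support and assistance, interactive information gathering	Executing a task from a defined prompt
User input	Continuous user chat messages	A single structured prompt
Core strength	Maintaining a conversation and handling ambiguity	Executing a plan across tools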

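When to use conversational agents

Use conversational agents when your automation scenario involves real-time, context-aware interaction. These agents are best suited for:

Self-service experiences for customers or employees, such as helpdesk support or onboarding assistants.
Interactive guidance through multi-step processes, forms, or decision trees.
Contextual conversations in which users ask follow-up questions or provide information incrementally.
Natural language interfaces to applications, systems, or knowledge bases, letting users query information conversationally.

Use autonomous agents instead when the task can be fully described in a single prompt, with all required inputs provided upfront. Ideal examples include:

Structured document processing (for example, extracting data from invoices or contracts)
Automated report generation based on predefined logic
Summarization or transformation tasks with clear, one-shot requirements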

How do conversational agents relate to Autopilot for Everyone?

With several chat experiences available, it's important to know which one to use and when.

Conversational Agents vs. Autopilot for Everyone:

Working together: These two experiences work side-by-side. Conversational agents are not a replacement for Autopilot for Everyone.
Different purposes: Think of Autopilot for Everyone as UiPath's general-purpose agent, optimized for productivity tasks and interacting with the UiPath platform. Conversational agents are specialists that you build for a specific use case (e.g., an HR policy assistant).
Access: You can access your specialized conversational agents directly from within Autopilot for Everyone, making it a central hub for all your conversational needs.

Conversational Agents vs. specialized Autopilots:

Overlap: Both are designed for use-case-specific purposes.
Our recommendation: We recommend building with conversational agents. They provide a much richer and more robust design-time experience for building, testing, and refining your use-case-specific agent.
Key difference: Conversational agents do not currently support file uploads or local desktop automation, whereas specialized Autopilots do.

Licensing

During the public preview, conversational agent executions do not consume Platform or Agent units. However, if the agent uses a paid service, such as a DeepRAG tool, it consumes the necessary units to make the call to that tool.

Official licensing details for conversational agents will become available with the 2025.10 general availability.

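In the meantime, consider the following:

Note: Starting a conversation with a conversational agent triggers an Orchestrator process dedicated to that chat. This process always appears as "Running" so that it can respond to your messages immediately. However, it consumes resources only when you actually send a message. While idle and waiting, it consumes no resources.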

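Best practices

When designing a conversational agent, consider the following best practices:

Start with a clear persona: Define the agent's tone and scope (for example, "You are a friendly HR assistant...").
Design for unpredictability: Users may provide incomplete or incorrect information. Handle ambiguity gracefully.
Guide tool usage: Make sure each tool description clearly states when and how the tool should be used.
Iterate with evaluations: Create test cases for both happy and unhappy paths, and update the agent logic accordingly.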

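Getting started with conversational agents

Building a conversational agent follows a structured lifecycle of design, testing, deployment, and monitoring. The key steps are:

Design the agent: Use Studio Web to define the agent's system prompt, configure the available tools, add Context Grounding, and set up escalation workflows.
Test and evaluate: Use the built-in debug chat to test multi-turn interactions. Add real or simulated conversations to evaluation sets to validate behavior and performance.
Publish and deploy: Publish the agent to Orchestrator as a solution package. Make sure the solution folder includes serverless and unattended robots for execution.
Access and manage: Interact with the agent through Instance Management. Monitor runtime behavior, review trace logs, and iterate based on feedback.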

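Building a conversational agent

You can create conversational agents with the same low-code designer in Studio Web that you use for autonomous agents, with key differences tailored to real-time, multi-turn dialogue.

Creating the agent

To get started:

Go to studio.uipath.com.
Select the "New" button, then select "Agent".
Select the "Conversational agent" type.
(Optional) Describe your agent to Autopilot to generate a starter configuration.

Figure 1. Creating a new conversational agent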

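Configuring the system prompt

The system prompt defines the agent's persona, goals, behavioral constraints, and tool and escalation logic. Use it to instruct the agent on how to:

Greet users.
Handle unknown queries.
Escalate issues or call tools.
Maintain a consistent tone and style.

Tip: Autopilot can generate an effective starting prompt based on your use case.

Conversational agents do not use user prompts or Data Manager inputs and outputs. All input is collected in real time during the conversation.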

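Configuring tools

Tools let the agent take action during a conversation, such as executing automations, running processes, or calling APIs. Supported tool types are RPA workflows, API workflows, activities, and other agents (except conversational agents).

Tip: For workflows that only make API calls, we recommend API workflows for the best performance in real-time chat.

Use tool-level guardrails to enforce runtime policies. Guardrails apply both during testing and at runtime, and are visible in the trace logs. For details, refer to "Guardrails".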

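Configuring contexts

Add Context Grounding indexes to give your agent access to specific knowledge sources. The agent can query these indexes to provide informed, citation-backed responses. For details, refer to "Contexts".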

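Configuring escalations and agent memory

Conversational agents support escalation workflows and agent memory to improve decision-making:

Escalations let the agent hand the conversation over to a human through Action Center when confidence is low or user intent is unclear. The conversation runs synchronously, meaning the agent pauses all further interaction until the escalation is resolved.
Agent memory lets the agent remember and reuse previously resolved escalations, reducing redundancy and improving efficiency.

For details, refer to Escalations and agent memory.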

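Evaluating and testing the agent

Evaluations help ensure that your conversational agent behaves reliably across varied dialogue paths. The process is similar to evaluating autonomous agents, but adapted for conversations.

Use the "Output" panel to simulate real conversations. Select "Test on cloud" to run the agent in a chat-like environment and interact with it in natural language.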

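Viewing real-time execution logs

On the right side of the chat, expand the full execution trace to follow the agent's execution in real time. It shows details such as:

The agent's LLM calls and responses
Tool calls, including arguments and final outputs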

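Adding test cases

You can add test cases directly from the "Output" panel by selecting "Add to evaluation set" after a test run. An evaluation test is then created for the conversation from the following data:

Conversation history: A record of the previous turns in the conversation.
Current user message: The latest message the user sent in the conversation.
Expected agent response.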

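This lets you test the agent's ability to maintain context and handle follow-up questions, which is essential to a good conversational experience.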
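As a rough illustration of how those three pieces of data fit together, the sketch below models one evaluation test in Python. It is not UiPath's storage format: the `Turn` and `ConversationEvalCase` classes and the `from_transcript` helper are hypothetical names introduced only to show how a finished chat splits into history, current user message, and expected response.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """One message in a conversation (hypothetical shape, not a UiPath type)."""
    role: str  # "user" or "agent"
    text: str


@dataclass
class ConversationEvalCase:
    """The three inputs an evaluation test is built from."""
    history: list[Turn]          # earlier turns in the conversation
    current_user_message: str    # latest message sent by the user
    expected_response: str       # what the agent is expected to answer


def from_transcript(turns: list[Turn], expected_response: str) -> ConversationEvalCase:
    """Split a transcript so the final user message becomes the current message."""
    if not turns or turns[-1].role != "user":
        raise ValueError("The transcript must end with a user message.")
    return ConversationEvalCase(
        history=list(turns[:-1]),
        current_user_message=turns[-1].text,
        expected_response=expected_response,
    )


# Example: a follow-up question that only makes sense with the history.
transcript = [
    Turn("user", "How many vacation days do I have left?"),
    Turn("agent", "You have 12 days remaining this year."),
    Turn("user", "Can I carry them over to next year?"),
]
case = from_transcript(transcript, "Explains the carry-over policy.")
```

Keeping the history separate from the current message is what makes a follow-up such as "Can I carry them over?" testable: the expected response can only be judged against the turns that came before it.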

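Figure 2. Creating an evaluation set

For each evaluation test, you can edit the conversation history and the current user prompt in the conversation builder interface. A similar interface lets you define and refine the expected agent response so that the test validates behavior accurately.

Figure 3. The conversation builder window that appears when you edit an evaluation test

Figure 4. Importing a conversation from the debug chat by selecting "Add to evaluation set"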

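Accessing conversational agents

After you publish and deploy a conversational agent, you can interact with it through Instance Management in the "Agents" section of Automation Cloud.

Figure 5. Agent instance management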

Embedding conversational agents in UiPath Apps

You can also embed a conversational agent directly into a UiPath App using the IFrame component.

Create and publish: First, ensure your conversational agent has been created and published.
Add IFrame: Open your App in Studio and add an IFrame component to your page.
Configure the URL: Set the IFrame's Source property to a URL constructed with the following format and parameters: "https://<cloud_env>.uipath.com/<organization>/<tenant>/autopilotforeveryone_/conversational-agents/?agentId=<agent_id>&mode=embedded&title=<title>&welcomeTitle=<welcome_title>&welcomeDescription=<welcome_description>&suggestions=<suggestions>"
See the following table for details.
Publish app: Publish your App. The agent is now embedded and ready to use.

Table 2. URL parameters
Parameter	Required	Description
`agentId`	Yes	The Release ID of the published agent. To find it, navigate to Agents > Conversational agents, click "Chat now" on your agent, and copy the ID from the URL.
`mode`	No	Set to `embedded` to optimize as a right-rail experience within the IFrame. Set to `fullscreen` to optimize as a full-screen experience within the IFrame. The default is `fullscreen`.
`title`	No	The title displayed in the chat component's header. Defaults to the agent's name.
`welcomeTitle`	No	A title for the first-run welcome screen. Defaults to an empty string.
`welcomeDescription`	No	A description for the first-run welcome screen. Defaults to an empty string.
`suggestions`	No	An array of first-run suggested prompts for the user. Defaults to an empty array `[]`. Note: To test directly in a browser, wrap each suggestion in one pair of double quotes, for example `["Hi, what can you do", "Hello, how are you"]`. When embedding in Apps, double every double quote to escape it, for example `[""Hi, what can you do""]`; otherwise UiPath Apps throws a validation error.
`showHistory`	No	A boolean (`true` or `false`) to control the visibility of the chat history panel. Defaults to `true`.
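The URL format and parameters above can be assembled programmatically. The following Python sketch is one way to do it, under stated assumptions: the function names `build_embed_url` and `apps_suggestions_literal` are hypothetical, the sample values (`cloud`, `my-org`, `my-tenant`) are placeholders, and percent-encoding the query string is standard URL practice rather than something this guide specifies.

```python
import json
from urllib.parse import quote, urlencode


def build_embed_url(cloud_env, organization, tenant, agent_id, *, mode="embedded",
                    title=None, welcome_title=None, welcome_description=None,
                    suggestions=None, show_history=None):
    """Assemble the IFrame Source URL from the documented parameters."""
    base = (f"https://{cloud_env}.uipath.com/{organization}/{tenant}"
            "/autopilotforeveryone_/conversational-agents/")
    params = {"agentId": agent_id, "mode": mode}
    optional = {
        "title": title,
        "welcomeTitle": welcome_title,
        "welcomeDescription": welcome_description,
    }
    # Only send optional parameters that were explicitly provided.
    params.update({k: v for k, v in optional.items() if v is not None})
    if suggestions:
        # The suggestions parameter is an array of strings, serialized as JSON.
        params["suggestions"] = json.dumps(suggestions)
    if show_history is not None:
        params["showHistory"] = "true" if show_history else "false"
    # Assumption: percent-encode values so spaces and quotes survive in the URL.
    return f"{base}?{urlencode(params, quote_via=quote)}"


def apps_suggestions_literal(suggestions):
    """Double every double quote, as the Apps note for `suggestions` requires."""
    return json.dumps(suggestions).replace('"', '""')


url = build_embed_url("cloud", "my-org", "my-tenant", "<agent_id>",
                      title="HR assistant",
                      suggestions=["Hi, what can you do"])
print(url)
print(apps_suggestions_literal(["Hi, what can you do"]))  # [""Hi, what can you do""]
```

The second helper covers the case where the unencoded array is typed directly into a UiPath Apps string expression, which is where the doubled quotes matter. Whether a fully percent-encoded URL avoids that escaping inside Apps is not confirmed here, so test both forms in your own App.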

Limitations

Conversational agents are currently in public preview. We are actively working on adding new capabilities. Please be aware of the following limitations.

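Feature	Description
User prompt	No user prompt is required. These agents do not rely on a predefined prompt to collect input. Instead, they receive messages in real time and respond to each one in turn.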
Data manager	Data Manager is currently disabled. Since outputs are emitted dynamically by the conversational agent throughout the conversation, there is no need to configure output arguments. The ability to configure inputs, which would be high-level parameters to initialize a conversation, will be available in a future release.
File uploads	You cannot upload files (e.g., PDFs, images) to the agent during a conversation. The ability to upload files will be available in a future release.
Local desktop automation	The agent cannot run automations that interact with your local desktop (e.g., via Assistant).
Personal connections	Tools cannot be run using your personal Integration Service connections. Only Shared connections can be used currently.
Tool confirmation	The agent does not ask for confirmation before executing a tool.
Voice interaction	You can only interact with the agent via text commands. Push-to-talk and two-way voice interaction will be available in preview in a future release.
Agent health score	The Agent Score feature for performance evaluation is not yet available.
Instance management	Advanced observability features for monitoring agent performance are not yet available.
User feedback	You cannot provide feedback (e.g., thumbs up/down) on agent responses.
SDKs	Headless and UI SDKs for embedding agents into third-party external applications are not yet available.
Third-party integrations	Conversational agent access through surfaces like Slack, Microsoft Teams, or MSFT Copilot is not yet available.
Licensing	Official licensing details will be finalized for the general availability release.