- Getting started
- Data security and compliance
- Organizations
- Authentication and security
- Licensing
- Tenants and services
- Accounts and roles
- AI Trust Layer
- External applications
- Notifications
- Logging
- Troubleshooting
- Migrating to Automation Cloud™
About Context Grounding
Context Grounding is a component of the UiPath AI Trust Layer which allows you to bring in your data to generate more accurate, reliable GenAI predictions. Context Grounding is designed to make your business data LLM-ready without the need for any additional subscription to embedding models, vector databases, or large language models (LLMs). You can create representative indexes and embeddings of business data that UiPath GenAI features can reference for contextual evidence at runtime.
Context Grounding is a tenant-scoped platform service designed to support UiPath GenAI experiences (such as GenAI Activities and Autopilot for everyone) by grounding your prompts with relevant information before they are executed by the LLM via retrieval augmented generation (RAG).
Providing RAG as a service to UiPath GenAI experiences helps to:
- Overcome LLM context window limitations: for both small and large models, RAG helps improve accuracy, reliability, scalability, and efficiency of models as they interact with knowledge bases.
- Reduce risk of hallucination through references to ground truth data stores.
- Give generative apps access to specialized and proprietary knowledge sources.
- Give generative apps access to up-to-date sources of information.
- Enable positive feedback loops between data stores and user-queries.
The terminology and core components of Context Grounding include:
Ingestion and indexing: make your business data LLM-ready
- Ingestion: Convert business data into representative embeddings using UiPath-managed embedding models.
- Embedding: A representation of business data than an LLM can understand and search through.
- Index: A folder in a vector database that organizes the embeddings.
- Vector DBs: UiPath-managed vector database that stores embeddings organized in indexes.
Retrieval
- Search through LLM-ready business data to find the most relevant information. Context Grounding uses a variety of extraction, chunking, retrieval, and re-ranking techniques that are optimized based on different data formats and queries.
- Interpret prompt as a query to search through embeddings, and produce the most relevant results based on cosine similarity search. These search results are an intermediate, precursor step to RAG, to augment prompts with relevant context from business data.
Retrieval Augmented Generation
- Ground and update prompts with the most relevant information from the semantic similarity search results, then execute a generation via an LLM hosted through the LLM Gateway of the AI Trust Layer.
Here are some of the key features of Context Grounding:
- Multi-document support: PDF, JSON, CSV, XLS, DOCX, TXT files.
- Managed ingestion and indexing pipelines: UiPath optimizes the ingestion and indexing of data in UiPath-managed vector databases.
- Multiple surfaces: Context Grounding is currently available as part of the UiPath GenAI Activities, AI Trust Layer (with a dedicated UI), and Autopilot for everyone.
- Data retrieval: Query within documents or across datasets using a variety of techniques (e.g. query transformation, embedding, fine tuning, etc.) to ensure search results are highly relevant.
- Retrieval Augmented Generation: Ground prompts via just-in-time (JIT) in-memory or over a knowledge base.
- Proof of knowledge: Provides a citation of the reference source and text from the semantic similarity search.
- Streaming support: Streaming API support to show generation as it is being produced.
- Multilingual support: Ability to ingest and query from documents in all UTF-8 encoded languages.
- Support for multiple data
sources:
- UiPath Orchestrator bucket entities: You can ingest, index, and query data stored within shared folders in Orchestrator bucket entities.
- Document storage systems: Through Integration Service connectors, such as Microsoft OneDrive & SharePoint and Google Drive: Context Grounding can access data directly stored in third-party applications.
- Context Grounding currently supports specific file types: PDF, JSON, CSV, XLS, DOCX, TXT.
- There is a limit of ten indexes per tenant. We recommend you keep a 1-1 relationship with these and the folder path in the data source you want to use.
- Context Grounding respects folder permissions and authorization for shared folder entities. Users who do not have the appropriate permissions may not be able to see, update, delete, or use indexes that are affiliated with folders they do not have permissions to.
- To use Context Grounding through UiPath GenAI Activities, you must use Studio Web or Studio Desktop version 2024.4 or newer. For more information, see the Getting started section.