Feature

RAG Knowledge Base

Ground your AI agent in your actual business knowledge

BPract Agents uses retrieval-augmented generation (RAG) powered by pgvector embeddings to ensure your AI agent gives accurate, contextual answers based on your real business data. Upload documents, crawl your website, or paste text directly. The system chunks, embeds, and indexes your content so the AI retrieves the most relevant passages before generating every response.

Key Benefits

Why RAG Knowledge Base matters for your business.

1

Automatic website crawling ingests your existing pages so the AI knows everything your website says

2

Document upload supports PDF, DOCX, TXT, and CSV files for product manuals, FAQs, and policy documents

3

pgvector semantic search finds contextually relevant content even when visitors use different terminology

4

Chunking and overlap settings are configurable so you control the granularity of retrieval

5

Source attribution shows visitors which documents the AI referenced, building trust and transparency

How RAG Works in BPract Agents

When a visitor asks a question, BPract Agents does not just pass it to the LLM and hope for the best. The system first converts the question into a vector embedding using the same model that indexed your content. It then performs a similarity search across your entire knowledge base using pgvector in PostgreSQL, retrieving the most relevant chunks of content. These chunks are injected into the LLM prompt as context, ensuring the AI response is grounded in your actual business data rather than general training knowledge. This dramatically reduces hallucinations and ensures accuracy.

Multiple Ingestion Methods

  • Website crawler automatically follows links and indexes up to your configured page limit per crawl session
  • File upload handles PDF, DOCX, plain text, and CSV formats with automatic text extraction
  • Direct text input lets you paste FAQs, product specs, or policy documents right in the admin panel
  • Scheduled re-crawling keeps your knowledge base in sync as your website content changes

Embedding and Retrieval Quality

BPract Agents uses state-of-the-art embedding models to convert text into high-dimensional vectors. The pgvector extension in PostgreSQL provides fast approximate nearest-neighbor search, returning the most relevant chunks in milliseconds even across thousands of documents. You can configure chunk size and overlap in the admin panel to balance between retrieval precision and context breadth. Larger chunks preserve more context around key information, while smaller chunks improve precision for specific factual queries.

Frequently Asked Questions

Common questions about RAG Knowledge Base.

What file formats can I upload to the knowledge base?
BPract Agents supports PDF, DOCX, TXT, and CSV file uploads. The system automatically extracts text from these formats and splits them into optimally sized chunks for embedding and retrieval.
How often should I re-crawl my website?
It depends on how frequently your content changes. Most businesses set up a weekly or bi-weekly re-crawl. You can also trigger a manual crawl any time you publish significant new content.
Does the AI ever make up information not in my knowledge base?
RAG dramatically reduces hallucinations by grounding every response in your actual content. You can also configure the system instructions to tell the AI to say it does not know when no relevant content is found, rather than guessing.

Experience RAG Knowledge Base

See how RAG Knowledge Base can transform your website. Start your free trial today.