RAG Knowledge Base
Ground your AI agent in your actual business knowledge
BPract Agents uses retrieval-augmented generation (RAG) powered by pgvector embeddings to ensure your AI agent gives accurate, contextual answers based on your real business data. Upload documents, crawl your website, or paste text directly. The system chunks, embeds, and indexes your content so the AI retrieves the most relevant passages before generating every response.
A diagram showing documents and web pages being chunked, embedded into pgvector, and retrieved by the AI agent to generate accurate responses.
Key Benefits
Why RAG Knowledge Base matters for your business.
Automatic website crawling ingests your existing pages so the AI knows everything your website says
Document upload supports PDF, DOCX, TXT, and CSV files for product manuals, FAQs, and policy documents
pgvector semantic search finds contextually relevant content even when visitors use different terminology
Chunking and overlap settings are configurable so you control the granularity of retrieval
Source attribution shows visitors which documents the AI referenced, building trust and transparency
How RAG Works in BPract Agents
When a visitor asks a question, BPract Agents does not just pass it to the LLM and hope for the best. The system first converts the question into a vector embedding using the same model that indexed your content. It then performs a similarity search across your entire knowledge base using pgvector in PostgreSQL, retrieving the most relevant chunks of content. These chunks are injected into the LLM prompt as context, ensuring the AI response is grounded in your actual business data rather than general training knowledge. This dramatically reduces hallucinations and ensures accuracy.
Multiple Ingestion Methods
- Website crawler automatically follows links and indexes up to your configured page limit per crawl session
- File upload handles PDF, DOCX, plain text, and CSV formats with automatic text extraction
- Direct text input lets you paste FAQs, product specs, or policy documents right in the admin panel
- Scheduled re-crawling keeps your knowledge base in sync as your website content changes
Embedding and Retrieval Quality
BPract Agents uses state-of-the-art embedding models to convert text into high-dimensional vectors. The pgvector extension in PostgreSQL provides fast approximate nearest-neighbor search, returning the most relevant chunks in milliseconds even across thousands of documents. You can configure chunk size and overlap in the admin panel to balance between retrieval precision and context breadth. Larger chunks preserve more context around key information, while smaller chunks improve precision for specific factual queries.
Frequently Asked Questions
Common questions about RAG Knowledge Base.
What file formats can I upload to the knowledge base?
How often should I re-crawl my website?
Does the AI ever make up information not in my knowledge base?
Related Features
Explore more capabilities of BPract Agents.
Experience RAG Knowledge Base
See how RAG Knowledge Base can transform your website. Start your free trial today.