What is permissions-aware data filtering for RAG?

Permissions-aware data filtering is the process of applying authorization policies at the retrieval layer of a RAG pipeline. Instead of retrieving all data and filtering afterward, Cerbos generates a query plan that translates user permissions into native database or vector store filters, ensuring only authorized data is fetched and passed to the LLM.

Why is post-retrieval filtering insufficient for RAG security?

Post-retrieval filtering means sensitive data has already been fetched from the vector store, loaded into memory, and potentially cached. The security boundary was crossed at the point of retrieval. Pre-retrieval filtering with Cerbos ensures unauthorized data is never fetched in the first place, providing a stronger security guarantee.
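
To make the difference concrete, here is a minimal Python sketch using a toy in-memory store. `TrackingStore`, `Chunk`, and the department rule are hypothetical stand-ins for illustration, not Cerbos or vector store APIs. The store records every chunk it hands back, so you can compare what crosses the boundary under each approach.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    id: str
    department: str
    text: str


@dataclass
class TrackingStore:
    """Toy store that records every chunk it returns to the caller."""
    chunks: list
    fetched_ids: list = field(default_factory=list)

    def fetch(self, where=None):
        # `where` is an optional predicate applied inside the store,
        # before anything is returned to the application.
        hits = [c for c in self.chunks if where is None or where(c)]
        self.fetched_ids.extend(c.id for c in hits)
        return hits


CHUNKS = [
    Chunk("c1", "engineering", "Deploy runbook"),
    Chunk("c2", "hr", "Salary bands"),
    Chunk("c3", "engineering", "On-call rota"),
]


def allowed(chunk, user_department):
    # Hypothetical rule: users may only read their own department's chunks.
    return chunk.department == user_department


def post_filter(store, user_department):
    fetched = store.fetch()  # every chunk leaves the store first
    return [c for c in fetched if allowed(c, user_department)]


def pre_filter(store, user_department):
    # The rule travels with the query, so unauthorized chunks never leave.
    return store.fetch(where=lambda c: allowed(c, user_department))


post_store = TrackingStore(list(CHUNKS))
pre_store = TrackingStore(list(CHUNKS))
post_result = post_filter(post_store, "engineering")
pre_result = pre_filter(pre_store, "engineering")

print(post_store.fetched_ids)  # ['c1', 'c2', 'c3']
print(pre_store.fetched_ids)   # ['c1', 'c3']
```

Both approaches hand the same two chunks to the LLM, but only the post-retrieval version ever pulled the HR chunk out of the store.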

What vector stores does Cerbos support for RAG authorization?

Cerbos query plans can be translated into native filter syntax for major vector stores including Pinecone, Weaviate, Chroma, Qdrant, and FAISS. The same authorization policies apply regardless of which vector store you use, with consistent audit trails across all of them.
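
As one illustration of what such a translation might look like, the sketch below converts a small condition tree into a Qdrant-style filter payload (`must`, `should`, `must_not`, `match`). The condition tree is modeled on the general shape of a query plan condition, but its field names, the operator names (`and`, `or`, `eq`, `ne`, `in`), and the `to_qdrant_filter` helper are assumptions made for this example. This is not an official adapter, and it handles only a handful of operators.

```python
ATTR_PREFIX = "request.resource.attr."


def field_name(variable):
    # Only resource attributes can be pushed down as payload filters.
    if not variable.startswith(ATTR_PREFIX):
        raise ValueError(f"cannot push down {variable!r}")
    return variable[len(ATTR_PREFIX):]


def to_qdrant_filter(node):
    """Translate an assumed condition tree into a Qdrant-style filter dict."""
    expr = node["expression"]
    op, operands = expr["operator"], expr["operands"]
    if op == "and":
        return {"must": [to_qdrant_filter(child) for child in operands]}
    if op == "or":
        return {"should": [to_qdrant_filter(child) for child in operands]}
    if op in ("eq", "ne", "in"):
        # Assumes operands arrive as (variable, value), in that order.
        key = field_name(operands[0]["variable"])
        value = operands[1]["value"]
        if op == "in":
            return {"key": key, "match": {"any": list(value)}}
        clause = {"key": key, "match": {"value": value}}
        return clause if op == "eq" else {"must_not": [clause]}
    # Fail closed: an operator we cannot translate must not widen access.
    raise ValueError(f"unsupported operator: {op!r}")


condition = {"expression": {"operator": "and", "operands": [
    {"expression": {"operator": "eq", "operands": [
        {"variable": "request.resource.attr.department"},
        {"value": "marketing"},
    ]}},
    {"expression": {"operator": "ne", "operands": [
        {"variable": "request.resource.attr.classification"},
        {"value": "restricted"},
    ]}},
]}}

qdrant_filter = to_qdrant_filter(condition)
print(qdrant_filter)
```

The policy stays the same while only the final translation step is store-specific, which is the property the answer above describes.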

How does Cerbos handle audit logging for RAG data access?

Every authorization check at the retrieval layer is logged with the principal, resource, action, policy version, and decision outcome. These structured logs support compliance requirements for SOC 2, ISO 27001, HIPAA, PCI DSS, and GDPR, and can be streamed to your SIEM or log management platform.

Permissions-aware authorization for RAG pipelines

How does Cerbos integrate with RAG architectures?

Cerbos integrates at the retrieval layer using its query plan API. When a user triggers a RAG query, Cerbos evaluates their permissions and returns a set of conditions that can be translated into native filters for vector stores like Pinecone, Weaviate, Chroma, Qdrant, or FAISS. Only data the user is authorized to access is retrieved and passed to the LLM.

Control what data enters your AI prompts. Cerbos enforces user-level permissions at the retrieval layer, so your RAG systems only surface information each user is authorized to see.

The retrieval problem

RAG pipelines expose data your users shouldn't see

Over-retrieval from vector stores

Embedding-based retrieval ignores authorization. A similarity search returns the most relevant chunks regardless of whether the requesting user has permission to see them.
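
A small Python sketch shows the problem. The index entries, their vectors, and the `classification` labels are invented for illustration. The ranking looks only at vector similarity, so the restricted chunk comes back first simply because it is closest to the query.

```python
import math


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


# Hypothetical index: each entry carries metadata the search never reads.
INDEX = [
    {"id": "handbook-leave", "classification": "public", "vec": [0.9, 0.1, 0.0]},
    {"id": "exec-comp-2024", "classification": "restricted", "vec": [0.8, 0.6, 0.0]},
    {"id": "office-map", "classification": "public", "vec": [0.0, 0.1, 0.9]},
]


def top_k(query_vec, k):
    # Pure similarity ranking: no notion of who is asking.
    ranked = sorted(INDEX, key=lambda d: cosine(query_vec, d["vec"]), reverse=True)
    return [d["id"] for d in ranked[:k]]


results = top_k([0.7, 0.7, 0.0], k=2)
print(results)  # ['exec-comp-2024', 'handbook-leave']
```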

Context injection into prompts

Unauthorized data retrieved from the vector store is injected directly into the LLM prompt. The model treats it as ground truth and surfaces it in its response.

Post-hoc filtering is brittle

Filtering results after retrieval means sensitive data has already been fetched, loaded into memory, and potentially cached. By that point the security boundary has already been crossed.

No audit trail for data access

Without authorization checks at the retrieval layer, there is no record of which data was accessed, by whom, or under what policy. Compliance teams have no visibility.

Built for production RAG systems

Permissions-aware data filtering with Cerbos

Cerbos integrates at the retrieval layer to ensure only authorized data reaches the LLM. The query plan API translates your authorization policies into database-native filters — applied before any data is fetched.

Define data access policies

Write YAML policies that define which principals can access which data based on role, department, region, classification level, or any attribute. Policies are version-controlled and testable.

Generate authorization-aware query plans

When a user triggers a RAG query, Cerbos evaluates their permissions and returns a query plan — a set of conditions that can be translated into native database or vector store filters.
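
A query plan does not always carry conditions: the user may be allowed everything, nothing, or only what matches a filter. The Python sketch below shows one way a retrieval layer might branch on that. The dictionary shape and the `KIND_*` names are modeled on the general shape of a plan response and should be treated as assumptions to check against the Cerbos API reference. `retrieval_strategy` is a hypothetical helper.

```python
def retrieval_strategy(plan):
    """Decide how to query the vector store for a given plan (sketch)."""
    plan_filter = plan["filter"]
    kind = plan_filter["kind"]
    if kind == "KIND_ALWAYS_DENIED":
        # Nothing is permitted: skip the vector store entirely.
        return ("skip", None)
    if kind == "KIND_ALWAYS_ALLOWED":
        # Everything is permitted: no extra filter is needed.
        return ("unfiltered", None)
    if kind == "KIND_CONDITIONAL":
        # Only matching data is permitted: pass the condition downstream.
        return ("filtered", plan_filter["condition"])
    # Fail closed on anything unexpected.
    raise ValueError(f"unrecognized plan kind: {kind!r}")


conditional_plan = {
    "filter": {
        "kind": "KIND_CONDITIONAL",
        "condition": {"expression": {"operator": "eq", "operands": [
            {"variable": "request.resource.attr.department"},
            {"value": "marketing"},
        ]}},
    }
}
denied_plan = {"filter": {"kind": "KIND_ALWAYS_DENIED"}}

mode, condition = retrieval_strategy(conditional_plan)
print(mode)                             # filtered
print(retrieval_strategy(denied_plan))  # ('skip', None)
```

Raising on an unrecognized kind is a deliberate choice in this sketch: an unknown answer should never fall through to an unfiltered search.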

Apply filters at the retrieval layer

The query plan is applied as a pre-filter on your vector store or database. Only data the user is permitted to access is retrieved.
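
The sketch below imitates a pre-filter in plain Python, assuming the query plan has already been converted into a metadata filter with `$and`, `$or`, `$eq`, `$ne`, and `$in` operators. The documents, vectors, and the `matches` and `search` helpers are hypothetical. A real vector store would evaluate the filter inside its own index rather than in application code.

```python
OPS = {
    "$eq": lambda value, operand: value == operand,
    "$ne": lambda value, operand: value != operand,
    "$in": lambda value, operand: value in operand,
}


def matches(metadata, flt):
    """Return True if the metadata satisfies the filter."""
    for key, cond in flt.items():
        if key == "$and":
            ok = all(matches(metadata, sub) for sub in cond)
        elif key == "$or":
            ok = any(matches(metadata, sub) for sub in cond)
        else:
            for op in cond:
                if op not in OPS:
                    raise ValueError(f"unsupported operator: {op!r}")
            ok = all(OPS[op](metadata.get(key), operand)
                     for op, operand in cond.items())
        if not ok:
            return False
    return True


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def search(docs, query_vec, flt, k):
    # Pre-filter: only permitted documents are candidates for ranking.
    candidates = [d for d in docs if matches(d["metadata"], flt)]
    ranked = sorted(candidates, key=lambda d: dot(query_vec, d["vec"]), reverse=True)
    return [d["id"] for d in ranked[:k]]


DOCS = [
    {"id": "d1", "vec": [1.0, 0.0], "metadata": {"region": "eu", "classification": "public"}},
    {"id": "d2", "vec": [0.9, 0.1], "metadata": {"region": "eu", "classification": "restricted"}},
    {"id": "d3", "vec": [0.8, 0.2], "metadata": {"region": "us", "classification": "public"}},
    {"id": "d4", "vec": [0.1, 0.9], "metadata": {"region": "eu", "classification": "public"}},
]

user_filter = {"$and": [
    {"region": {"$eq": "eu"}},
    {"classification": {"$in": ["public", "internal"]}},
]}

results = search(DOCS, [1.0, 0.0], user_filter, k=2)
print(results)  # ['d1', 'd4']
```

Without the filter, the second result would be the restricted document `d2`. With it, `d2` is never a candidate, so the less similar but permitted `d4` takes its place.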

Log every data access decision

Every authorization check is logged with the principal, resource, action, policy version, and outcome. Stream structured logs to your SIEM for compliance and forensic analysis.
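
For a sense of what one structured record could contain, here is a minimal Python sketch that serializes the five fields named above, plus a timestamp, as a single JSON line. The `RetrievalAuditRecord` class, its field names, and the sample values are hypothetical. They are not the Cerbos audit log schema.

```python
import json
from dataclasses import asdict, dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class RetrievalAuditRecord:
    # Illustrative field names only; not the Cerbos audit log schema.
    timestamp: str
    principal: str
    resource: str
    action: str
    policy_version: str
    outcome: str


def to_log_line(record):
    # One JSON object per line is easy for most log pipelines to ingest.
    return json.dumps(asdict(record), sort_keys=True)


record = RetrievalAuditRecord(
    timestamp=datetime(2024, 1, 15, 9, 30, tzinfo=timezone.utc).isoformat(),
    principal="user:alice",
    resource="document",
    action="read",
    policy_version="default",
    outcome="conditional",
)

line = to_log_line(record)
print(line)
```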

How Cerbos secures your RAG pipeline

Filter before retrieval, not after. Cerbos translates authorization policies into native query filters so sensitive data never enters the pipeline.

Native vector store integration

Cerbos query plans translate into native filter syntax for Pinecone, Weaviate, Chroma, Qdrant, and FAISS. No custom middleware required.
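
As a rough sketch of that translation, the Python below turns a small condition tree into a metadata filter that uses `$and`, `$or`, `$eq`, `$ne`, and `$in`, the operator style accepted by Pinecone's `filter` argument and Chroma's `where` argument. The input tree is modeled on the general shape of a plan condition, but its field names and operator names are assumptions, and `to_metadata_filter` is a hypothetical helper rather than an official adapter.

```python
ATTR_PREFIX = "request.resource.attr."
LOGICAL = {"and": "$and", "or": "$or"}
COMPARISON = {"eq": "$eq", "ne": "$ne", "in": "$in"}


def to_metadata_filter(node):
    """Translate an assumed condition tree into a metadata filter dict."""
    expr = node["expression"]
    op, operands = expr["operator"], expr["operands"]
    if op in LOGICAL:
        return {LOGICAL[op]: [to_metadata_filter(child) for child in operands]}
    if op in COMPARISON:
        left, right = operands
        # Assumes operands arrive as (variable, value), in that order.
        if "variable" not in left or "value" not in right:
            raise ValueError(f"expected (variable, value) operands for {op!r}")
        variable = left["variable"]
        if not variable.startswith(ATTR_PREFIX):
            raise ValueError(f"cannot push down {variable!r}")
        field = variable[len(ATTR_PREFIX):]
        return {field: {COMPARISON[op]: right["value"]}}
    # Fail closed: an operator we cannot translate must not widen access.
    raise ValueError(f"unsupported operator: {op!r}")


condition = {"expression": {"operator": "and", "operands": [
    {"expression": {"operator": "eq", "operands": [
        {"variable": "request.resource.attr.department"},
        {"value": "marketing"},
    ]}},
    {"expression": {"operator": "in", "operands": [
        {"variable": "request.resource.attr.region"},
        {"value": ["eu", "uk"]},
    ]}},
]}}

metadata_filter = to_metadata_filter(condition)
print(metadata_filter)
# {'$and': [{'department': {'$eq': 'marketing'}}, {'region': {'$in': ['eu', 'uk']}}]}
```

The sketch rejects anything it cannot express as a filter instead of dropping the clause, since silently ignoring a condition would return more data than the policy allows.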

Pre-retrieval enforcement

Authorization is enforced before data is fetched, not after. Sensitive data never enters the RAG pipeline or the LLM context window.

Sub-millisecond query plan generation

Query plans are generated in under a millisecond. Authorization adds negligible latency to the retrieval pipeline.

Same policies across all layers

The same YAML policies govern data access, API authorization, application permissions, and AI agent behavior. One policy engine, consistent enforcement.

Full audit trail for every data access decision

Trace every data access decision

Log which data was surfaced to which user through which RAG pipeline, with the exact policy version and conditions applied.

Policy lineage for every retrieval

See the exact policy, version, and release behind each data access decision for complete traceability.

Compliance-ready audit trails

Structured logs support SOC 2, ISO 27001, HIPAA, PCI DSS, and GDPR audit requirements. Stream them to your SIEM or log management platform.

Real-world scenarios

RAG authorization in practice

HR knowledge base

An employee asks the company chatbot about parental leave policy. Cerbos ensures the RAG pipeline retrieves only policies applicable to the employee’s region, employment type, and seniority level — not compensation data, termination procedures, or policies for other jurisdictions.

Multi-tenant SaaS documentation

A customer queries the support AI. Cerbos restricts retrieval to the documentation, configuration guides, and known issues for that customer’s plan tier and product modules. Internal engineering docs and other tenants’ data are never surfaced.

Financial services research

An analyst asks an AI assistant about market positions. Cerbos enforces information barriers: the retrieval layer returns only the research and positions the analyst’s desk is authorized to access, which helps the firm meet its Chinese wall requirements.