For law firms
A pre-configured box, shipped to your office. Plug it in — you're operational.
~70% of requests processed locally. ~30% routed to the best cloud LLMs (Claude, GPT, Mistral) — only anonymized excerpts. Always the best model, never locked in.


Extraction, Q&A, summaries, drafting
Anonymized before sending
The Problem
ChatGPT, Claude, Harvey — your queries pass through third-party cloud servers. Client data leaves your office, potentially waiving privilege.
Harvey is locked to OpenAI. Claude for Legal, to Anthropic. The best model changes every month — your tool doesn't.
ABA Model Rules and state bar ethics opinions increasingly scrutinize AI use. Most legal AI tools weren't built with privilege preservation in mind.
The Solution
A physical box shipped to your office, pre-loaded with the RAGbase software. Plug it in — you're operational.
Local models for everyday tasks. Cloud LLMs (Claude, GPT, Mistral) for complex queries — only anonymized excerpts.
Hardware provided and pre-configured. No infrastructure to manage. No IT department needed. Plug and play.
Model-agnostic. When a better model releases, we deploy it on your box. Your system stays the same.
Firm documents
Emails, DMS, contracts, briefs
RAGbase Box
At your office
~70% Local
Extraction, Q&A, summaries
~30% Cloud LLMs
Complex reasoning
Only anonymized excerpts are sent to cloud LLMs. Full documents never leave the box.
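As a rough illustration of the split above, here is a minimal routing sketch in Python. It is not the RAGbase implementation: the task names, the `Route` type, and the `route_request` function are hypothetical, and the anonymizer is passed in as a plain callable.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical task labels, borrowed from the wording on this page.
LOCAL_TASKS = {"extraction", "qa", "summary"}
CLOUD_TASKS = {"complex_reasoning"}


@dataclass
class Route:
    tier: str     # "local" or "cloud"
    payload: str  # text the selected model would receive


def route_request(task: str, excerpt: str,
                  anonymize: Callable[[str], str]) -> Route:
    """Choose a tier for a task; cloud-bound text is anonymized first."""
    if task in LOCAL_TASKS:
        return Route(tier="local", payload=excerpt)
    if task in CLOUD_TASKS:
        return Route(tier="cloud", payload=anonymize(excerpt))
    # Assumption: unknown task types stay local so nothing leaves the box by accident.
    return Route(tier="local", payload=excerpt)
```

Defaulting unknown tasks to the local tier is a design choice made for this sketch: the safe failure mode is that data stays on the box.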
In practice
From client intake to final pleading, every task is assigned to the right sovereignty layer.
Compliance
As state bar ethics opinions and ABA guidance on AI tighten, the RAGbase Box is architected for compliance by default.
Client documents stay on the box at your office. About 70% of tasks are processed locally, with no external connection.
Built-in de-identification before any data reaches cloud LLMs. Only anonymized excerpts are transmitted.
Full client documents never leave the firm's premises. Reasoning traces and agent logs stay on the box.
Every query, every response, every source is logged. Full audit trail on the box for compliance review.
Neither local models nor cloud providers use your data for training. For cloud providers, this is contractually guaranteed through their enterprise APIs.
Architecture designed for ABA Model Rules 1.1, 1.6, 5.3. Documentation provided for your firm's compliance review.
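To make the audit-trail idea concrete, here is one possible shape for a log entry, sketched in Python. The field names and the hash chaining are assumptions made for illustration and are not taken from the RAGbase software. Chaining each entry to the previous one is a common way to make later tampering detectable.

```python
import hashlib
import json
from datetime import datetime, timezone


def _digest(entry: dict) -> str:
    """SHA-256 over a stable JSON serialization of the entry."""
    return hashlib.sha256(
        json.dumps(entry, sort_keys=True).encode("utf-8")
    ).hexdigest()


def audit_record(query: str, response: str, sources: list,
                 tier: str, prev_hash: str = "") -> dict:
    """Build one audit entry linked to the previous entry's hash."""
    entry = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "tier": tier,          # "local" or "cloud" (hypothetical labels)
        "query": query,
        "response": response,
        "sources": sources,
        "prev_hash": prev_hash,
    }
    entry["hash"] = _digest(entry)  # computed before the "hash" key exists
    return entry


def verify_chain(entries: list) -> bool:
    """Return True if no entry was altered, removed, or reordered."""
    prev = ""
    for e in entries:
        body = {k: v for k, v in e.items() if k != "hash"}
        if e["prev_hash"] != prev or e["hash"] != _digest(body):
            return False
        prev = e["hash"]
    return True
```

A compliance reviewer could run `verify_chain` over the stored entries to check that the log is intact before relying on it.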
Plans
Each plan includes the pre-configured box and a monthly subscription. 24-month commitment.
Solo / small team — 1–5 users
Hardware
Mac Mini M4 Pro — 48 GB RAM, 1 TB SSD
Qwen 3 30B, Mistral Small 24B, Llama 3 8B or similar (Q5–Q6)
Mid-size firm — 5–15 users
Hardware
Mac Studio M4 Max — 128 GB RAM, 1 TB SSD
Llama 3.3 70B (Q4–Q5), Qwen 3 30B (Q8), DeepSeek R1 32B or similar
Large firm — 15–30 users
Hardware
Linux dual-GPU station — 2× RTX 5090 (64 GB VRAM), 128 GB RAM, 2 TB NVMe
Llama 3.3 70B (Q5–Q8), Qwen 3 235B-A22B, multi-concurrent inference
For firms with 50–200+ users
Hardware
Multi-GPU custom — 4× RTX 5090+ or RTX PRO, Linux rack server
70B+ full-precision models, multi-concurrent, calibrated to your use cases
24-month commitment. Subscription includes software license, updates, monitoring, and support. Details below.
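For readers who want to sanity-check the hardware tiers above, here is a back-of-the-envelope estimate in Python. It counts model weights only and treats the quantization level as a nominal bits-per-weight figure. Real quantization formats typically use somewhat more than their nominal bit count, and context caches and the operating system need memory too, so the 25% headroom is an assumption, not a vendor figure. The function names are hypothetical.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (10^9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         memory_gb: float, headroom: float = 0.25) -> bool:
    """True if the weights fit after reserving a fraction of memory as headroom."""
    usable = memory_gb * (1 - headroom)
    return weight_memory_gb(params_billion, bits_per_weight) <= usable


print(weight_memory_gb(70, 4))  # 35.0 (a 70B model at a nominal 4 bits)
print(fits(70, 4, 128))         # True  (128 GB machine)
print(fits(70, 16, 48))         # False (16-bit weights on a 48 GB machine)
```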
Monthly subscription
RAGbase software license (agents, RAG, traceability engine)
Local model updates as soon as they release
Intelligent routing to cloud LLMs (~30%) — Claude, GPT, Mistral based on task, anonymized excerpts
Box monitoring, security patches, encrypted backups
Hardware warranty and replacement
Technical support (level depends on plan)
Fine-tuning on your firm's corpus
Subscription is billed monthly. 24-month minimum commitment.
Comparison
| | RAGbase Box | Harvey | Claude for Legal | Generic SaaS |
|---|---|---|---|---|
| Where your data runs | Box at your office (local) + anonymized cloud LLMs | Cloud (Azure OpenAI) | Cloud (Anthropic) | Shared cloud |
| Data privacy | ~70% local + anonymized cloud via enterprise APIs | Full documents in cloud | Full documents in cloud | Variable, often unencrypted |
| AI model | Model-agnostic — always the best | Locked to OpenAI | Locked to Claude | Locked to one vendor |
| Pricing | Box + subscription, no per-seat fees | ~$500–1,500/user/mo | ~$200–400/user/mo | Per-seat, monthly |
| Your internal docs | 15+ years indexed, stay at the firm | Manual upload, external cloud | No permanent indexing | Limited, external cloud |
| Attorney-client privilege | Full documents never leave the box | Third-party cloud access | Third-party cloud access | Third-party cloud access |
| If you cancel | The box keeps working locally | Everything stops | Everything stops | Everything stops |
Frequently asked questions
Routine tasks (extraction, classification, Q&A, summaries, translation) run directly on the box at your office with no external connection. Complex requests (advanced legal reasoning, long-form drafting) are routed to the best cloud LLMs — Claude, GPT, or Mistral depending on the task. Only anonymized and de-identified excerpts are transmitted, never your full documents.
Full client documents never leave your office. The RAGbase Box processes about 70% of tasks locally. For the roughly 30% routed to cloud LLMs, only anonymized excerpts are sent via enterprise APIs that contractually guarantee zero retention and zero training on your data. This architecture was designed to preserve privilege by default.
The box stays at your office with your data. Local models already installed continue working for local tasks. You lose model updates, cloud routing, monitoring, and support. The hardware remains your property after the commitment period.
For routine tasks (about 70% of volume), local models deliver excellent results with near-instant response times. For complex tasks, routing to the best cloud LLMs (Claude, GPT, Mistral) gives you access to frontier models. The hybrid system combines the best of both worlds.
The RAGbase Box is not locked to any model vendor. When a better local model releases (Llama, Mistral, Qwen, etc.), we deploy it on your box. For cloud, we route to the best model available — Claude, GPT, or Mistral depending on the task. Your interface and workflows stay the same — only the engine improves.
Before any data is sent, it passes through an anonymization and de-identification pipeline built into the box. Only the excerpts necessary for the request are transmitted — never full documents. We use enterprise APIs from each provider (Anthropic, OpenAI, Mistral) that contractually guarantee zero retention and zero training use.
Yes. The listed price includes the pre-configured box (Mac or Linux station depending on plan). The monthly subscription covers the software license, updates, monitoring, support, and hardware warranty. 24-month commitment.
The RAGbase Box architecture is designed for compliance with ABA Model Rules and state bar ethics opinions on AI use. Client documents stay on-premise, cloud data is anonymized, and full audit trails are maintained. We provide documentation to support your firm's compliance review.
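The anonymization step described in the answers above can be pictured with a deliberately simplified Python sketch. It is an illustration only, not the pipeline shipped on the box: the regular expressions, the placeholder format, and the `anonymize` and `restore` functions are all hypothetical, and a production system would need far more robust entity detection than two patterns and a name list.

```python
import re

# Simplified patterns; they will miss many real-world formats.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")


def anonymize(excerpt: str, known_names: list):
    """Replace emails, phone numbers, and listed names with placeholders.

    Returns the redacted text and a placeholder-to-original mapping
    that is meant to stay on the box.
    """
    mapping = {}

    def substitute(kind: str, value: str) -> str:
        # Reuse the same placeholder when a value appears more than once.
        for token, original in mapping.items():
            if original == value:
                return token
        token = f"[{kind}_{len(mapping) + 1}]"
        mapping[token] = value
        return token

    text = EMAIL_RE.sub(lambda m: substitute("EMAIL", m.group(0)), excerpt)
    text = PHONE_RE.sub(lambda m: substitute("PHONE", m.group(0)), text)
    for name in known_names:
        text = re.sub(re.escape(name),
                      lambda m: substitute("PERSON", m.group(0)), text)
    return text, mapping


def restore(text: str, mapping: dict) -> str:
    """Put the original values back into a response, locally."""
    for token, original in mapping.items():
        text = text.replace(token, original)
    return text
```

Because the mapping never leaves the box in this sketch, a cloud model would only see placeholders, and `restore` can re-insert the real values into its answer on the local side.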
Order
A simple three-step process. You're operational within days.
30 minutes
Book a 30-min call with our team. We analyze your needs, query volume, and constraints (number of attorneys, DMS, security requirements).
Quote in 24h
We size the right hardware, select the optimal AI models for your firm, and prepare a detailed quote.
Shipped to your office
The box ships pre-configured to your office. Plug it in, our team completes setup remotely — you're operational.
Book your free 30-minute consultation. No commitment — we answer all your questions.
No commitment · Quote within 24h · Delivered in days