DataCleaner AI is a local-first PII detection and redaction engine. Scan PDFs, emails, databases, and spreadsheets for sensitive data โ powered by your own GPU, not the cloud.
DataCleaner combines regex speed with local AI intelligence โ no compromises on privacy or performance.
All scanning and redaction happens on your machine. No cloud uploads. No API calls. No third party ever sees your data. Your GPU does the heavy lifting โ we never touch it.
Zero Data LeakagePass 1 (Regex): 50+ patterns catch emails, phones, SSNs, credit cards, API keys in milliseconds.
Pass 2 (LLM): Local AI catches contextual PII: names in prose, medical conditions, family relationships, salary figures.
Every scan generates a cryptographically-signed, timestamped audit log. Ready for GDPR Article 30, HIPAA Technical Safeguards, CCPA data inventory, and ISO 27001 evidence collection.
GDPR ยท HIPAA ยท CCPADetects US SSN, UK NI Number, China Resident ID, EU IBAN/SWIFT, passport numbers from 30+ countries, and localized phone/address formats. Built for global compliance teams.
30+ CountriesTerminal-native for ad-hoc scans and shell scripting. REST API available for CI/CD pipelines, automated workflows, and integration with n8n, Zapier, or Make.com.
CLI + REST APIFrom document to compliance-ready output in under 3 seconds per file.
Scan any document, folder, or pipe data from stdin. Supports PDF, DOCX, XLSX, CSV, TXT, JSON, HTML, and more.
dc scan ./contracts/
Regex catches structured PII instantly. Local LLM uncovers hidden contextual data โ names in text, medical data, financial figures.
385 redactions applied
Redacted files saved. Timestamped audit log generated. GDPR Article 30 ready. Nothing ever left your machine.
audit_20260502.json saved
One license. Unlimited documents. No hidden fees. No per-token billing.
For individuals evaluating PII detection.
Single-machine license. Unlimited everything.
For organizations with compliance teams.
All prices in USD. Payments processed securely by Paddle (our authorized Merchant of Record). 30-day money-back guarantee. No questions asked.
DataCleaner's local-first architecture is inherently compliant with the world's strictest data protection laws.
DataCleaner AI is designed from the ground up to support GDPR compliance. Because all data processing occurs entirely on your local machine, DataCleaner operates as a data processing tool under your exclusive control. We never act as a data controller or processor โ you remain the sole data controller at all times.
"We process 500+ GDPR subject access requests per month. DataCleaner cut our redaction time from 3 days to 3 hours. The fact that nothing leaves our secure environment was the deciding factor."
"As a solo developer shipping to EU customers, I was terrified of GDPR fines. DataCleaner runs on my RTX 4070 and catches PII I didn't even know was in my logs. Saved me from a potential โฌ20M penalty."
"We evaluated 7 PII tools. DataCleaner is the only one that runs completely offline. Our security team vetoed every cloud-based option. This one passed audit in a single day."
No. Never. All processing happens on your local machine. DataCleaner has no cloud component, no telemetry, and makes zero outbound network calls. Even license validation is performed offline using cryptographic signatures.
Regex-only mode runs on any computer. For full AI-powered scanning, you need a GPU with at least 8GB VRAM (NVIDIA RTX 3070+, Apple M1+, or AMD RX 7000+) and Ollama installed with a model like Qwen 3.5 (9B) or Llama 3.3 (8B).
Yes. DataCleaner was designed specifically to support GDPR compliance workflows. The local-first architecture means you never lose control of your data. Generated audit logs satisfy Article 30 record-keeping requirements. See our Compliance section for details.
We offer a 30-day money-back guarantee. If DataCleaner doesn't meet your needs, email us for a full refund โ no questions asked. Payments are processed by Paddle, our authorized merchant of record.
After purchase, you'll receive a license key via email. Run dc license activate YOUR-KEY in your terminal. The key is validated offline โ no internet required.
Install in 30 seconds. First scan in under a minute. Zero configuration. Zero data leakage.