Huntington Bank Redacts PII from 400M Documents Using AWS.

Rob Carnell· June 24, 2026 View original

▶ The 60-second brief

Summary

Huntington Bank developed a scalable AWS solution to detect and redact Personally Identifiable Information (PII) and Payment Card Industry (PCI) data from over 400 million documents. This solution drastically reduced processing time from years to months and achieved over 95% redaction accuracy.

Huntington Bank successfully implemented a robust solution leveraging Amazon Web Services to address the challenge of redacting sensitive information from an enormous volume of documents. The bank faced the daunting task of identifying and removing Personally Identifiable Information (PII) and Payment Card Industry (PCI) data from more than 400 million records. Through this AWS-based system, Huntington Bank achieved a remarkable improvement in efficiency and accuracy. The processing time for this massive undertaking was cut from what would have been years down to just a few months, while maintaining an impressive redaction accuracy rate exceeding 95%. This demonstrates a significant advancement in automated data privacy compliance.

Why it matters

This case study provides a practical example of how large-scale data redaction can be automated and accelerated using cloud services, offering a blueprint for other organizations facing similar compliance and data privacy challenges.

How to implement this in your domain

  1. 1Assess current data redaction needs and identify sensitive data types (PII, PCI, etc.).
  2. 2Evaluate AWS services like Amazon Textract, Comprehend, or custom ML models for data detection.
  3. 3Design a scalable cloud architecture for document ingestion, processing, and redaction.
  4. 4Implement robust testing and validation procedures to ensure high redaction accuracy.
  5. 5Integrate the automated redaction solution into existing document management workflows.

Who benefits

BFSIHealthcareLegalGovernmentRetail

Key takeaways

  • Huntington Bank successfully redacted PII/PCI from over 400 million documents using AWS.
  • The solution reduced processing time from years to months.
  • It achieved over 95% redaction accuracy.
  • This demonstrates effective large-scale data privacy compliance with cloud AI.

Original post by Rob Carnell

"In this post, we walk through how Huntington built a scalable AWS solution to detect and redact Personally Identifiable Information (PII) and Payment Card Industry (PCI) data from over 400 million documents, reducing processing time from years to just a few months while achieving…"

View on X

Originally posted by Rob Carnell on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses