Back to Browse

Anonymous Studio | Group 3 | CPSC 4205

5o5
5o5
23 views
May 7, 2026
28:20

Anonymous Studio is a production-grade Personally Identifiable Information (PII) detection and anonymization platform built from the ground up as a senior capstone project. What started as a small Streamlit proof-of-concept evolved into a full enterprise-class pipeline MVP developed over 5 Agile sprints. What it does: - Detects and anonymizes 17+ PII entity types in free text and structured CSV/Excel files up to 500 MB (names, emails, SSNs, phone numbers, locations, and more) - Runs batch jobs as non-blocking background tasks with real-time progress (validated at 300,000+ row scale) - Tracks every job on a Kanban board (Backlog → In Progress → Review → Done) with cryptographic compliance attestations signed by Ed25519 keys - Logs every user and system action to a tamper-resistant audit trail with CSV/JSON export - Exposes a full REST API with Auth0 JWT authentication and Swagger/OpenAPI docs - Enforces fine-grained authorization via OpenFGA — fail-closed, not fail-open Tech Stack: Python 3.12 · Taipy 3.1 (GUI + orchestration) · Microsoft Presidio · spaCy en_core_web_lg · MongoDB · DuckDB · OpenFGA · Auth0 · Prometheus + Grafana · Docker Compose · Pandas / Dask · Ed25519 cryptography · pytest (82+ tests)

Download

0 formats

No download links available.

Anonymous Studio | Group 3 | CPSC 4205 | NatokHD