Skip to content

clairescanlon/serverless-financial-data-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Serverless Financial Data Pipeline

This project aims to create a fully serverless ETL pipeline on AWS for financial data management. It addresses current challenges in data quality and compliance while preparing for future scalability. The pipeline automates data ingestion, processing, and visualization.

View the full project case study on my portfolio: How I Increased Data Management Efficiency by 50% with an AWS ETL Pipeline

Data & Security

The data included in this repository is purely synthetic and for demonstration purposes only. It does not contain any real client, proprietary, or personally identifiable information (PII).

Languages Used

  • Python
  • SQL

AWS Services

  • Amazon S3: Landing zone for CSV uploads. Separate buckets for different data, versioning and SSE-KMS encryption
  • Amazon Aurora Serverless (PostgreSQL): Relational data storage with RDS Data API for batch upserts and schema enforcement
  • AWS Lambda: Lambda function for automated CSV parsing, data cleaning, transformation, batch loading, archiving and error handling
  • AWS IAM: Execution roles with least-privilege policies
  • Amazon CloudWatch: Logs, metrics and custom dashboards for monitoring pipeline execution and performance
  • AWS CloudTrail: Audit trail of API calls and data events for SOX/GDPR compliance
  • AWS KMS: Customer managed key for environment variable decryption and S3/Aurora encryption
  • AWS Secrets Manager: Secure storage and rotation of database credentials accessed by Lambda
  • Amazon SQS: Dead-Letter Queue for failed event capture and retry
  • AWS Config: Continuous configuration monitoring to ensure compliance drift detection
  • Qlik Sense: Interactive analytics and visualization layer enabling users to explore, filter and generate real-time financial reports securely

Features

Note

This is currently under construction. Check back again soon. If you need additional information, send me an email.

  • Data Pipelines
  • ETL (Extract, Transform, Load)
  • Scalable Data Architecture
  • Event-Driven Architecture
  • Financial Data Compliance
  • Database Normalization
  • Error Handling and Monitoring
  • Metadata Management
  • Relational Database
  • Data Lineage Tracking
  • Data Validation
  • Schema Enforcement
  • Database Indexing & Optimization
  • End-to-End Encryption
  • Audit Logging & Access Monitoring

Code Samples

Note

This is currently under construction. Check back again soon. If you need additional information, send me an email.

  • AWS Lambda Function - Coming Soon
  • Data Validation and Transformation - Coming Soon
  • S3 File Upload Handling - Coming Soon
  • Database Code - Coming Soon
  • Qlik Sense Script to Load Data - Coming Soon

Caution

© 2025 Claire Scanlon. All rights reserved. Unauthorized copying, distribution, or derivative use prohibited.

Releases

No releases published

Packages

No packages published

Languages