Pragya is a Data Engineer with 5+ years of experience designing and maintaining scalable data pipelines and cloud-based data architectures using Python, SQL, and relational databases (PostgreSQL, MSSQL, Oracle), with deep understanding of data modeling and transformation. She has strong hands-on experience with AWS Cloud Services (S3, Glue, Redshift, Lambda, Athena) for large-scale data processing and proficiency in ETL/ELT development, query optimization, and orchestration using Airflow and DBT. Pragya has worked with structured and unstructured data, applying advanced data validation and transformation logic, leveraging ORMs (SQLAlchemy), OOP principles, and Python performance optimization.
Maintained MySQL and Redshift schemas, executing Redshift SQL tuning for large analytical queries. Developed ELT workflows using DBT and automated deployments using Terraform. Designed and deployed AWS Lambda jobs to support real-time ingestion and alerting. Managed Docker image creation for deployment and integrated Quicksight dashboards for BI. Used S3 for storing staging data and implemented retention policies to optimize storage. Built user interfaces with Django to provide insights and allow ETL triggering via web UI. Tech stack: Python, AWS Redshift, DBT, MySQL, Terraform, Lambda, ECR, EC2, S3, Quicksight, Django.