About

Welcome to Unpacking Data

Hey there! Welcome to the Unpacking Data journey. My name is Dan. I am a senior software engineer, and I write about software, data engineering, and data science. Follow me for all data topics!

Background

I specialize in big data processing, data pipelines, and analytics using tools like Apache Spark, Databricks, and modern Python frameworks. My work focuses on building robust, scalable systems that can process large volumes of data efficiently while maintaining high quality.

Topics I Cover

Data engineering best practices
PySpark optimization techniques
Testing strategies for data pipelines
Data quality and validation
Performance tuning for big data applications
Property-based testing

Feel free to reach out to me with questions or suggestions for future blog topics!
