This is an alert component

About

Welcome to Unpacking Data

Hey there! Welcome to the unpacking data journey. My name is Dan, I am a senior software engineer and I write about software, data engineering and data science. Follow me for all data topics!

Background

I specialize in big data processing, data pipelines, and analytics using tools like Apache Spark, Databricks, and modern Python frameworks. My work focuses on building robust, scalable systems that can process large volumes of data efficiently while maintaining high quality.

Topics I Cover

  • Data Engineering best practices
  • PySpark optimization techniques
  • Testing strategies for data pipelines
  • Data quality and validation
  • Performance tuning for big data applications
  • Property-based testing

Feel free to reach out to me with questions or suggestions for future blog topics!


Subscribe to new posts