About
Welcome to Unpacking Data
Hey there! Welcome to the unpacking data journey. My name is Dan, I am a senior software engineer and I write about software, data engineering and data science. Follow me for all data topics!
Background
I specialize in big data processing, data pipelines, and analytics using tools like Apache Spark, Databricks, and modern Python frameworks. My work focuses on building robust, scalable systems that can process large volumes of data efficiently while maintaining high quality.
Topics I Cover
- Data Engineering best practices
- PySpark optimization techniques
- Testing strategies for data pipelines
- Data quality and validation
- Performance tuning for big data applications
- Property-based testing
Feel free to reach out to me with questions or suggestions for future blog topics!