InfoQ Homepage Presentations Efficient Data Storage for Analytics with Parquet 2.0
Efficient Data Storage for Analytics with Parquet 2.0
Summary
Julien Le Dem discusses the advantages of a columnar data layout, specifically the features and design choices Apache Parquet uses to achieve goals of interoperability, space and query efficiency.
Bio
Julien Le Dem is the lead for Parquet's java implementation. He also leads Data Processing tool development at Twitter and is on the Apache Pig PMC.
About the conference
Software is Changing the World. QCon empowers software development by facilitating the spread of knowledge and innovation in the developer community. A practitioner-driven conference, QCon is designed for technical team leads, architects, engineering directors, and project managers who influence innovation in their teams.