Introducing Crunchy Data Warehouse: A next-generation Postgres-native data warehouse. Crunchy Data Warehouse Learn more

Posts about Analytics

  • 8 min read

    4 Ways to Create Date Bins in Postgres: interval, date_trunc, extract, and to_char

    Christopher Winslett

    You followed all the best practices, your sales dates are stored in perfect timestamp format …. but now you need to get reports by day, week, quarters, and months. You need to bin, bucket, and roll up sales data in easy to view reports. Do you need a BI tool? Not yet actually. Your Postgres database has hundreds of functions that let you query data analytics by date. By using some good old fashioned SQL - you have powerful analysis and business intelligence with date details on any data set. In...

    Read More
  • 4 min read

    pg_parquet: An Extension to Connect Postgres and Parquet

    Craig Kerstiens

    Today, we’re excited to release pg_parquet - an open source Postgres extension for working with Parquet files. The extension reads and writes parquet files to local disk or to S3 natively from Postgres. With pg_parquet you're able to: • Export tables or queries from Postgres to Parquet files • Ingest data from Parquet files to Postgres • Inspect the schema and metadata of existing Parquet files Export tables or queries from Postgres to Parquet files Ingest data from Parquet files to Postgres I...

    Read More
  • 14 min read

    Vehicle Routing with PostGIS and Overture Data

    Paul Ramsey

    The Overture Maps collection of data is enormous, encompassing over 300 million transportation segments , 2.3 billion building footprints , 53 million points of interest , and a rich collection of cartographic features as well. It is a consistent global data set, but it is intimidatingly large -- what can a person do with such a thing? Building cartographic products is the obvious thing, but what about the less obvious. With an analytical engine like PostgreSQL and Crunchy Bridge for Analy...

    Read More
  • Window Functions for Data Analysis with Postgres

    Elizabeth Christensen

    SQL makes sense when it's working on a single row, or even when it's aggregating across multiple rows. But what happens when you want to compare between rows of something you've already calculated? Or make groups of data and query those? Enter window functions. Window functions tend to confuse people - but they’re a pretty awesome tool in SQL for data analytics. The best part is that you don’t need charts, fancy BI tools or AI to get some actionable and useful data for your stakeholders. Window...

    Read More
  • PostGIS meets DuckDB: Crunchy Bridge for Analytics goes Spatial

    Marco Slot

    Crunchy Data is excited to announce the next major feature release for Crunchy Bridge for Analytics : Geospatial Analytics . We have developed a variety of features to connect Postgres and PostGIS to S3 and public web servers to make spatial data access easier than ever. This release includes: • Creating an analytics table directly from a geospatial data set by providing only the URL, for ad-hoc queries and data transformations. • Creating a regular PostGIS table directly from a URL. • Automat...

    Read More
  • 4 min read

    Postgres Materialized Views from Parquet in S3 with Zero ETL

    Marco Slot

    Data pipelines for IoT applications often involve multiple different systems. First, raw data is gathered in object storage, then several transformations happen in analytics systems, and finally results are written into transactional databases to be accessed by low latency dashboards. While a lot of interesting engineering goes into these systems, things are much simpler if you can do everything in Postgres. Crunchy Bridge for Analytics is a managed PostgreSQL offering that integrates DuckDB...

    Read More
  • 12 min read

    Postgres Powered by DuckDB: The Modern Data Stack in a Box

    Marco Slot

    Postgres for analytics has always been a huge question mark. By using PostgreSQL's extension APIs, integrating DuckDB as a query engine for state-of-the-art analytics performance without forking either project could Postgres be the analytics database too? Bringing an analytical query engine into a transactional database system raises many interesting possibilities and questions. In this blog post I want to reflect on what makes these workloads and system architectures so different and what br...

    Read More
  • 4 min read

    Query Hugging Face Datasets from Postgres

    Elizabeth Christensen

    If you missed the database news lately, you could have missed that we just fused DuckDB with Postgres to build a really fast analytics platform based on Postgres. There’s so many interesting things you can do with this platform so expect to hear from me again 😉. Today I just want to show off one really simple trick for getting big data sets or training data into Postgres through Hugging Face. Hugging Face is a community repository of datasets, LLMs, models and other resources for Machine L...

    Read More
  • 6 min read

    Running TPC-H Queries on Iceberg Tables from PostgreSQL

    Önder Kalacı

    We recently introduced support for querying Iceberg tables from PostgreSQL in Crunchy Bridge for Analytics. Iceberg defines a way to store tables in data lakes (usually as Parquet files in S3) with support for snapshots and other important database features, and it is designed with high performance analytics in mind. If you’re new to Crunchy Bridge, it offers a fully managed PostgreSQL experience. Crunchy Bridge for Analytics extends these capabilities, enabling you to query and interact wit...

    Read More
  • 11 min read

    Crunchy Bridge Adds Iceberg to Postgres & Powerful Analytics Features

    Marco Slot

    In April we launched Crunchy Bridge for Analytics , which is a managed PostgreSQL option that enables fast and seamless querying of your data lake. Our initial release was focused on building a rock solid foundation for high performance analytics in PostgreSQL. We have since been hard at work turning it into a comprehensive analytics solution. Our goals in building Crunchy Bridge for Analytics are to: • Make it very easy to query data files (incl. Parquet/CSV/JSON/Iceberg) in object stores like...

    Read More