Blog
Latest Thinking From The Team

Dec 7, 2023
General
Combining ClickHouse And AWS SageMaker For Machine Learning
In previous articles, we looked at performing machine learning tasks such as forecasting and anomaly detection directly within…

Dec 3, 2023
General
Combining Cube And ClickHouse For User Facing Analytics
As we know, ClickHouse is a powerful database which provides high performance even in the face of a large number of concurrent users…

Dec 1, 2023
General
Introducing AWS Bedrock, Knowledge Bases And Agents
In this video, we introduce AWS Bedrock, which is a new service for working with large language models (LLMs). We begin by introducing…

Nov 13, 2023
General
Why Cloud Data Warehouses Are Too Expensive For Emerging Data Requirements
ClickHouse recently published an excellent article describing how cloud data warehouses such as Snowflake and Redshift are coming under…

Nov 7, 2023
General
Time Series Classification Using ClickHouse Machine Learning Functions
This article is part of a series where we look at doing data science work within ClickHouse. Articles in this series include forecasting…

Nov 6, 2023
General
Linear Regression Using ClickHouse Machine Learning Functions
This article is part of a series where we look at doing data science work within ClickHouse. Articles in this series include forecasting…

Oct 23, 2023
General
Comparing ClickHouse Cloud With Snowflake Pricing
In this article we analyse the differences in pricing between Snowflake and ClickHouse Cloud, with the aim of comparing TCO…

Oct 17, 2023
General
Anomaly Detection Using ClickHouse
This article is part of a series where we look at doing data science work within ClickHouse. Articles in this series include forecasting…

Oct 15, 2023
General
Forecasting Using ClickHouse Machine Learning Functions
This article is part of a series where we look at doing data science work within ClickHouse. Articles in this series include forecasting…

Oct 12, 2023
ClickHouse
How We Built A Crypto Analytics Platform Based On ClickHouse
One of our most significant projects with ClickHouse involved developing an analytics platform for Crypto and Web3 data. The aim of the…

Oct 7, 2023
General
Why ClickHouse Cloud Disrupts The Data Warehouse Industry
In recent years, when I have been asked by businesses which data warehouse to choose, there were 3 candidates at the top of the list…

Oct 6, 2023
General
The New Architecture For Cloud Native Data
The architectural patterns for building and managing data warehouses in the cloud are changing dramatically. These changes are likely to be…

Jun 17, 2023
ClickHouse
Combining dbt And ClickHouse
dbt is a popular open-source tool that is used for defining and running data transformations within data warehouses. Typically, it has…

Jun 8, 2023
Ensemble CI
Using Git Branches And Pull Requests As Part Of Your dbt Workflow
How using source control, Git and branching and merging strategies can improve your data delivery.

Jun 1, 2023
ClickHouse
Why We Went "All In" On ClickHouse
Historically, enterprise data and analytics systems have been built around batch processing. This involves collecting new data into batches…

May 9, 2023
Ensemble CI
Using Multiple Databases For Your dbt Workflow
How introducing multiple databases as part of a dbt deployment improves your data quality.

May 8, 2023
Ensemble CI
What Is dbt And Why Is It Such A Game Changer For Data Teams?
How dbt is a simple yet transformative tool for data teams.

May 5, 2023
Ensemble CI
How dbt Helps Data Engineers Work Like Software Engineers
How dbt helps Data Engineers adopt practices that have traditionally been used by Software Engineers.

May 1, 2023
Ensemble CI
Introducing Ensemble CI, A Continuous Delivery Platform For Data Engineers
Introducing Ensemble, a CI/CD platform built specifically for Data Engineers who use dbt.

Jan 1, 2023
Snowflake
When To Use Snowflake
The Data and Analytics tooling market is large and crowded. In categories including data warehouses, data lake technology, ETL tooling and…

Nov 17, 2022
General
Introduction To AWS Kinesis
Amazon Kinesis is a cloud-based service from AWS that's essential for real-time data streaming and analysis. Below are its key components:…

Nov 1, 2022
Streaming
From Historical Dashboards To Automated Interventions
Historically, business intelligence initiatives have focussed on providing self service dashboards and reports which are used to surface…

Oct 20, 2022
Streaming
Is Real Time Analytics Possible With A Data Warehouse
Relational Data Warehouse technology has been the beating heart of business intelligence for many decades. Typically, Data Warehouses act as…

Oct 19, 2022
Tools
What Is Dash, And How Can It Benefit Your Business?
As we know, many business intelligence initiatives are overly focussed on dashboards as the means of disseminating information, and we believe this is…

Oct 19, 2022
General
Why Embedded Analytics Beat Dashboards & Reports
Most Business Intelligence or analytics projects today aim to deliver reports and dashboards as the eventual output. The idea is that…

Oct 19, 2022
Tools
Why You Need An Orchestrator In Your Data Stack
Data Orchestration platforms such as Airflow, Dagster and Prefect can be used to execute and co-ordinate all of the data related pipelines…

Oct 19, 2022
General
The Risks Of Consumption Based Pricing For Analytics Tools
Broadly, there are three ways to price any subscription software product: Consumption Based - where the price charged is based entirely on…

Oct 19, 2022
Streaming
Real Time Data and Analytics
The vast majority of Business Intelligence and Analytics solutions in place today operate on out-of-date, backwards looking, historical data…

Oct 19, 2022
Streaming
Real Time Closed Loop Analytics
Many companies today are interested in using data to improve their business and customer experience. Most of these initiatives ultimately…

Oct 19, 2022
General
Why OpenTelemetry Has The Potential To Simplify IT Operations
Telemetry is about collecting information from remote sources and bringing it back into a centralised location for analysis and monitoring…

Oct 19, 2022
Data Strategy
Rise Of The Citizen Data Scientist
Enabling Business Users To Carry Out Their Own Data Science and Analytics

Sep 3, 2022
Databricks
Databricks Structured Streaming Example
Databricks structured streaming is the modern way to work with streaming data in Spark.

Sep 3, 2022
Streaming
Materialised Views On Event Streams
Imagine we have a stream of events representing new orders: Much of the move to event driven architecture is about responding to these…

Sep 3, 2022
Streaming
Using Real Time Data To Guide Employee Next Best Action
There has been a long-held concern as to whether “Artificial Intelligence” will ultimately replace humans en masse in the workforce. For…

Sep 3, 2022
Streaming
Why Event Driven Architecture and Streaming Data Improves Audit & Compliance
Imagine we have a database table of customer details which sits behind a typical web application: Name Address Gender Lifetime Spend Ben…

Sep 3, 2022
General
Big Data's Elephant In The Room
For many years, "Big Data" was a major buzzword in enterprise IT. The theory was that as data volumes grew, we would need different…

Sep 3, 2022
Streaming
How Moving From Batch To Real Time Data Integration Improves The Customer Experience
Over time, the typical business acquires more and more applications. Some will be bespoke, some off the shelf, some cloud hosted SaaS tools…

Sep 3, 2022
General
Is Your Procurement Department Throttling Your Ability To Innovate?
Big Corp Enterprise LTD are in the business of making widgets. They are a huge business and have led their industry for decades. However…

Sep 3, 2022
Streaming
Moving From Batch To Streaming Extract, Transform and Load
There are many situations in Enterprise IT where we need to move, copy or integrate datasets. For example, populating a centralised data…

Sep 3, 2022
Tools
Why Serverless Is The Future For Data & Analytics Platforms
In the bad old days, the first step when building a database, analytics or business intelligence solution would be to order or provision a…

Sep 1, 2022
Streaming
Moving Towards Real Time Data & Analytics
As your business operates day to day, a number of events are taking place. Examples include orders, dispatches, customer enquiries or…

Feb 3, 2022
General
From Titanic Releases To 10,000 Speedboats With Continuous Delivery
Does your software release process look anything like this? Releases are big events marked months ahead on the calendar; A lot of…

Feb 1, 2022
General
Step Away From The Dashboard
The humble dashboard is limiting the potential of what businesses achieve with data and analytics. The idea of a dashboard always sounds…

Jan 4, 2022
General
Three Areas Of Opportunity With Data and Analytics
Data and analytics are crucial for businesses who are looking to make better decisions, operate more efficiently and build great employee…

Jan 2, 2022
Streaming
In Business, Everything Is An Event
Imagine a simple customer interaction for an eCommerce business: A customer visits the website and browses various products; A few days…

Jan 2, 2022
General
From Strategic To Operational Analytics
Historically, businesses have used their data and analytics over strategic time horizons. For instance, collecting information about what…

Jan 1, 2022
General
Our Approach and Partnership Model
Ensemble are a new professional and managed services firm who help companies build sophisticated real-time data and analytics solutions…

Jan 1, 2022
General
Why We Launched Ensemble
Data and analytics are a huge enabler for businesses who are looking to make better decisions, operate more efficiently and build great…