Open Data Blend March 2022 Update

25th April 2022

By Open Data Blend Team

The Open Data Blend March 2022 update includes a new cloud storage integration through the Open Data Blend for Python library, the introduction of consulting services, and the launch of an affiliate programme.

Open Data Blend Datasets

English Prescribing Data for February 2022 Is Available

We have updated the Prescribing dataset with the latest available NHS English Prescribing Data which includes activity up until February 2022. You can download the data from the Open Data Blend Datasets Prescribing page, analyse it directly in supported BI tools through the Open Data Blend Analytics service, or instantly explore insights through the Open Data Blend Insights service.

Google Cloud Storage Support in Open Data Blend for Python

Last month, we extended Open Data Blend for Python to support Azure Blob Storage, Azure Data Lake Storage (ADLS) Gen2, and Amazon S3 as target file systems. This month, we have added support for Google Cloud Storage.

Open Data Blend for Python Google Cloud Storage Support

With a few simple lines of code, you can quickly ingest our datasets into your data lake in Google Cloud. Once ingested, you can interactively query and analyse the ORC and Parquet files using data lake analytics services like Google BigQuery and Databricks.

Here is an example of how easy it is to copy our datasets to Google Cloud Storage using Open Data Blend for Python:

import opendatablend as odb

dataset_path = 'https://packages.opendatablend.io/v1/open-data-blend-road-safety/datapackage.json'
access_key = '<ACCESS_KEY>' # The access key can be set to an empty string if you are making a public API request

# Specify the resource name of the data file. In this example, the 'date' data file will be requested in .parquet format.
resource_name = 'date-parquet'

# Get the data and store the output object using the Google Cloud Storage file system
configuration = {
    "service_account_private_key_file" : "<PATH_TO_SERVICE_ACCOUNT_PRIVATE_KEY_FILE>",
    "bucket_name" : "<BUCKET_NAME>", # e.g. odbp-integration
    "bucket_location" : "<BUCKET_LOCATION>" # e.g. europe-west2
}

output = odb.get_data(dataset_path, resource_name, access_key=access_key, file_system="google_cloud_storage", configuration=configuration)

# Print the file locations
print(output.data_file_name)
print(output.metadata_file_name)

Want to learn more about how Open Data Blend for Python can help you to integrate our datasets? Head over to the GitHub or PyPI page.

Introducing Open Data Blend Consulting

We know that even with access to data that has been optimised for analytics, there is still quite a bit of work needed to make the most out of it. That is why we have introduced Open Data Blend Consulting, a set of analytics consulting services to accelerate the journey from dataset acquisition to business value.

Open Data Blend Consulting

Open Data Blend Consulting focuses on four technology areas that can be used to transform Open Data Blend datasets into a competitive edge for businesses: Data Engineering, SQL Lakehouse, Business Intelligence Insights, and Advanced Analytics.

Open Data Blend Consulting Service Areas

You can learn more about Open Data Blend Consulting here.

Introducing the Open Data Blend Affiliate Programme

We have launched the Open Data Blend Affiliate Programme which rewards individuals or companies with up to 20% recurring revenue on the Open Data Blend sales that they generate.

Open Data Blend Affiliate Programme

You can learn more about the Open Data Blend Affiliate Programme here.

Follow Us and Stay Up to Date

Follow us on X and LinkedIn to keep up to date with Open Data Blend, open data, and open-source data analytics technology news. Be among the first to know when there's something new.

Blog hero image by Caleb George on Unsplash.

Created and maintained by