Starburst Data Products
Starburst Enterprise is a fully supported, production-tested, enterprise-grade distribution of open source Trino (formerly Presto® SQL). It improves performance and security while making it easy to deploy, connect, and manage your Trino environment. By connecting to any source of data, whether it is located on-premises, in the cloud, or across a hybrid cloud environment, Starburst lets your team use the analytics tools they already know and love while accessing data that lives anywhere. For more information on what Starburst Enterprise is and the use cases it solves, please refer to Starburst’s partner profile page in the Collibra Marketplace.
A Starburst data product is a schema that contains one or more data sets, which are represented as views and/or materialized views in the data source where they are located. The Starburst JDBC driver in the Collibra Marketplace automatically ingests the basic technical metadata for the views and materialized views found within each Starburst data product, but it does not ingest the metadata for the data product itself (see list below). This integration automatically brings that information into Collibra to provide more insight into what each data product contains and how it should be used.
Data Products Metadata
- Data product name
- Data product summary
- Data product description
- Data domain
- Catalog (the data source where the data sets are located)
- Data product owners
- SQL query that defines the data set
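As an illustration, the metadata fields listed above could be modeled in Python as small record types. This is only a sketch: the class names, field names, and sample values are hypothetical and do not reflect the integration's actual code or the shape of any Starburst API response.

```python
from dataclasses import dataclass, field


@dataclass
class DataSet:
    """One view or materialized view inside a data product (hypothetical shape)."""
    name: str
    definition_query: str       # SQL query that defines the data set
    materialized: bool = False  # True for a materialized view


@dataclass
class DataProduct:
    """The metadata fields listed above; all names are illustrative assumptions."""
    name: str
    summary: str
    description: str
    data_domain: str
    catalog: str  # the data source where the data sets are located
    owners: list[str] = field(default_factory=list)
    data_sets: list[DataSet] = field(default_factory=list)


# Made-up sample values, for illustration only.
example = DataProduct(
    name="customer_360",
    summary="Unified customer view",
    description="Joins CRM and billing data per customer.",
    data_domain="Sales",
    catalog="hive",
    owners=["data-team@example.com"],
    data_sets=[
        DataSet(
            name="customer_summary",
            definition_query="SELECT customer_id, total_spend FROM billing.totals",
        )
    ],
)
```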
Additionally, this integration will perform the following tasks to ensure Starburst data products are easy to locate in Collibra and provide the business context needed to understand how best to use them.
- Create assets in Collibra for each published data product in Starburst
- Create assets for each domain the data products are associated with
- Extract the aforementioned data products metadata from Starburst
- Add the metadata to the corresponding assets in Collibra
- Link the data domain asset to the appropriate data product asset
- Link the data product asset to the appropriate data sets
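The tasks above can be sketched as a single planning step that turns data product records into asset and relation records before anything is sent to Collibra. This is a minimal sketch under stated assumptions, not the integration's actual code: the function name, dictionary keys, asset type names, relation labels, and the "PUBLISHED" status value are all hypothetical, and no Starburst or Collibra API call is made.

```python
def build_import_plan(products):
    """Turn data product records into asset and relation records.

    `products` is a list of dicts with hypothetical keys:
    name, status, summary, description, domain, catalog, owners, data_sets.
    """
    assets = {}     # keyed by (asset type, name) so each domain is created once
    relations = []  # (source asset key, relation label, target asset key)

    for product in products:
        # Only published data products get assets (status value is assumed).
        if product.get("status") != "PUBLISHED":
            continue

        product_key = ("Data Product", product["name"])
        assets[product_key] = {
            "summary": product["summary"],
            "description": product["description"],
            "catalog": product["catalog"],
            "owners": list(product["owners"]),
        }

        # One asset per domain, linked to every product in that domain.
        domain_key = ("Data Domain", product["domain"])
        assets.setdefault(domain_key, {})
        relations.append((domain_key, "groups", product_key))

        # Data set assets are assumed to exist already (ingested via JDBC),
        # so only the link from the product to each data set is planned here.
        for data_set in product["data_sets"]:
            relations.append((product_key, "contains", ("Data Set", data_set["name"])))

    return assets, relations


# Made-up sample input: one published product and one draft.
sample_products = [
    {
        "name": "sales_overview", "status": "PUBLISHED",
        "summary": "Daily and monthly order totals",
        "description": "Aggregated order data for reporting.",
        "domain": "Sales", "catalog": "hive", "owners": ["sales-data@example.com"],
        "data_sets": [{"name": "orders_daily"}, {"name": "orders_monthly"}],
    },
    {
        "name": "hr_headcount", "status": "DRAFT",
        "summary": "", "description": "", "domain": "HR",
        "catalog": "hive", "owners": [], "data_sets": [],
    },
]

assets, relations = build_import_plan(sample_products)
```

Keeping the plan as plain data separates the mapping logic from the network calls, so it can be inspected in a notebook cell before any assets are created.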
Support for this integration is community based. Please visit the starburst-collibra/data_products GitHub repo to report any issues or to suggest changes to the code. Since this is a community-supported integration, there are no SLAs for responses to issues or pull requests. We will, however, do our best to address these items in a timely manner.
Initial release of this integration.
- Collibra Data Intelligence Cloud
- Python 3.9+
- Collibra API v2
- Collibra Platform v2021+
- Starburst Enterprise 380-e LTS+
- Jupyter notebook
License and Usage Requirements
- Collibra Data Intelligence Cloud
No previous versions of this listing are available.
The following terms shall apply to the extent you receive the source code to this offering.
Notwithstanding the terms of the Binary Code License Agreement under which this integration template is licensed, Collibra grants you, the Licensee, the right to access the source code to the integrated template in order to copy and modify said source code for Licensee’s internal use purposes and solely for the purpose of developing connections and/or integrations with Collibra products and services.
Solely with respect to this integration template, the term “Software,” as defined under the Binary Code License Agreement, shall include the source code version thereof. Except with respect to the foregoing, all remaining terms of the Binary Code License Agreement shall apply to the license of integration template hereunder.