Overview
The RDI-DATA Collibra JSON Metadata Importer is an enterprise-grade, cloud-native Java application purpose-built to automate and accelerate the ingestion of technical metadata into the Collibra Data Catalog at industrial scale. Designed with true vendor independence, it seamlessly reads metadata in JSON format and maps it to Collibra’s model without requiring code changes or additional setup — regardless of the data source. The application automatically detects any changes in metadata content, ensuring the Data Catalog is continuously and accurately updated with the latest information. By capturing and flagging metadata changes at the source, the Importer provides full lifecycle traceability, ensuring that organizations maintain an accurate, living representation of their data assets.
Engineered for performance and resilience, the Importer’s metadata extraction and Collibra import modules are fully decoupled to maximize scalability and flexibility. Each data source is dynamically associated with a Collibra community through parameter files, with schema-level domains automatically generated to streamline onboarding. Built natively on Google Cloud Platform leveraging Cloud Run, Cloud Storage, Pub/Sub, and Secret Manager, it ensures robust security and operational efficiency. Real-time process statistics support KPI tracking and rapid issue remediation—making this solution the backbone of any industrial-strength metadata integration strategy for Collibra.
