Avro file or schema

Hackolade imports the schema from .avsc or .avro files, located on your local file system or in a shared directory, and represents it as an Entity Relationship Diagram and schema structure. When multiple files are selected, you can either combine the schemas of the selected files (default), or create a separate schema in the model for each selected file.

 

Cloud Selection - combine Avro schemas
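For readers who have not seen one, an .avsc file is simply an Avro schema stored as JSON. The sketch below writes a small record schema to disk so that it could be selected for import. The `Customer` record, its fields, and the `write_avsc` helper are hypothetical and serve only as an illustration.

```python
import json
import tempfile
from pathlib import Path

# Hypothetical record schema, used only for illustration.
customer_schema = {
    "type": "record",
    "name": "Customer",
    "namespace": "com.example",
    "fields": [
        {"name": "id", "type": "long"},
        {"name": "email", "type": "string"},
        # A union with "null" and a null default makes the field optional.
        {"name": "nickname", "type": ["null", "string"], "default": None},
    ],
}


def write_avsc(schema: dict, directory: Path) -> Path:
    """Write the schema as <record name>.avsc and return the file path."""
    path = directory / f"{schema['name']}.avsc"
    path.write_text(json.dumps(schema, indent=2), encoding="utf-8")
    return path


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        avsc_path = write_avsc(customer_schema, Path(tmp))
        print(avsc_path.read_text(encoding="utf-8"))
```

Each .avsc file holds one top-level schema, which is why selecting several files leads to the choice between combining them and keeping them separate.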

 

Avro files and schemas can also be reverse-engineered from AWS S3, Azure Blob Storage, and Google Cloud Storage.

 

AWS S3

Give the connection a meaningful name so you can identify it later, and provide the URI of your S3 bucket and, optionally, a folder path.

 

Cloud Storage - AWS S3 connection for Avro schema

 

 

If the S3 bucket is private, you must also provide authentication parameters (Access Key ID and Secret Access Key):

Cloud Storage - AWS S3 authentication avro schema

 

If you wish to handle AWS authentication through the credentials file, you may leave the Access Key ID and Secret Access Key fields blank. In that case, Hackolade supplies credentials following the recommendations described here.
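As a reminder of what the AWS shared credentials file looks like (by default `~/.aws/credentials`), the sketch below parses a sample in that INI format. The key values and the `modeling` profile are placeholders, and `read_profile` is a hypothetical helper; it does not reflect how Hackolade itself resolves credentials.

```python
import configparser

# Placeholder values; the layout follows the AWS shared credentials file format.
SAMPLE_CREDENTIALS = """\
[default]
aws_access_key_id = AKIAEXAMPLEKEYID
aws_secret_access_key = exampleSecretAccessKey

[modeling]
aws_access_key_id = AKIAOTHEREXAMPLE
aws_secret_access_key = otherExampleSecret
"""


def read_profile(text: str, profile: str = "default") -> dict:
    """Return the key pair stored under the given profile name."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = parser[profile]
    return {
        "access_key_id": section["aws_access_key_id"],
        "secret_access_key": section["aws_secret_access_key"],
    }


if __name__ == "__main__":
    print(read_profile(SAMPLE_CREDENTIALS))
    print(read_profile(SAMPLE_CREDENTIALS, profile="modeling"))
```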

 

 

Azure Blob Storage

Give the connection a meaningful name so you can identify it later, and provide the Container name and the Storage account name.

 

Cloud Storage - Azure connection avro schema

 

If the storage account is private, you must also provide your Storage access key:

Cloud Storage - Azure authentication avro schema

 

 

If you wish to filter files, you may enter a file name prefix:

Cloud Storage - Azure prefix blob name avro schema
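To illustrate the effect of a prefix filter, the sketch below keeps only the blob names that start with a given string. It assumes the prefix is matched case-sensitively against the start of the full blob name, including any virtual folder; the blob names and the `filter_by_prefix` helper are hypothetical.

```python
def filter_by_prefix(blob_names: list[str], prefix: str) -> list[str]:
    """Keep only the blob names that start with the given prefix."""
    return [name for name in blob_names if name.startswith(prefix)]


if __name__ == "__main__":
    # Hypothetical container content.
    blobs = [
        "orders/order_created.avsc",
        "orders/order_shipped.avsc",
        "customers/customer.avsc",
        "readme.txt",
    ]
    for name in filter_by_prefix(blobs, "orders/"):
        print(name)
```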

 

Google Cloud Storage

Give the connection a meaningful name so you can identify it later, and provide the URI of your GCS bucket and, optionally, a folder path.

Cloud Storage - Google connection avro schema

 

If the bucket is private, you must also provide the private key:

Cloud Storage - Google authentication avro schema

 

 

Confluent Schema Registry on Confluent Cloud

Give a meaningful name to the connection to identify it for later, choose Cloud as a source, and provide the URL to your Schema Registry:

 

Confluent Schema Registry - Cloud connection avro schema

 

Then provide the API key and API secret:

Confluent Schema Registry - Cloud auth avro schema
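To show how a registry URL and an API key/secret pair fit together, the sketch below builds (without sending) a request to the Schema Registry REST endpoint that lists subjects, using HTTP Basic authentication. The host name, key, and secret are placeholders, and `build_subjects_request` is a hypothetical helper; this is not a description of Hackolade's internal implementation.

```python
import base64
import urllib.request


def build_subjects_request(
    registry_url: str, api_key: str, api_secret: str
) -> urllib.request.Request:
    """Build a GET request for <registry_url>/subjects with Basic auth."""
    # HTTP Basic auth: base64("key:secret") in the Authorization header.
    credentials = f"{api_key}:{api_secret}".encode("utf-8")
    token = base64.b64encode(credentials).decode("ascii")
    url = registry_url.rstrip("/") + "/subjects"
    return urllib.request.Request(
        url,
        headers={
            "Authorization": f"Basic {token}",
            "Accept": "application/vnd.schemaregistry.v1+json",
        },
    )


if __name__ == "__main__":
    # Placeholder values; replace with your own registry URL and credentials.
    request = build_subjects_request(
        "https://schema-registry.example.com", "MY_API_KEY", "MY_API_SECRET"
    )
    print(request.get_method(), request.full_url)
    # Sending it would be: urllib.request.urlopen(request)
```

Running such a request from a terminal can be a quick way to check the URL and credentials before entering them in the connection settings.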

 

Confluent Schema Registry on-premises

Give a meaningful name to the connection to identify it for later, choose on-premise as a source, and provide the URL to your Schema Registry:

Confluent Schema Registry - on-prem connection avro schema

 

Then provide your username and password:

Confluent Schema Registry - on-prem auth avro schema

 

 

Azure Schema Registry for Event Hubs

Give a meaningful name to the connection to identify it for later, choose Azure Schema Registry, and provide the URL to your Schema Registry. Currently, it is not possible to automatically retrieve the list of Schema Groups, so you must provide the name of the Schema Group concerned. If you need to access more than one Schema Group, you may create one connection per Schema Group.

Azure Schema Registry connection settings avro schema

 

Then you need to provide the authentication parameters:

Azure Schema Registry connection auth params avro schema

 

Hackolade communicates with the Azure Schema Registry via REST APIs.  If you already use Hackolade for Cosmos DB, the following steps may have already been performed.  Otherwise, please follow the instructions below:

 

The Hackolade application must be registered so that Azure accepts the REST API calls, as per these instructions. The Application (client) ID and the Directory (tenant) ID are retrieved from the App registration Overview screen:

Azure App registration overview

 

Note: it is critical to assign the proper role to the application just registered. This is done by following the steps outlined here:

Azure IAM role assignment

 

Finally, the Application secret is obtained from the Certificates & secrets screen of the App registration: 

Azure App registration secrets screen

 

If you don't know how to generate some of the above values, you may want to consult this document.
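To make the role of the three values concrete, the sketch below builds (without sending) an OAuth 2.0 client-credentials token request against the Microsoft identity platform, which is the usual way a registered application obtains a token for REST calls. The `build_token_request` helper and the placeholder IDs are hypothetical, and the Event Hubs scope shown as the default is an assumption; Hackolade's actual calls may differ.

```python
import urllib.parse
import urllib.request


def build_token_request(
    tenant_id: str,
    client_id: str,
    client_secret: str,
    scope: str = "https://eventhubs.azure.net/.default",  # assumed scope
) -> urllib.request.Request:
    """Build a POST request for a client-credentials access token."""
    url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"
    body = urllib.parse.urlencode(
        {
            "grant_type": "client_credentials",
            "client_id": client_id,          # Application (client) ID
            "client_secret": client_secret,  # Application secret
            "scope": scope,
        }
    ).encode("ascii")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder values taken from the App registration screens.
    request = build_token_request(
        tenant_id="00000000-0000-0000-0000-000000000000",
        client_id="11111111-1111-1111-1111-111111111111",
        client_secret="example-secret",
    )
    print(request.get_method(), request.full_url)
```

If the role assignment is missing, a token can typically still be issued, but subsequent calls to the registry are rejected, which is why the role step matters.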

 

 

Pulsar Schema Registry

Give a meaningful name to the connection to identify it for later, choose the Pulsar connection type, and provide the URL to your Schema Registry:

Pulsar connection settings avro schema
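As an illustration of what the URL points to, the sketch below composes the Pulsar admin REST path that returns the latest schema of a topic. It assumes the registry is reached through the broker's admin API under `/admin/v2/schemas`; the base URL, tenant, namespace, and topic are placeholders, and `build_schema_url` is a hypothetical helper.

```python
from urllib.parse import quote


def build_schema_url(base_url: str, tenant: str, namespace: str, topic: str) -> str:
    """Compose the admin REST URL for the latest schema of a topic."""
    # Percent-encode each path segment so special characters stay in place.
    segments = "/".join(quote(part, safe="") for part in (tenant, namespace, topic))
    return f"{base_url.rstrip('/')}/admin/v2/schemas/{segments}/schema"


if __name__ == "__main__":
    # Placeholder values for a local standalone broker.
    print(build_schema_url("http://localhost:8080", "public", "default", "orders"))
```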