Streaming · Event platform connector
The Confluent data catalog your whole team can trust.
The Dawiso Confluent Kafka data catalog turns your Confluent Cloud into a searchable inventory: clusters, topics, Stream Governance schemas and consumer groups.
First things first
What is a data connector?
A data connector is the bridge between a tool in your stack and the catalog that gives you a unified view of it. Once a connector is configured, it reaches into the source system on a schedule, reads out the metadata - schemas, tables, dashboards, jobs, ownership, lineage - and represents it inside the catalog. Your actual rows and values stay where they are.
Connectors are the reason a data catalog can answer questions like "which Power BI dashboard depends on this Snowflake table?" or "who owns the orders topic in Kafka?" - automatically, without anyone keeping a spreadsheet up to date.
Three properties separate a good connector from a brittle one: it should be read-only and safe, it should be incremental so a full re-scan isn't required for every refresh, and it should resolve lineage across system boundaries, not just inside one tool.
About the platform
What is Confluent Kafka?
Confluent Kafka is the managed Apache Kafka platform from the company founded by Kafka's original creators. Teams use Confluent Cloud (or Confluent Platform on-prem) when they want Kafka without running brokers, with extras around it: Stream Governance, Schema Registry, ksqlDB, RBAC and a Stream Catalog for topic discovery.
Confluent's Stream Catalog handles topics inside Confluent. What it doesn't cover is the Power BI report consuming the downstream warehouse table, or the data product the business actually owns end to end. That's where the Dawiso Confluent Kafka data catalog joins the picture: read-only, metadata-only, and cross-platform.
Architecture
How Dawiso connects to Confluent
A small read-only role on the Confluent side. The Dawiso scanner pulls metadata on a schedule. Everything ends up in your catalog, business-readable.
Source
Confluent Cloud environment
- Clusters & REST endpoint
- Topics & partitions
- Stream Governance schemas
- Consumer groups & ACLs
Dawiso scanner
Read-only metadata
- Schema & object discovery
- Dependency resolution
- SQL flow parsing (optional)
- Sampling on opt-in
Catalog
Dawiso platform
- Searchable metadata
- Lineage & ownership
- Business glossary
- Policy & classifications
Connection details
- Protocol
- Confluent Cloud REST API + Schema Registry API
- Authentication
- API Key + Secret for cluster · API Key + Secret for Schema Registry · service-account ACLs
- Lineage
- Topics, schemas and consumer groups cataloged from cluster metadata and the Stream Governance Schema Registry, with ownership and schema versions documented in Dawiso
Setup
Connect Confluent in 4 steps
- 01
Get cluster ID and REST endpoint
In Confluent Cloud, open Cluster Overview > Cluster settings. Copy the Cluster ID and the REST endpoint URL. Store them for the connection form.
- 02
Create cluster API key
Under Cluster Overview > API keys, create a Service-account API key with ACLs scoped to DescribeConfigs on Cluster, Describe on Consumer Group, Describe + DescribeConfigs on Topic.
- 03
Create Schema Registry API key
From the environment's Stream Governance API panel, copy the endpoint URL and create a Schema Registry API key + secret. Both pairs are needed to ingest topics and schemas.
- 04
Connect and run ingestion
Provide cluster + Schema Registry endpoints and credentials in Dawiso. Pick projects via regex, save the data source, run ingestion. Scheduled incremental sync keeps the catalog current.
Capabilities
What you get with the Confluent connector
-
Topic & cluster catalog
Every topic across every Confluent environment is searchable, with partition count, retention, owner and the team that built the producer.
-
Stream Governance sync
Avro, Protobuf and JSON schemas from the Confluent Schema Registry, with version history and diff view between releases.
-
Consumer group visibility
Consumer groups are pulled into the catalog alongside the topics they read, so it is clear which groups are registered against each topic.
-
PII in events
Classify schema fields once. Dawiso flags every Confluent topic carrying email, IBAN or government IDs across all clusters and environments.
-
Data contracts
Promote a schema to a contract. Block breaking changes before they ship and notify subscribers when the contract evolves.
-
RBAC & ACL visibility
Cluster ACLs and service-account assignments are pulled into the catalog so business sees who can read what, where.
Business value
Why teams turn on the Confluent connector
- 0
Silent schema breakages
Schema diffs and consumer notifications stop a Protobuf rename from taking down three downstream services overnight.
- −80%
Time to find a topic
New engineers stop pinging Slack to ask 'which topic has order events.' They search the Confluent Kafka data catalog and read.
- EU-grade
Streaming governance
PII flowing through Confluent is classified, auditable and policy-tracked the same way as data sitting in your warehouse.
Ready to catalog your Confluent?
Set up the connector in an afternoon. See your first lineage graph the same day.
Frequently asked questions
Does Confluent have a data catalog?
What is Kafka metadata?
What is the difference between Kafka and Confluent Kafka?
What permissions does Dawiso need in Confluent?
Does Dawiso consume from our topics?
How does Dawiso compare to Confluent Stream Catalog?
Does the connector support Confluent Platform on-prem?
Explore more connectors
Confluent is one of 30+ connectors. Bring your whole stack into the catalog.
-
Data Warehouse Snowflake -
Data Lakehouse Databricks -
Business Intelligence Power BI -
Business Intelligence Tableau -
Data Warehouse Google BigQuery -
Data Warehouse Amazon Redshift