Skip to main content

What is EZ-CDC?

EZ-CDC is a Change Data Capture (CDC) platform that captures database changes in real-time and streams them to analytics platforms. It's designed for teams that need reliable, high-performance data replication without the operational complexity of managing infrastructure.

The Problem We Solve

Modern data teams face several challenges when trying to replicate data:

  1. Batch ETL is too slow: Traditional ETL jobs run hourly or daily, leaving analytics dashboards stale
  2. Existing CDC tools are complex: Kafka, Debezium, and similar tools require significant expertise to operate
  3. Security concerns: Many solutions require opening inbound ports or exposing databases to the internet
  4. Performance overhead: Java-based CDC tools consume significant CPU and memory

How EZ-CDC Works

EZ-CDC uses a pull-based architecture where workers in your cloud account:

  1. Connect to your source database and subscribe to the change stream (e.g., PostgreSQL logical replication)
  2. Capture every INSERT, UPDATE, and DELETE operation in real-time
  3. Transform and batch the changes for efficient loading
  4. Stream to your destination (e.g., StarRocks) via optimized protocols
PostgreSQLSourceEZ-CDC WorkerSnapshotCDC StreamStarRocksSinkWAL StreamStream Load
Simple CDC Flow: PostgreSQL to StarRocks via EZ-CDC

Key Differentiators

1. BYOC (Bring Your Own Cloud)

Your data never leaves your AWS account. Workers run on EC2 instances in your VPC, connecting directly to your databases. The EZ-CDC control plane only manages orchestration—no data passes through our infrastructure.

2. Pull-Based Security Model

Workers poll for jobs from the control plane—no inbound connections required. This means:

  • No public IPs on workers
  • No inbound security group rules
  • Compatible with the strictest network policies

3. Built with Rust

The core CDC daemon (dbmazz) is written in Rust, delivering:

  • 12× faster than Debezium in benchmarks
  • ~5MB memory footprint vs hundreds of MB for Java alternatives
  • SIMD-optimized parsing for maximum throughput

4. Zero-Ops Experience

EZ-CDC handles all operational complexity:

  • Automatic replication slot management
  • Schema change detection and handling
  • Exactly-once delivery guarantees
  • Built-in monitoring and alerting

EZ-CDC vs Alternatives

FeatureEZ-CDCDebezium + KafkaFivetran
LatencySub-secondSecondsMinutes-Hours
Memory Usage~5MB~500MB+N/A (managed)
Data ResidencyYour cloudYour cloudVendor cloud
Inbound ConnectionsNone requiredKafka portsVendor IPs
Pricing ModelPer-workerSelf-managedPer-row

Use Cases

EZ-CDC is ideal for:

  • Real-time Analytics: Power dashboards with live data
  • Data Warehousing: Stream changes to your analytics platform
  • Microservices Sync: Keep services in sync without tight coupling
  • Audit Logging: Capture every change with full history
  • Event Sourcing: Build event-driven architectures

Next Steps

Ready to get started? Continue to: