What is EZ-CDC?
EZ-CDC is a Change Data Capture (CDC) platform that captures database changes in real-time and streams them to analytics platforms. It's designed for teams that need reliable, high-performance data replication without the operational complexity of managing infrastructure.
The Problem We Solve
Modern data teams face several challenges when trying to replicate data:
- Batch ETL is too slow: Traditional ETL jobs run hourly or daily, leaving analytics dashboards stale
- Existing CDC tools are complex: Kafka, Debezium, and similar tools require significant expertise to operate
- Security concerns: Many solutions require opening inbound ports or exposing databases to the internet
- Performance overhead: Java-based CDC tools consume significant CPU and memory
How EZ-CDC Works
EZ-CDC uses a pull-based architecture where workers in your cloud account:
- Connect to your source database and subscribe to the change stream (e.g., PostgreSQL logical replication)
- Capture every INSERT, UPDATE, and DELETE operation in real-time
- Transform and batch the changes for efficient loading
- Stream to your destination (e.g., StarRocks) via optimized protocols
Key Differentiators
1. BYOC (Bring Your Own Cloud)
Your data never leaves your AWS account. Workers run on EC2 instances in your VPC, connecting directly to your databases. The EZ-CDC control plane only manages orchestration—no data passes through our infrastructure.
2. Pull-Based Security Model
Workers poll for jobs from the control plane—no inbound connections required. This means:
- No public IPs on workers
- No inbound security group rules
- Compatible with the strictest network policies
3. Built with Rust
The core CDC daemon (dbmazz) is written in Rust, delivering:
- 12× faster than Debezium in benchmarks
- ~5MB memory footprint vs hundreds of MB for Java alternatives
- SIMD-optimized parsing for maximum throughput
4. Zero-Ops Experience
EZ-CDC handles all operational complexity:
- Automatic replication slot management
- Schema change detection and handling
- Exactly-once delivery guarantees
- Built-in monitoring and alerting
EZ-CDC vs Alternatives
| Feature | EZ-CDC | Debezium + Kafka | Fivetran |
|---|---|---|---|
| Latency | Sub-second | Seconds | Minutes-Hours |
| Memory Usage | ~5MB | ~500MB+ | N/A (managed) |
| Data Residency | Your cloud | Your cloud | Vendor cloud |
| Inbound Connections | None required | Kafka ports | Vendor IPs |
| Pricing Model | Per-worker | Self-managed | Per-row |
Use Cases
EZ-CDC is ideal for:
- Real-time Analytics: Power dashboards with live data
- Data Warehousing: Stream changes to your analytics platform
- Microservices Sync: Keep services in sync without tight coupling
- Audit Logging: Capture every change with full history
- Event Sourcing: Build event-driven architectures
Next Steps
Ready to get started? Continue to:
- Architecture Overview - Understand how components interact
- BYOC Model - Learn about our security-first deployment
- Quickstart - Deploy your first CDC pipeline