ADR-002: PostgreSQL as Primary Storage

Status	Date	Deciders
Accepted	2024-01-15	Core Team

Context

go-mink needs a reliable, production-ready storage backend for events. The storage must support:

ACID Transactions: Atomic event appends with version checks
Optimistic Concurrency: Detect concurrent modifications
Efficient Queries: Load events by stream, position, or time
Subscriptions: Support for change notifications
Scalability: Handle millions of events
Operational Maturity: Well-understood operations, backup, monitoring

We need to choose between:

Dedicated event stores (EventStoreDB)
Relational databases (PostgreSQL, MySQL)
Document databases (MongoDB)
Custom solutions

Decision

We will use PostgreSQL as the primary storage backend with a custom schema optimized for event sourcing.

Schema Design

CREATE TABLE mink_events (
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
    stream_id VARCHAR(255) NOT NULL,
    version BIGINT NOT NULL,
    type VARCHAR(255) NOT NULL,
    data JSONB NOT NULL,
    metadata JSONB DEFAULT '{}',
    global_position BIGSERIAL,
    timestamp TIMESTAMPTZ DEFAULT NOW(),
    
    CONSTRAINT unique_stream_version UNIQUE (stream_id, version)
);

CREATE INDEX idx_events_stream_id ON mink_events(stream_id);
CREATE INDEX idx_events_global_position ON mink_events(global_position);
CREATE INDEX idx_events_type ON mink_events(type);
CREATE INDEX idx_events_timestamp ON mink_events(timestamp);

Key Features Used

JSONB: Flexible event data storage with indexing
BIGSERIAL: Global ordering via global_position
UNIQUE Constraint: Optimistic concurrency via (stream_id, version)
LISTEN/NOTIFY: Real-time subscriptions

Concurrency Control

-- Atomic append with version check
INSERT INTO mink_events (stream_id, version, type, data, metadata)
SELECT $1, COALESCE(MAX(version), 0) + 1, $2, $3, $4
FROM mink_events
WHERE stream_id = $1
HAVING COALESCE(MAX(version), 0) = $5
RETURNING *;

Consequences

Positive

Familiar Technology: Most teams already know PostgreSQL
Operational Maturity: Proven backup, replication, monitoring
JSONB Power: Flexible schema with query capabilities
Transaction Support: Can combine event writes with other operations
Cloud Availability: Available on all major cloud providers
No Vendor Lock-in: Open source, portable
Cost Effective: No additional licensing for event store
Ecosystem: Rich tooling (pgAdmin, pg_dump, etc.)

Negative

Not Purpose-Built: Less optimized than dedicated event stores
Subscription Limitations: LISTEN/NOTIFY has message size limits
Partitioning Complexity: Manual partitioning for very large datasets
No Built-in Projections: Must implement projection engine ourselves

Neutral

Schema Management: Need to handle migrations
Connection Pooling: Standard PostgreSQL considerations apply

Alternatives Considered

Alternative 1: EventStoreDB

Description: Purpose-built event store database.

Pros:

Optimized for event sourcing
Built-in subscriptions
Projections in JavaScript

Rejected because:

Additional infrastructure component
Less familiar to most teams
Licensing considerations for enterprise features
Smaller community in Go ecosystem

Alternative 2: MongoDB

Description: Document database with change streams.

Pros:

Flexible schema
Good Go driver
Change streams for subscriptions

Rejected because:

Weaker transaction guarantees (improved in recent versions)
Less suitable for strict ordering requirements
JSONB in PostgreSQL provides similar flexibility

Alternative 3: Apache Kafka

Description: Distributed event streaming platform.

Pros:

Excellent for high-throughput
Built-in partitioning
Strong ecosystem

Rejected because:

Complex infrastructure
Not designed for entity-level streams
Retention policies complicate event sourcing
Overkill for many use cases

Alternative 4: Custom File-Based Store

Description: Append-only log files.

Rejected because:

Would need to build everything from scratch
No query capabilities
Complex replication/backup
Not suitable for production

Implementation Notes

Connection Pooling

We use pgx for connection pooling:

config, _ := pgxpool.ParseConfig(connStr)
config.MaxConns = 25
config.MinConns = 5
pool, _ := pgxpool.NewWithConfig(ctx, config)

Subscription via Polling

For reliable subscriptions, we use polling with checkpoints rather than LISTEN/NOTIFY:

func (s *Subscriber) Poll(ctx context.Context, fromPosition uint64) ([]StoredEvent, error) {
    return s.db.Query(ctx, `
        SELECT * FROM mink_events 
        WHERE global_position > $1 
        ORDER BY global_position 
        LIMIT 100
    `, fromPosition)
}

ADR-002: PostgreSQL as Primary Storage

Context

Decision

Schema Design

Key Features Used

Concurrency Control

Consequences

Positive

Negative

Neutral

Alternatives Considered

Alternative 1: EventStoreDB

Alternative 2: MongoDB

Alternative 3: Apache Kafka

Alternative 4: Custom File-Based Store

Implementation Notes

Connection Pooling

Subscription via Polling

References