Job Description: We are seeking an experienced PostgreSQL Database Administrator
(DBA) to manage and maintain our on-premise PostgreSQL infrastructure deployed across multiple
data centers. The DBA will be responsible for implementing, monitoring, and optimizing logical
replication between data centers, ensuring high availability, data integrity, and automatic
recovery in case of network or hardware failures.
The role involves working on a one-master-at-a-time replication model, ensuring seamless master
rotation every six months, and maintaining synchronization between the data centers even during
internet or network interruptions.
Key Responsibilities:
- Design, configure, and maintain PostgreSQL databases for high availability and performance
across multiple on-premise data centers.
- Implement and manage logical replication with one master node active at a time.
- Ensure replication auto-start and auto-sync in case of network or connectivity disruptions.
- Plan and execute scheduled master rotations (every six months) between data centers with
minimal downtime.
- Continuously monitor database health, replication status, and performance metrics.
- Develop and maintain scripts for automated failover, recovery, and sync resumption after
outages.
- Perform backup and restore operations ensuring data consistency across environments.
- Optimize database performance through indexing, query tuning, and partitioning strategies.
- Collaborate with DevOps, Networking, and Application teams to ensure database reliability
and security.
Required Skills & Qualifications
- Strong hands-on experience (5+ years) administering PostgreSQL (v12 or higher) in production
environments.
- Proven expertise in setting up and managing logical replication, streaming replication, and
high-availability clusters.
- Experience with multi–data center replication and failover/fallback mechanisms.
- Proficient in designing auto-start and auto-sync replication scripts (using Bash, Python, or
Ansible).
- Understanding of network routing, firewall, and latency optimization for database
replication across sites.
- Experience with PgBouncer, Patroni, or repmgr for HA management.
- Solid understanding of backup and disaster recovery strategies.
- Familiarity with monitoring tools (Prometheus, Grafana, pgAdmin, Zabbix, or similar).
- Strong problem-solving and analytical skills, with a focus on reliability and fault
tolerance.
Preferred Qualifications
- Certification in PostgreSQL Administration or Linux Systems.
- Experience with Docker-based PostgreSQL deployments.
- Prior work with Python Flask or web applications connecting to PostgreSQL backends.
- Knowledge of PostgreSQL replication conflict resolution and logical decoding.
Key Environment Details
- Deployment Model: On-Premise
- Data Centers: DC1, DC2 (active replication)
- Replication Type: Logical (one master active)
- Failover Behavior: Auto start on network recovery, auto sync after downtime
- Master Rotation: Scheduled every 6 months