Back to all roles

[Remote] Staff Platform Engineer

Remote-first Full-time Now hiring

Note: The job is a remote job and is open to candidates in USA. Rezdy is hiring a Staff DevOps Engineer to join their new product, Manifest, in a dynamic environment. The role involves owning critical infrastructure, improving developer experience, and collaborating closely with product engineers and DevOps leadership.

Responsibilities

  • Work on a team with two other platform engineers
  • Own and evolve the infrastructure that supports Manifest, including AWS environments, networking, compute, data services, observability, CI/CD, and operational tooling
  • Work with Pulumi and TypeScript to define, maintain, and improve infrastructure as code across the platform
  • Support and improve our containerized application platform, including deployment pipelines, rollback mechanisms, and runtime configuration
  • Help operate and harden our data infrastructure, including connection pooling, backups, disaster recovery, replication, and safe schema-change practices
  • Partner with engineers to improve the reliability and safety of releases, including database migrations, deployment workflows, environment management, and production readiness checks
  • Improve CI/CD workflows so that builds, tests, infrastructure changes, and deployments are fast, reliable, and easy for engineers to understand
  • Lead observability and incident readiness work, including alerting, dashboards, SLOs, runbooks, incident response practices, and post-incident follow-up
  • Help ensure the platform is secure, cost-conscious, and maintainable as the product scales
  • Mentor engineers on infrastructure, operations, reliability, and production ownership

Skills

  • Deep production experience with AWS, especially services such as ECS/Fargate, RDS/Aurora PostgreSQL, VPC networking, load balancing, IAM, KMS, Secrets Manager, CloudFront, WAF, and related managed services
  • Experience designing and operating systems that serve a global user base, seamless multi-region availability, and disaster recovery procedures
  • Treats reliability, scalability, performance, and observability as a first-class design constraint, building these into designs from the start rather than bolting them on later
  • Strong infrastructure-as-code experience. Pulumi with TypeScript is ideal, but deep experience with Terraform or another mature IaC approach is also valuable
  • Strong operational knowledge of PostgreSQL, including performance investigation, connection pooling, backups, replication, locking, migrations, and safe schema-change patterns
  • Experience designing and maintaining CI/CD systems, ideally with GitHub Actions, OIDC-based cloud authentication, container builds, environment promotion, required checks, and deployment gates
  • Experience supporting containerized production workloads and improving deployment safety, rollback strategies, and runtime reliability
  • Strong observability and incident response experience, including metrics, logs, traces, alerting, dashboards, runbooks, and post-incident learning
  • The ability to work effectively in ambiguity, make pragmatic tradeoffs, and communicate clearly with both infrastructure specialists and product engineers
  • A track record of raising the engineering bar through reusable patterns, documentation, automation, mentoring, and thoughtful technical leadership

Company Overview

  • The world’s leading online booking and distribution platform powering the experiences industry. It is a sub-organization of Checkfront. It was founded in 2011, and is headquartered in Sydney, New South Wales, AUS, with a workforce of 51-200 employees. Its website is http://rezdy.com.
  • Apply To This Job

    More remote roles

    [Remote] Portfolio Administrator - Affordable Housing - California

    Remote-first Full-time

    [Remote] Clinical Analyst

    Remote-first Full-time

    [Remote] AI Factory, Value Engineer

    Remote-first Full-time

    [Remote] Sr. Manager of Sales Operations

    Remote-first Full-time

    [Remote] Senior Team Manager Campaign Operations

    Remote-first Full-time

    [Remote] CRM Operations Manager

    Remote-first Full-time

    [Remote] SAP Supply Chain Consultant

    Remote-first Full-time

    [Remote] Cloud Security Engineer

    Remote-first Full-time

    [Remote] Customer Support Specialist

    Remote-first Full-time

    [Remote] Environmental Engineer (Water Resources)

    Remote-first Full-time

    Family Nurse Practitioner - Remote (California Licensed)

    Remote-first Full-time

    Experienced Customer Service Representative – Remote Travel Industry Support

    Remote-first Full-time

    Remote Part‑Time Data Entry Specialist – E‑Commerce Product Management & Inventory Control at arenaflex

    Remote-first Full-time

    IOS Developer ARG (Remote)

    Remote-first Full-time

    Web Developer, Designer, SEO & marketing

    Remote-first Full-time

    Experienced Data Entry Analyst – Remote Opportunity with arenaflex

    Remote-first Full-time

    Sales Development Representative, New Business Mid-Market

    Remote-first Full-time

    Case Management Care Coordinator - REMOTE ! M-F 8:30 - 5:30 Pacific or Mountain Time

    Remote-first Full-time

    Experienced Customer Service Associate – Delivering Exceptional Service in a Dynamic Remote Environment

    Remote-first Full-time

    Entry-Level Remote Data Entry Specialist – Accurate Data Management & Process Optimization at arenaflex

    Remote-first Full-time