Senior Site Reliability Engineer

Welcome to Real Work From Anywhere.

The only fully location independent job board. We hand pick every job on this site. Live and work from anywhere.

💜 Love this site? plz tweet about us

Send new remote jobs to
every week!

Join ourDiscord community|Subredditfor real-time job alerts!

Sponsored
Sponsor logo
Simplify your job search with Professional Headshots
Get 3x more responses to your job applications with studio-quality professional headshots right from your home — no studio needed!

Senior Site Reliability Engineer at Owner - Work From Anywhere

👋 About Owner.com
Owner is the all-in-one platform that restaurants use to succeed online.

Thousands of restaurant owners use our tools to build their website, drive online orders, create their own branded app, manage their customer relationships, and set up marketing automations.

You can think of it as Shopify meets HubSpot, but specifically for restaurants.

Learn more about the problems we are solving for our customers here.

🌎 Our vision
We’re starting by helping independent restaurants succeed online.

But it’s not just restaurants that need our help. Most local businesses are struggling with these same problems. Huge technology corporations are taking their customers, bleeding their profits, and making it hard for them to survive.

Once we nail the solution for restaurants – we’ll scale it into every other local business type.

In the future we envision, tens of millions of local business owners will use our technology to succeed in the digital age.

🚀 Our traction
In just over 3 years we've generated tens of millions in revenue, served millions of guests, and processed hundreds of millions of online orders.

More importantly, we’ve helped thousands of restaurant owners save their businesses - and not  only survive, but thrive.

⭐ Our team
Our team grew from under 100 to nearly 200 talented people in 2024. We’ve got top talent from the most successful companies in SMB software, including: Shopify, HubSpot, DoorDash, ServiceTitan, Rappi, Faire and Stripe.

We’ll be scaling even faster in 2025 to keep pace with our customer growth.

🌆 Where we work
Owner is a remote-first, global company headquartered in San Francisco, with a sales hub in Toronto. For a few of our roles we prioritize in-person collaboration at one of our office locations. Most of our employees are distributed throughout the globe. Please review the role description and discuss with your recruiter for more details on location!

🔍 Why we’re looking for you
Owner’s restaurant-commerce platform is growing fast, and our infrastructure needs to grow even faster. We’re looking for a mission-driven Senior SRE/DevOps Engineer to keep our systems always-on, observable, and deploy-ready while helping developers ship with confidence. You’ll split your time between site-reliability engineering (designing for uptime, performance, and resiliency) and DevOps enablement (tooling, CI/CD, and automation).

Your work will directly power the websites, ordering flows, payments, and mobile apps that thousands of restaurants-and millions of diners-depend on every day.

Our Stack
Infrastructure & Ops: AWS, Terraform, ECS/Fargate, Postgres, MongoDB, Kafka, Datadog, Cloudflare, GitHub, Buildkite
Backend: Node.js, TypeScript, NestJS, Mikro-ORM
Frontend: React, React Native, Vue.js
(You don’t need to know every tool--depth in similar technologies is great.)
🔍 Why we’re looking for you
Owner’s restaurant-commerce platform is growing fast, and our infrastructure needs to grow even faster. We’re looking for a mission-driven Senior SRE/DevOps Engineer to keep our systems always-on, observable, and deploy-ready while helping developers ship with confidence. You’ll split your time between site-reliability engineering (designing for uptime, performance, and resiliency) and DevOps enablement (tooling, CI/CD, and automation).

Your work will directly power the websites, ordering flows, payments, and mobile apps that thousands of restaurants-and millions of diners-depend on every day.

Our Stack
Infrastructure & Ops: AWS, Terraform, ECS/Fargate, Postgres, MongoDB, Kafka, Datadog, Cloudflare, GitHub, Buildkite
Backend: Node.js, TypeScript, NestJS, Mikro-ORM
Frontend: React, React Native, Vue.js
(You don’t need to know every tool--depth in similar technologies is great.)

💥 The impact you will have

  • Design for reliability: Set SLOs/SLIs, build self-healing architectures, and drive incident-prevention projects that keep our APIs and real-time ordering flows <100 ms p95.
  • Own observability: Level-up dashboards, alerts, and distributed tracing so teams can detect issues before customers do.
  • Automate deployments: Evolve our Buildkite pipelines and Terraform modules to give engineers <10-minute, one-click rollouts (and clean rollbacks).
  • Champion security & compliance: Harden infra with least-privilege IAM, threat-model topology changes, and guide SOC 2 / PCI efforts.
  • Partition & scale data-stores: Tune Postgres for multi-TB workloads, maintain Mongo sharding, and shepherd Kafka topic management as event volume climbs.
  • Lead incident response: Rotate with the on-call SREs, run blameless post-mortems, and convert findings into durable fixes.
  • Mentor & collaborate: Pair with product engineers on capacity reviews, guide junior devs on Docker best-practices, and evangelize “you build it, you run it.”
  • 🤝 Who you’ll work with

  • Partners daily with backend, frontend, and data engineers across three time-zones
  • Collaborates with Product, Customer Support, and Restaurant Success teams to keep the customer experience seamless
  • ✅ Minimum requirements

  • 5+ years running production workloads on AWS (or GCP/Azure) with infrastructure-as-code (Terraform/CDK/CloudFormation)
  • Hands-on experience operating container orchestration (ECS, EKS, Kubernetes, Nomad, etc.) and designing blue/green or canary rollouts
  • Depth in at least two of our core datastores (Postgres, MongoDB, Kafka) including backup/restore, upgrades, and performance tuning
  • Fluency with CI/CD pipelines (we use Buildkite + GitHub Actions) and a knack for automating everything with shell, Python, or TypeScript
  • Proven track record setting up monitoring/alerting in Datadog, Prometheus, or similar, with clear SLO/SLA ownership
  • Strong grasp of linux networking, load balancing (Cloudflare/ELB), and CDN/edge-security concepts
  • Excellent incident-management and root-cause analysis skills; able to write crisp RCAs and follow through on action items
  • Passion for customer-centric thinking, rapid iteration, and continuous learning
  • 🌟 Bonus points

  • Experience with NestJS or other Node.js backends at scale
  • Prior work in PCI-DSS or SOC 2 environments
  • Familiarity with GitOps workflows (Argo CD, Flux)
  • Exposure to mobile CI (React-Native pipelines), LaunchDarkly/feature-flags, or chaos-engineering
  • 🏆 Pay & benefits

  • The estimated base salary range for this role is $170K - $210K, plus a generous pre-IPO equity package.
  • 100% remote across the U.S. or Canada (option to drop into our SF office)
  • Comprehensive health, dental, and vision coverage
  • Home-office stipend, top-tier laptop, and any tools you need to excel
  • Twice-annual team off-sites

  • 🚩 Notice - Employment Scams
    Communication from our team regarding job opportunities will only be made by an Owner employee with an @owner.com email address.
    We do not conduct interviews over email or chat platforms, and we will never ask you to provide personal or financial information such as your mailing address, social security number, credit card numbers or banking information.  If you believe you are being contacted by scammer, please mark the communication as "phishing" or “spam” and do not respond.
    👋 About Owner.com
    Owner is the all-in-one platform that restaurants use to succeed online.

    Thousands of restaurant owners use our tools to build their website, drive online orders, create their own branded app, manage their customer relationships, and set up marketing automations.

    You can think of it as Shopify meets HubSpot, but specifically for restaurants.

    Learn more about the problems we are solving for our customers here.

    🌎 Our vision
    We’re starting by helping independent restaurants succeed online.

    But it’s not just restaurants that need our help. Most local businesses are struggling with these same problems. Huge technology corporations are taking their customers, bleeding their profits, and making it hard for them to survive.

    Once we nail the solution for restaurants – we’ll scale it into every other local business type.

    In the future we envision, tens of millions of local business owners will use our technology to succeed in the digital age.

    🚀 Our traction
    In just over 3 years we've generated tens of millions in revenue, served millions of guests, and processed hundreds of millions of online orders.

    More importantly, we’ve helped thousands of restaurant owners save their businesses - and not  only survive, but thrive.

    ⭐ Our team
    Our team grew from under 100 to nearly 200 talented people in 2024. We’ve got top talent from the most successful companies in SMB software, including: Shopify, HubSpot, DoorDash, ServiceTitan, Rappi, Faire and Stripe.

    We’ll be scaling even faster in 2025 to keep pace with our customer growth.

    🌆 Where we work
    Owner is a remote-first, global company headquartered in San Francisco, with a sales hub in Toronto. For a few of our roles we prioritize in-person collaboration at one of our office locations. Most of our employees are distributed throughout the globe. Please review the role description and discuss with your recruiter for more details on location!

    Please mention that you found the job on Real Work From Anywhere, this helps us grow. Thanks.

    About the job

    Posted on

    May 9, 2025

    Apply before

    Jun 9, 2025

    Job type

    Full-Time

    Category

    Region

    Worldwide

    Share this job

    Similar Jobs

    Scroll company logo
    Scroll
    Senior / Staff Site Reliability Engineer
    22d ago
    Canonical company logo
    Canonical
    Senior Site Reliability / Gitops Engineer
    28d ago
    Canonical company logo
    Canonical
    Site Reliability / Gitops Engineer
    28d ago
    Clearer company logo
    Clearer
    Senior Software Engineer
    22d ago
    Clearer company logo
    Clearer
    Senior Software Engineer
    22d ago