DevOps / Production Engineer

Together For Talent

San Francisco, CA, USA

Published: 6/14/2022

Technology

Full Time

Job Description

Job DescriptionDevOps / Production Engineer | AI-Powered Logistics PlatformLocation: San Francisco, CA (On-site, 5 days/week)Salary: $170K–$210K + Competitive EquityBased in San Francisco, we are a rapidly growing AI startup that’s transforming one of the most outdated yet essential sectors in logistics. We’re building intelligent systems that make operations faster, smarter, and more efficient! Backed by some of Silicon Valley’s top early-stage investors, and with explosive growth and a clear path ahead, this is a rare opportunity to have meaningful ownership at an early inflection point in our organization. About the RoleWe’re looking for a Production Engineer who enjoys living at the intersection of software development, infrastructure, and reliability. This role is ideal for someone who takes pride in keeping systems running smoothly, shipping safely, and enabling product teams to move fast without breaking things. You’ll own critical production systems, design resilient infrastructure, and build the tooling that makes deploying software reliable, repeatable, and boring (in the best way).What's in it for you:

Ownership from Day One: Build the foundation of core systems and product architecture.
Elite Team: Work alongside engineers and founders from Google, LinkedIn, and leading AI research labs.
Momentum: 100%+ month-over-month revenue growth and growing customer demand.
Backed by Top Investors: Supported by First Round Capital, Pear VC, and other top-tier firms.
Fast Hiring Process: End-to-end hiring process typically completed within two weeks.

What You'll Own

End-to-end responsibility for the availability and reliability of production services.
Design and implement fault-tolerant, self-healing systems
Architect and scale CI/CD pipelines to support frequent, safe, and reversible deployments
Improve the developer experience, making best practices the default
Support AI-related infrastructure, data workflows, and human-in-the-loop systems
Manage databases, including performance optimization, backups, and compliance
Lead incident response efforts and post-incident reviews, turning failures into long-term fixes
Build observability systems using monitoring, logging, and alerting tools

What You'll Bring

1 - 6 years in Production Engineering, DevOps, or SRE
Direct ownership of uptime and reliability for live production systems
Experience operating in startup or fast-moving environments
Demonstrated ability to scale CI/CD systems in real-world production settings
Strong backend engineering experience, especially in Python
Deep familiarity with Google Cloud Platform (GCP)
Infrastructure-as-code experience (e.g., Terraform, CloudFormation)
Experience managing relational databases in production
Comfort working across CI/CD, cloud infra, and application code

If you're ready to join an exciting start-up poised for lots of growth in the near-future, apply today! This role is not eligible for visa sponsorship at this time