Job Description
Job DescriptionDevOps / Production Engineer | AI-Powered Logistics PlatformLocation: San Francisco, CA (On-site, 5 days/week)Salary: $170K–$210K + Competitive EquityBased in San Francisco, we are a rapidly growing AI startup that’s transforming one of the most outdated yet essential sectors in logistics. We’re building intelligent systems that make operations faster, smarter, and more efficient! Backed by some of Silicon Valley’s top early-stage investors, and with explosive growth and a clear path ahead, this is a rare opportunity to have meaningful ownership at an early inflection point in our organization. About the RoleWe’re looking for a Production Engineer who enjoys living at the intersection of software development, infrastructure, and reliability. This role is ideal for someone who takes pride in keeping systems running smoothly, shipping safely, and enabling product teams to move fast without breaking things. You’ll own critical production systems, design resilient infrastructure, and build the tooling that makes deploying software reliable, repeatable, and boring (in the best way).What's in it for you:
- Ownership from Day One: Build the foundation of core systems and product architecture.
- Elite Team: Work alongside engineers and founders from Google, LinkedIn, and leading AI research labs.
- Momentum: 100%+ month-over-month revenue growth and growing customer demand.
- Backed by Top Investors: Supported by First Round Capital, Pear VC, and other top-tier firms.
- Fast Hiring Process: End-to-end hiring process typically completed within two weeks.
What You'll Own
- End-to-end responsibility for the availability and reliability of production services.
- Design and implement fault-tolerant, self-healing systems
- Architect and scale CI/CD pipelines to support frequent, safe, and reversible deployments
- Improve the developer experience, making best practices the default
- Support AI-related infrastructure, data workflows, and human-in-the-loop systems
- Manage databases, including performance optimization, backups, and compliance
- Lead incident response efforts and post-incident reviews, turning failures into long-term fixes
- Build observability systems using monitoring, logging, and alerting tools
What You'll Bring
- 1 - 6 years in Production Engineering, DevOps, or SRE
- Direct ownership of uptime and reliability for live production systems
- Experience operating in startup or fast-moving environments
- Demonstrated ability to scale CI/CD systems in real-world production settings
- Strong backend engineering experience, especially in Python
- Deep familiarity with Google Cloud Platform (GCP)
- Infrastructure-as-code experience (e.g., Terraform, CloudFormation)
- Experience managing relational databases in production
- Comfort working across CI/CD, cloud infra, and application code
If you're ready to join an exciting start-up poised for lots of growth in the near-future, apply today! This role is not eligible for visa sponsorship at this time
