Job description
Description
Security teams are not short on tools. They are short on time, alignment, and the ability to actually close exposure. That's why most breaches could have been prevented with a tool that was already in use.
Nagomi was built to change that.
We are the first Agentic Exposure Ops Platform, designed to prevent the preventable breach by turning fragmented signals into coordinated action. Our agents investigate risk, drive remediation, and verify that issues stay closed, continuously, without adding overhead.
Nagomi is helping organizations move from reactive workflows to continuous execution, replacing manual effort with systems that actually move the work forward.
We are seeking an experienced Platform Engineer to join our platform team. The team that owns the core infrastructure, observability, networking, and security foundations that the entire product runs on. In this role, you will design, implement, and maintain our multi-cloud infrastructure, ensuring high availability, scalability, and security across multiple environments and regions. You will work closely with all R&D teams to streamline delivery, ship platform features, and keep our systems reliable and secure as we scale.
Key Responsibilities
Design, implement, and manage cloud infrastructure across multiple clouds, environments, and geographic regions
Architect and implement multi-region, highly available cloud solutions
Lead infrastructure automation initiatives using Infrastructure as Code (IaC) principles
Operate and scale the underlying data infrastructure and platform services (data stores, ingestion, and workflow orchestration) that other teams build on
Operate our observability stack (logging pipelines, metrics and monitoring, alerting, and tracing) for distributed systems
Optimize system performance, reliability, and cost-efficiency across global infrastructure
Contribute to the development of infrastructure-related features and reliability products
Participate in system design discussions, architectural decisions, and the design and implementation of reliability features that enhance product stability
Collaborate with all R&D teams to deliver features and improve product capabilities
Act as a cross-team advisor, reviewing infrastructure-affecting changes for impact before they ship
Troubleshoot and resolve complex infrastructure and networking issues
Requirements
5+ years of experience in DevOps, SRE, or platform engineering roles
Strong expertise with at least one major cloud platform (AWS, GCP, or Azure) in multi-region deployments
Experience with Kubernetes in production
Extensive experience with infrastructure automation tools (Terraform, CloudFormation, etc.)
Solid networking knowledge including VPCs, load balancing, DNS, security groups, and edge/CDN concepts
Experience designing and managing multi-region, multi-environment infrastructure for scalable applications
Proficiency in at least one programming language (Python, Go, JavaScript)
A security-conscious mindset and solid grasp of cloud security fundamentals (secrets management, IAM, least privilege)
Experience operating data systems and pipelines in production, such as ClickHouse (or similar columnar/OLAP data stores) and Temporal (or comparable workflow orchestration engines)
Is this role relevant for you?