Job description

Description Security teams are not short on tools. They are short on time, alignment, and the ability to actually close exposure. That's why most breaches could have been prevented with a tool that was already in use. Nagomi was built to change that. We are the first Agentic Exposure Ops Platform, designed to prevent the preventable breach by turning fragmented signals into coordinated action. Our agents investigate risk, drive remediation, and verify that issues stay closed, continuously, without adding overhead. Nagomi is helping organizations move from reactive workflows to continuous execution, replacing manual effort with systems that actually move the work forward. We are seeking an experienced Platform Engineer to join our platform team. The team that owns the core infrastructure, observability, networking, and security foundations that the entire product runs on. In this role, you will design, implement, and maintain our multi-cloud infrastructure, ensuring high availability, scalability, and security across multiple environments and regions. You will work closely with all R&D teams to streamline delivery, ship platform features, and keep our systems reliable and secure as we scale. Key Responsibilities Design, implement, and manage cloud infrastructure across multiple clouds, environments, and geographic regions Architect and implement multi-region, highly available cloud solutions Lead infrastructure automation initiatives using Infrastructure as Code (IaC) principles Operate and scale the underlying data infrastructure and platform services (data stores, ingestion, and workflow orchestration) that other teams build on Operate our observability stack (logging pipelines, metrics and monitoring, alerting, and tracing) for distributed systems Optimize system performance, reliability, and cost-efficiency across global infrastructure Contribute to the development of infrastructure-related features and reliability products Participate in system design discussions, architectural decisions, and the design and implementation of reliability features that enhance product stability Collaborate with all R&D teams to deliver features and improve product capabilities Act as a cross-team advisor, reviewing infrastructure-affecting changes for impact before they ship Troubleshoot and resolve complex infrastructure and networking issues Requirements 5+ years of experience in DevOps, SRE, or platform engineering roles Strong expertise with at least one major cloud platform (AWS, GCP, or Azure) in multi-region deployments Experience with Kubernetes in production Extensive experience with infrastructure automation tools (Terraform, CloudFormation, etc.) Solid networking knowledge including VPCs, load balancing, DNS, security groups, and edge/CDN concepts Experience designing and managing multi-region, multi-environment infrastructure for scalable applications Proficiency in at least one programming language (Python, Go, JavaScript) A security-conscious mindset and solid grasp of cloud security fundamentals (secrets management, IAM, least privilege) Experience operating data systems and pipelines in production, such as ClickHouse (or similar columnar/OLAP data stores) and Temporal (or comparable workflow orchestration engines)