תיאור המשרה
Description
About us
Penlink is a global leader in digital intelligence solutions. Our advanced technologies simplify complex data, empowering public safety organizations to make informed decisions quickly and effectively. We believe in the power of data-driven intelligence to accelerate clarity in decision-making for global security, strategic operations, and the most critical missions. Headquartered in the US with offices worldwide.
About the Role
We are looking for a skilled Observability Engineer with strong expertise in monitoring and cloud-native deployments. The ideal candidate is passionate about system reliability, performance optimization, and building scalable infrastructure across modern cloud platforms.
Key Responsibilities:
Design, implement, and maintain observability solutions using tools such as Datadog, Grafana, and the Elastic Stack (ELK)
Build and optimize monitoring, alerting, and logging pipelines to ensure high system reliability and fast incident response
Develop dashboards and metrics that provide deep visibility into system performance and business KPIs
Ensure best practices in cloud architecture, including security, scalability, and cost optimization
Troubleshoot production issues and implement proactive solutions to prevent recurrence
Requirements
At least +3 years with hands-on experience with observability and monitoring tools (Datadog, Grafana, Elastic/ELK)
Strong understanding of logging, metrics, tracing, and alerting concepts
Proven experience with AWS and Azure cloud platforms
Solid understanding of containerization and orchestration (Docker, Kubernetes is a plus)
Strong problem-solving skills and a proactive mindset toward system reliability
Experience with distributed systems and microservices architecture
Scripting skills (Python, Bash, etc.)
Advantage:
Familiarity with CI/CD pipelines and DevOps best practices
Familiarity with Infrastructure as Code and automation tools
Knowledge of SRE principles (SLIs, SLOs, error budgets)