[Remote] Azure Infrastructure Engineer / DevOps
Note: The job is a remote job and is open to candidates in USA. Diverse Lynx is seeking an Azure Infrastructure Engineer / DevOps specialist to provision, configure, and operate the cloud infrastructure for a production Agentic AI platform on Azure. The role involves designing and managing various Azure services, implementing CI/CD pipelines, and ensuring security and scalability of the platform.
Responsibilities
- Provision and manage all Azure OCM infrastructure: AKS, Azure PostgreSQL, Event Hub (Kafka), Service Bus, Blob Storage, Container Registry, Key Vault, and API Gateway
- Design and configure AKS pod-per-workflow autoscaling using KEDA (event-driven, queue depth and Event Hub lag) and HPA (CPU/memory and custom SLA metrics)
- Build and maintain CI/CD pipelines (GitHub Actions) for containerized Python services and infrastructure-as-code
- Implement and manage the full observability stack: OpenTelemetry instrumentation, Dynatrace APM, Splunk log aggregation, and Azure Monitor / App Insights
- Configure and enforce security controls: Azure Private Link, managed identities, RBAC, Key Vault secrets management, HIPAA-compliant encryption, and Zero Trust network model
- Manage containerization: Docker image builds, Azure Container Registry, vulnerability scanning, and image lifecycle
- Support performance and load testing infrastructure, validating autoscaling under burst load scenarios
- Manage infrastructure for the parallel run and production cutover, including rollback capability
Skills
- 5+ years infrastructure engineering / DevOps with 3+ years on Azure in production environments
- AKS expertise: pod lifecycle, namespace management, KEDA, HPA, cluster autoscaler, multi-AZ deployment
- Azure services: Event Hub (Kafka API), PostgreSQL, Service Bus, Key Vault, Container Registry, Private Link, API Gateway
- CI/CD: GitHub Actions, Docker, Helm, infrastructure-as-code (Terraform or Bicep)
- Observability: OpenTelemetry, Dynatrace, Splunk, Azure Monitor
- Azure security: managed identities, RBAC, Private Link, encryption at rest and in transit, Zero Trust
- Experience migrating services from private cloud or on-premises to Azure with zero-downtime strategy
- HIPAA or equivalent compliance experience; security controls for PHI-handling systems
- Experience with Python-based microservices and containerized AI workloads on AKS
Company Overview
Company H1B Sponsorship