Full Time
TBD
8
Nov 3, 2025
Job Qualifications Education & Experience:
• 9+ years of experience in DevOps, Site Reliability Engineering, or cloud infrastructure leadership roles.
• Proven experience setting infrastructure strategy and guiding organizations through scaling challenges.
• Expert-level knowledge of Linux, AWS, and Cloudflare for large-scale, distributed environments.
• Deep expertise in containerization and orchestration with Docker and Kubernetes. ? Mastery of CI/CD pipelines (GitLab
preferred) and infrastructure automation using Terraform.
• Advanced knowledge of SQL and Snowflake operations, performance tuning, and security.
• Strong programming skills in Python for automation, tooling, and system integration.
• Extensive experience building observability platforms with Grafana and related tools. ?
• Demonstrated success leading i
• Track record of mentoring engineers, influencing technical direction, and collaborating at executive levels.
• Thought leadership in cloud infrastructure and DevOps (open source contributions, conference speaking, publications).
• Deep expertise in infrastructure security, compliance, and regulatory frameworks.
• Experience managing multi-cloud or hybrid-cloud strategies.
• Proven success leading remote-first engineering organizations through rapid scaling.
Technical Skills:
• Expert with AWS, GCP, or Azure (multi-cloud experience preferred)
• Strong hands-on experience with Kubernetes, Docker, Helm
• Terraform / Pulumi or other IaC tools ? CI/CD tools (GitHub Actions, GitLab CI, Jenkins, CircleCI, ArgoCD, etc.)
• Observability stack (Prometheus, Grafana, ELK, Datadog, New Relic, etc.)
• Networking, DNS, VPC, VPN, Load Balancers ? Strong scripting & automation skills (Python, Bash, Go preferred)
• Experience with distributed systems, caching, messaging queues (Kafka, RabbitMQ)
• Security best practices, IAM, secrets management, compliance frameworks