[Remote] Infrastructure/SRE Engineer - Qualcomm - Remote, UK
Note: The job is a remote job and is open to candidates in USA. Qualcomm Technologies International Ltd is seeking an Infrastructure/SRE Engineer to help build and evolve the cloud platform that powers Edge Impulse. The role involves managing AWS infrastructure, maintaining Kubernetes clusters, and driving CI/CD pipelines, ultimately enabling engineering teams to ship reliably and securely at scale.
Responsibilities
- Help manage and improve AWS infrastructure using Terraform across multiple environments
- Help maintain and scale Kubernetes clusters with Karpenter, including GPU node pools for ML workloads and troubleshooting scheduling and node lifecycle issues
- Contribute to our Cilium-based networking layer, including network policy management, traffic observability via Hubble, and cluster connectivity
- Build and maintain CI/CD pipelines using Buildkite
- Manage deployments through ArgoCD with automated health checks and rollout strategies
- Develop internal tooling and automation in TypeScript
- Implement observability and alerting with Datadog
- Champion security best practices including secrets management and access controls
- Participate in incident response, capacity planning, and cost optimization initiatives
Skills
- Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience
- OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience
- OR PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience
- 2+ years of work experience with Programming Language such as C, C++, Java, Python, etc
- 4+ years in SRE, DevOps, or Platform Engineering roles
- Strong AWS experience (EKS, RDS, S3, IAM, VPC)
- Proficiency with Terraform in production environments
- Solid Kubernetes knowledge including networking, RBAC, and autoscaling (Karpenter)
- Experience with Cilium or eBPF-based networking, including network policies, observability with Hubble, and service mesh concepts
- Exposure to GPU-accelerated workloads on Kubernetes is a plus
- Experience with ArgoCD or similar GitOps workflows
- CI/CD pipeline experience (Buildkite or similar)
- Proficiency in TypeScript, Python, or Bash
- Familiarity with Datadog, HashiCorp Vault, PostgreSQL, and Redis
- Strong communicator comfortable on a fully distributed team
Company Overview
Company H1B Sponsorship