Design and maintain cloud-based infrastructure (AWS, GCP, or Azure) for AI development pipelines. -Automate infrastructure using tools like Terraform, Ansible, or similar. -Monitor and improve system performance, reliability, and scalability. -Identify and resolve infrastructure bottlenecks or deployment issues. -Summarize your troubleshooting, design, and optimization decisions clearly and concisely.
You’re a great fit if
Fluent in English with strong writing and communication skills.
Expertise in DevOps and Infrastructure as Code (IaC): containers (Docker), orchestration (Kubernetes), CI/CD (GitHub Actions, CircleCI, etc.).
3–5 years of experience in DevOps, cloud infrastructure, or SRE roles is a plus.
Bachelor’s degree (or pursuing one) in Computer Science, Engineering, or related field. Master's or PhD preferred.
Deep interest in AI/ML infrastructure, cloud computing, or secure system design.
4+ years of professional experience in DevOps, SRE, or infrastructure roles with a focus on IaC
Deep hands-on experience with Terraform (preferred), Pulumi, or AWS CloudFormation
Proficiency with at least one major cloud provider (AWS preferred; Azure or GCP acceptable)
Strong understanding of networking, IAM, VPCs, security groups, and resource policies
Comfortable writing modular, DRY IaC, and using state management practices responsibly
About the role
Flexible workload — work from anywhere, on your own schedule
High impact — your craft directly improves models used by top AI labs & Fortune 500 teams
Clear ownership — know exactly what success looks like and have autonomy to deliver
Growth potential — consistent high performers spearhead new programs and mentor incoming SMEs
Interview process
Complete a screening with Zara, our AI interviewer in English, to learn more about your background and experience.
Domain-specific Zara interview to assess your DevOps expertise, including Infrastructure as Code, CI/CD pipelines, and VMs.