Job Description
- Responsibilities:
- Design, deploy and maintain cloud infrastructure for data and ML workloads using Infrastructure as Code.
- Manage and evolve AWS-based data platform components running on Kubernetes (EKS).
- Provision and maintain services such as EMR on EKS, SageMaker, MWAA (Managed Airflow), Lambda, API Gateway and Step Functions.
- Implement and maintain IAM roles, permissions and governance policies aligned with compliance requirements.
- Support orchestration and transformation frameworks used by data teams (dbt, Airflow, Step Functions).
- Collaborate with data engineers to troubleshoot infrastructure or platform issues affecting pipelines.
- Participate in platform observability initiatives (metrics, logging and monitoring).
- Maintain Terraform modules and deployment pipelines.
- Support platform migrations and organizational AWS changes when required.
- Contribute to platform reliability, scalability and operational excellence.
- Requirements:
- 3+ years of experience working with AWS cloud infrastructure
- Strong experience with Terraform or similar Infrastructure as Code tools
- Experience deploying and operating containerized workloads on Kubernetes / EKS
- Solid understanding of AWS IAM, roles and security best practices
- Experience with serverless architectures (Lambda, API Gateway, Step Functions)
- Experience supporting data or ML platforms from an infrastructure perspective
- DevOps mindset and experience managing CI/CD or infrastructure automation
- Strong troubleshooting skills across distributed systems
- Benefits:
- Remote
- Professional development opportunities