You would work on the new ML and AI platform involving several Sprint teams around the new AI/ML product. We are looking for experts: DevOps Engineer or MLOps Engineer.
Job Responsibilities:
- Platform Ops 70% / coding (platform automation) 30%
- Participate in the development of highly scalable platforms for extracting, analyzing, and processing large amounts of contextual data from a plethora of sources, both in real-time and in batch modes.
- Working closely with other engineers and scientists, develop solutions to accelerate model development, validation, and experimentation cycles, and integrate models and algorithms in production systems at a very large scale.
Basic Qualifications:
- 7+ years of hands-on experience deploying the platforms or software at production scale
- 7+ years of experience in the complete software development lifecycle including design, coding, code reviews, testing, build processes, deployments, and operations.
- Expert experience in DevOps or ML Ops
- Experience in programming languages (automation of the infrastructure) ie. Python, Java
Experience in one of the areas:
- AWS Cloud Infrastructure (e.g., Kubernetes, Terraform, CICD)
- MLOps: e.g., Kubeflow or Knative or Kserve or Nvidia Triton or ML Flow
Preferred Qualifications:
- MS or PhD in Computer Science or equivalent experience.
- Experience working with large-scale distributed systems, preferably on cloud platforms (e.g., AWS, Azure, Google Cloud).
- Experience dealing with real-world large-scale datasets.
- Strong understanding and passion for statistical/mathematical modeling and data analysis.
- Working hours: 7-10 AM or 19-21. Need to overlap with one of these slots. Besides that can work normal hours but standard employment hours - 8hs