← Back to jobs
Atlanta, GA, USA
No related jobs found
Job Responsibilities include:
Provide L2/L3 production support for AI/ML applications, pipelines, and model deployments.
Monitor model performance, data quality, and drift, responding to alerts and incidents.
Troubleshoot and resolve issues across ML workflows, cloud infrastructure, and integrations.
Support CI/CD pipelines for model deployment, versioning, and release management.
Perform root cause analysis and implement preventive and reliability improvements.
Maintain operational dashboards, runbooks, and documentation while ensuring SLA/SLO compliance.
Required Qualifications:
Bachelor’s or master’s degree in computer science, Data Science, Engineering, or a related field.
8–10 years of experience in application or production support, with exposure to AI/ML systems.
Strong hands‑on experience supporting production AI/ML platforms, including ML pipelines, model deployments, and monitoring.
Proficiency in Python, cloud platforms (AWS/Azure/GCP), and incident management with strong troubleshooting and RCA skills
Any Graduate
No related jobs found
← Back to jobs