What You’ll Do:
- Design and maintain systems, monitoring and alerting to detect anomalies and performance degradation of ML models; implement automation to streamline workflows
- Support with model development and deployment to production, ensuring seamless integration with existing infrastructure
- Develop and maintain dashboards and reports to provide insights into model performance and data quality
- Investigate and resolve model-related incidents, including data quality issues and prediction errors
- Design and implement cloud solutions (GCP) to containerize, deploy, version, and monitor data science models
- Collaborate and communicate effectively with data scientists, engineers and architects on model validation, testing, and quality assurance
- Document processes and best practices, and lead training sessions to upskill data scientists in Data Science Ops, promoting ops excellence and continuous learning
Minimum Basic Requirement:
- Bachelor's or graduate degree in Computer Science, Software Engineering, or a related quantitative field (such as engineering, machine learning, data science, operations research, or economics) with a minimum of 8 years of industry experience
- Expertise in cloud computing, with hands-on experience using GPC or similar services
- Prior DS Ops or ML Ops experience, with a proven track record of success
- Expertise in monitoring and alerting tools, such as PagerDuty, with experience designing and managing alerting systems to ensure optimal model performance and data quality
- Ability to collaborate effectively across functions and with diverse stakeholders, including data scientists, engineers, analysts, and finance teams, in mid- or large-scale companies
- Strong communication skills, with the ability to articulate complex technical concepts and solutions to both technical and non-technical audiences
Preferred Qualifications:
- Strong programming skills in SQL, Python, Ruby, and Bash, with a solid understanding of Linux and experience with frameworks like scikit-learn, Keras, PyTorch, and TensorFlow
- Proficiency with DS / ML Ops frameworks like Airflow, Kubeflow, MLFlow or similar
- Cloud solution design and implementation expertise, with experience building DS / ML Ops pipelines on GCP
Benefits at Credit Karma includes:
- Medical and Dental Coverage
- Retirement Plan
- Commuter Benefits
- Wellness perks
- Paid Time Off (Vacation, Sick, Baby Bonding, Cultural Observance, & More)
- Education Perks
- Paid Gift Week in December