The VP of Machine Learning will be responsible for building and leading a team that uses big data analytics and machine learning techniques to improve the reliability of our datacenter services and reduce operational costs. The ideal candidate will have experience in data center operations, server, network, storage, and security equipment behavior, and using big data and machine learning algorithms to predict failures. The candidate will be responsible for gathering and organizing monitoring information about datacenter equipment, developing algorithms that learn normal behavior for these systems, identifying anomalies in the environment that can be used to predict future failures, and automating the response to detected conditions. The candidate will be expected to demonstrate the core values of collaboration, passion, reliability, and honesty while being results-oriented and focused on achieving operational efficiencies from the machine learning system being developed. The candidate must also be able to organize people, budgets, and schedules to achieve a financial plan that formed the business case for this project.
Responsibilities:
Build and lead a team that uses big data analytics and machine learning techniques to improve the reliability of our datacenter services and reduce operational costs.Gather and organize monitoring information about datacenter equipment.Develop algorithms that learn normal behavior for these systems.Identify anomalies in the environment that can be used to predict future failures.Automate the response to detected conditions.Organize people, budgets, and schedules to achieve a financial plan that formed the business case for this project.
Requirements:
Bachelor's or Master's degree in Computer Science, Data Science, or a related field.Experience in data center operations, server, network, storage, and security equipment behavior.Experience using big data and machine learning algorithms to predict failures.Strong programming skills in Python, R, or other languages.Experience with machine learning frameworks such as TensorFlow, Keras, or PyTorch.Strong analytical and problem-solving skills.Excellent communication and collaboration skills.Results-oriented and focused on achieving operational efficiencies from the machine learning system being developed.
JR009901