AIML - Sr Machine Learning Engineer, LLM Optimization, Data and ML Innovation
Job Description
Summary
As part of Apple's AI and Machine Learning org, we encourage and create groundbreaking technology for large-scale ML systems, computer vision, natural language processing, and multi-modal understanding. As a Machine Learning Engineer in the LLM Optimization team, you will have the opportunity to be part of an innovative ML organization that enables LLM for Apple products. The team focuses on crafting and implementing ML-based solutions to improve runtime latency, training time, memory usage, time to first token, and decoding speed across all Apple applications. The team is strategically positioned for significant contributions both in the short term (on well-known Apple products) and in the long term (on highly ambitious, high-risk, high-reward projects). This role emphasizes shipping ML-based features and products. You will innovate across the entire end-to-end ML production pipeline.
Are you fearless in trying new things and willing to iterate on ideas? We value team members who can quickly prototype towards high-quality implementations. Come and join us!
Are you fearless in trying new things and willing to iterate on ideas? We value team members who can quickly prototype towards high-quality implementations. Come and join us!
Description
In this role, you will leverage your background to:
Design and implement ML-based solutions to improve runtime latency, training time, memory usage, time to first token, and decoding speed for Apple applications.
Innovate across the entire end-to-end ML production pipeline, including dataset creation, neural network architecture design, model training, fine-tuning methods, training time optimization, on-device and server side inference.
Quickly prototype and iterate to achieve high-quality implementations for pioneering machine learning algorithms.
Collaborate with hardware and software teams to integrate research findings into market-ready solutions.
Translate theoretical ideas into tangible innovations, demonstrating their industrial applicability.
Minimum Qualifications
- Strong ML background
- Proficiency in Programming Languages and Frameworks: Python, C++, PyTorch/TensorFlow/Jax
- Experience with Natural Language Processing(NLP), ML optimization - with a focus on LLMs
- Outstanding communication and technical writing skills, capable of conveying complex concepts clearly and efficiently
- MS in Computer Science, or equivalent experience
Preferred Qualifications
- Phd
- Notable achievements validated by quality publications in ML optimization, with a focus on LLMs