at Apple
Location
Sunnyvale, United States of America
Compensation
$147k–$272k USD
Type
full time
Posted
9 months ago
Market range · company + function + seniority
p25 · target · p75 · n=515
Posted $272k · in the market band
Posting health
Aging · 65Tailor your résumé to this role in 30 seconds.
Free account · ATS keyword check · per-job bullet rewrite by Claude.
We are looking for an AIML Engineer with a strong background in developing foundation models for generative AI and multimodal systems that integrate various types of real-time sensor data such as video and audio with other modalities like text. You will not only work on cutting-edge projects to advance our AI capabilities, but also contribute to practical features in Apple products and bring impact to millions of users. You will collaborate with others to drive data requirements, validation strategies, and key performance indicators, and conduct algorithm research and development that serves product needs. A successful candidate will stay up-to-date with the latest advancements in AI, machine learning, and computer vision, applying this knowledge to drive innovation, but also take a practical approach to problem solving and software engineering to deliver clean, modular, testable code.
Experience building models for multimodal perception system.
Experience working with LLMs and VLMs.
Software engineering skills and proficiency in Python and PyTorch.
Curiosity and willingness to learn new things in order to improve the quality of their solutions.
BS and a minimum of 3 years relevant industry experience.
MS or PhD in computer vision, computer graphics, machine learning, computer science, computer engineering or related fields.
Experience in developing, training/tuning foundation models and multimodal LLMs.
Experience with training and troubleshooting generative architectures such as diffusion, reinforcement learning, flow matching or normalizing flow at scale.
Experience applying reinforcement learning to help train foundation models a plus.
Excellent communication and experience working with multi-functional teams.
Self-motivated with proven track record to optimally prioritize and deliver tasks on schedule.
The Video Computer Vision organization is working on breakthrough technologies for future Apple products. Our team delivers cutting-edge AI, machine learning, computer vision and graphics algorithms that power technologies including human understanding, perception, digital humans, AI agents, and health applications. In this role, you will collaborate with world-class experts in AI, ML, Software, and Hardware to tackle fundamental challenges in human-centric solutions that will impact millions of users across Apple's ecosystem.
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location.Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant
At Apple, we believe accessibility is a fundamental human right. You’ll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong.
Learn about accessibility in Apple’s workplace
Learn about reasonable accommodations for job applicants
Apple accepts applications to this posting on an ongoing basis.