Back to Search
Overview
Mid-Level

AIML - Machine Learning Researcher, DMLI- Image/Video Generation

Confirmed live in the last 24 hours

Apple

Apple

Cupertino
On-site
Posted March 31, 2026

Job Description

Summary

We are hiring a researcher with a strong technical background in Image/Video generation and editing, as well as Multimodal Foundation Models. You will play a critical role in the research and development of multimodal foundation models for image/video/3D generation, editing, animation, and many more. As a member of the team, you will have the opportunity to develop fundamental model capabilities, collaborate with team members with diverse backgrounds to work on ambitious projects, and collaborate broadly across Apple with world-class engineers and researchers to advance our products and delight millions of users.

Description

As a member of our fast-paced group, you’ll have the unique and rewarding opportunity to shape upcoming products from Apple. We are looking for people with excellent applied machine learning, computer vision/graphics experience, and solid engineering skills in creating outstanding model capabilities and product features. This role will have the following responsibilities: - Developing, fine-tuning, and evaluating foundational image generation and image editing models, as well as unified multimodal foundation models capable of both visual understanding and generation. - Developing, fine-tuning, and evaluating domain-specific image generation and editing models for various tasks and applications in Apple’s AI-powered products. - Conducting innovative research and transferring pioneering research in generative AI to production-ready technologies. - Understanding product requirements, translating them into modeling tasks and engineering tasks.

Minimum Qualifications

PhD, MS or equivalent experience Experience in machine learning, deep learning and statistical modeling. Experience in developing models for computer vision tasks, such as object detection, visual question answering. Experience in image generation models, such as VAE, GAN, and diffusion models Proficiency in one of the following deep learning frameworks: PyTorch, Jax, Tensorflow Proficiency in one of following languages: Python, Go, Java, C++

Preferred Qualifications

Experience in developing state-of-the-art image generation/editing models. Good interpersonal skills and team player.

machine learningai