Internship - Machine Learning Engineer - Speech Generation and Audio Understanding
Apple Inc
Computer Hardware
Zürich
- Employment type: Student
- On-site
- Actively hiring
About this job
Summary
Posted:
Weekly Hours: 40
Role Number: 200626383-4170
Application Deadline: 30th November 2025
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It’s the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you’ll do more than join something — you’ll add something.
In the Siri Attention and Invocation team, we act as the front door to our users’ interactions with Siri on almost every shipping Apple device. We work hard to make sure that Siri responds only when intended, in an efficient and privacy-preserving manner.
Description
We are looking for an intern to explore speech synthesis, audio generation and reasoning techniques. The ideal candidate will be very familiar with audio generation, speech synthesis and large language models.
Responsibilities
- Develop audio generation and speech synthesis methods
- Explore methods for LLM reasoning
- Build automated evaluation pipelines to assess the quality of the synthetic data
- Optimize developed models for efficient inference
Minimum Qualifications
- Bachelor’s degree in Computer Science or equivalent
- Demonstrable experience in training deep learning systems on multiple GPUs in PyTorch
- Demonstrable experience in audio, text-to-speech, speech-to-text technologies
Preferred Qualifications
- Studying for a Master’s degree or PhD in Computer Science, Machine Learning or equivalent
- Demonstrable experience with diffusion and/or autoregressive audio generation models
- Publications in audio generation at well-known conferences
At Apple, we’re not all the same. And that’s our greatest strength. We draw on the differences in who we are, what we’ve experienced, and how we think. Because to create products that serve everyone, we believe in including everyone. Therefore, we are committed to treating all applicants fairly and equally. We will work with applicants to make any reasonable accommodations.
