Internship - Machine Learning Engineer - Speech Generation and Audio Understanding
Apple Inc
Computer-Hardware
Zürich
- Employment type: Student
- On-site
About this job
Summary
Posted:
Weekly Hours: 40
Role Number: 200626383-4170
Application Deadline: 30th November 2025
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It’s the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you’ll do more than join something — you’ll add something.
In the Siri Attention and Invocation team, we act as the front door to our users’ interactions with Siri on almost every shipping Apple device. We work hard to make sure that Siri responds only when intended, in an efficient and privacy-preserving manner.
Description
We are looking for an intern to explore speech synthesis, audio generation and reasoning techniques. The ideal candidate will be very familiar with audio generation, speech synthesis and large language models.
Responsibilities
- Develop audio generation and speech synthesis methods
- Explore methods for LLM reasoning
- Build automated evaluation pipelines to assess the quality of the synthetic data
- Optimize developed models for efficient inference
Minimum Qualifications
- Bachelor’s degree in Computer Science or equivalent
- Demonstrable experience in training deep learning systems on multiple GPUs in PyTorch
- Demonstrable experience in audio, text-to-speech, and speech-to-text technologies
Preferred Qualifications
- Studying for a Master’s degree or PhD in Computer Science, Machine Learning, or equivalent
- Demonstrable experience with diffusion and/or autoregressive audio generation models
- Publications in audio generation at well-known conferences
