Site Reliability Engineer, Software Development
Site Reliability Engineer, Software Development
Site Reliability Engineer, Software Development
Site Reliability Engineer, Software Development
Apple Inc
Computer-Hardware
Neubiberg
- Art der Anstellung: Vollzeit
- 59.000 € – 86.500 € (von XING geschätzt)
- Vor Ort
Site Reliability Engineer, Software Development
Über diesen Job
Summary
Posted:
Weekly Hours: 40
Role Number:200618042-6887
Shape the future of how Apple delivers software to millions of customers. Our Software Delivery team is seeking a Site Reliability Engineer to build the next generation of release technologies that power Apple's development lifecycle. If you are passionate about solving complex problems at scale, we want to hear from you.
Description
We are a team dedicated to engineering excellence, reusable design, and simplicity. We foster a supportive, growth-focused culture where we mentor each other and work together to build resilient, high-quality systems.
Responsibilities
- Ensure System Reliability: Design, build, and maintain robust, scalable, and observable systems for our core software delivery services.
- Automate: Reduce operational toil by developing automation and tooling to prevent and rapidly resolve production issues.
- Improve Incident Response: Own and refine our incident management processes to ensure high availability.
- Collaborate with Engineers: Partner with development teams to create elegant, high-quality solutions that support the entire workflow, from source code to customer release.
- Improve and Modernize Systems: Use a proactive approach to identify and eliminate technical debt to enhance long-term reliability and maintainability.
Minimum Qualifications
- Experience as a Site Reliability Engineer, DevOps Engineer, or Software Engineer focused on infrastructure in a large-scale distributed environment.
- Strong software development skills in a language like Swift, Go, or Python, and a high degree of comfort with shell scripting (Bash).
- Hands-on experience building and managing systems with container orchestration tools (Kubernetes, Docker).
- Deep understanding of networking (TCP/IP, DNS, HTTP) and experience using observability tools (monitoring, logging, tracing) to diagnose complex issues.
- Excellent problem-solving and communication skills, with a strong sense of ownership and drive.
Preferred Qualifications
- Proven experience leading initiatives to reduce technical debt, refactor systems, or improve performance and latency.
- Expertise in performance analysis and capacity planning for global, distributed systems.
- Experience with large-scale distributed databases (e.g., Cassandra, FoundationDB) or messaging systems (e.g., Kafka).
- Demonstrated ability to lead incident response for high-impact outages.
- Familiarity with using Generative AI (GenAI) or Large Language Models (LLMs) to accelerate operational tasks, such as automating runbooks, generating scripts, or analyzing incident data.
Gehalts-Prognose
Unternehmens-Details
Apple Inc
Computer-Hardware