Description & Requirements
Introduction: A Career at HARMAN Automotive
We’re a global, multi-disciplinary team that’s putting the innovative power of technology to work and transforming tomorrow. At HARMAN Automotive, we give you the keys to fast-track your career.
- Engineer audio systems and integrated technology platforms that augment the driving experience
- Combine ingenuity, in-depth research, and a spirit of collaboration with design and engineering excellence
- Advance in-vehicle infotainment, safety, efficiency, and enjoyment
About the Role
As an MPS – Site Reliability Engineer (SRE), you will play a critical hands-on role in ensuring the reliability, availability, and performance of a SaaS platform. You will work closely with DevOps teams and business units to resolve production disruptions, improve monitoring, and strengthen Site Reliability and DevOps practices across the organization.
What You Will Do
- Monitor application performance and infrastructure metrics in a 24/7 operational model.
- Perform first-level analysis and troubleshooting of production issues using traces, metrics, and dumps from APM tools.
- Collaborate with DevOps teams to maintain and improve infrastructure and platform stability.
- Develop basic automation and scripts to generate operational and performance reports.
- Prepare, analyze, and present performance and reliability metrics to stakeholders.
- Participate in and lead incident and problem management calls.
- Analyze historical data to identify trends, perform problem clustering, and support root cause analysis.
What Makes You Eligible
- Bachelor’s degree in Computer Science, Electronics & Communication, Electrical Engineering, or a related engineering discipline.
- 3 to 6 years of experience supporting production systems in a SaaS or enterprise environment.
- Experience working within incident and problem management processes.
- Hands-on exposure to Linux-based operating systems (Ubuntu / Red Hat preferred).
What You Need to Be Successful
- Experience monitoring 24/7 SaaS environments with a solid understanding of Infrastructure & Platform (I&P) management.
- Basic working knowledge of cloud platforms and Kubernetes.
- Working understanding of Docker and container orchestration (Kubernetes) for troubleshooting.
- Strong understanding of monitoring and observability tools; Datadog experience is a strong plus.
- Basic scripting skills for troubleshooting and automation.
- Good understanding of networking fundamentals (TCP/IP, HTTP, DNS, IP addressing, VPN), especially in cloud environments.
- Experience debugging production-level issues using logs, traces, metrics, and dumps.
- Ability to analyze data manually and create meaningful reports and insights.
- Strong communication skills to collaborate effectively with cross-functional teams.
Bonus Points if You Have
- Hands-on experience with Datadog or similar APM and observability tools.
- Exposure to cloud networking concepts in AWS, Azure, or GCP.
- Experience contributing to SRE or DevOps maturity initiatives.
- Experience using tools such as Jira, Mattermost, and Confluence.
- Prior experience in automation of monitoring or reporting workflows.
What We Offer
- Competitive salary and benefits package
- Opportunities for professional growth and development
- Collaborative and dynamic work environment
- Access to cutting-edge technologies and tools
- Recognition and rewards for outstanding performance through BeBrilliant
- Chance to work with a renowned German OEM
- You are expected to work from the office all 5 days of the week