Back

Sr. Site Reliability Engineer (SRE) – Kubernetes SME

Job Description

Position Overview 

As part of the Siemens DISW cloud operations organization, this position makes significant contributions towards the delivery of DevOps solutions that support best-in-class cloud based microservice applications.  Our team is looking for an engineer who is excited about automatic automation of Kubernetes solutions.  SREs discover ways to help promote the availability of services and applications, improve processes through remediation of manual and/or repetitive tasks, and solve complex technical problems in a fast-paced, collaborative, inclusive, and iterative environment. 

The candidate will support the Siemens Xcelerator platform and will be responsible for identifying, managing, improving, and reporting on availability, resiliency, reliability, and stability efficiencies. This includes providing technical guidance and leadership to drive solutions, create & enhance processes that deliver excellence. A strong relationship with the various product teams of the Xcelerator platform is necessary to support core objectives. This roles success will be defined by product teams within DISW business units meeting their SLAs.

Responsibilities 

• Provide & lead the design, deployment, automation, and scripting solutions to drive standardized capabilities, visibility, and efficiency around a Kubernetes environment.

• Collaborate with other technical platforms and partners to engineer automated and integrated solutions between tools, services, teams that increase availability, reliability, and performance of a Kubernetes environment.

• Own and ensure the internal and external SLA’s meet and exceed expectations

• Be part of maintaining a 24x7, global, highly available SaaS environment 

• Participate in an on-call rotation that supports our production infrastructure

• Troubleshoot production availability incidents that often span across multiple teams and services.

• Lead production incident post-mortems, and contribute to solutions to prevent problem recurrence; with the goal of automated response to all non-exceptional service conditions

• Communicate to business and technical partners on incidents as they occur when they impact system performance or availability at a critical level

Required Knowledge/Skills, Education, and Experience

• Bachelor’s Degree with at least 2+ years of IT experience or equivalent experience.

• 4+ years experience with containerization, specifically Kubernetes 

• 3+ years experience with monitoring tools (Datadog, or equivalent tools)

• 3+ years experience with automation via scripting & API development

• 2+ years experience with Amazon Web Services (AWS) services

• 2+ years experience Terraform, CloudFormation, Ansible, or equivalent tools

Qualified Applicants must be legally authorized for employment in the United States. Qualified Applicants will not require employer sponsored work authorization now or in the future for employment in the United States.

Preferred Knowledge/Skills, Education, and Experience

• **Siemens Teamcenter software**

• Desired certifications include: Datadog, Kubernetes, Security, AWS or Azure certification

• 2+ years experience as a Site Reliability Engineer or equivalent role

• 2+ years experience with issue/incident tracking tool (ServiceNOW, ServiceDesk, Jira or equivalent tools) 

• 2+ years experience with open source tools (Linux, Python, Git, Ansible)

• 2+ years experience Enterprise IT environment with distributed environments

• Networking concepts, including firewalls, VPN, routing, load balancers, security and DNS

• Senior level system administration experience, including troubleshooting, support, mentorship/training, and oversight

At Siemens we are always challenging ourselves to build a better future.  We need the most innovative and diverse Digital Minds to develop tomorrow’s reality.  Find out more about the Digital world of Siemens here:  www.siemens.com/careers/digitalminds

#LI-PLM 

#DISW

#LI-HYBRID 

           #LI-AA1


Organization: Digital Industries

Company: Siemens Industry Software Inc.

Experience Level: Experienced Professional

Full / Part time: Full-time



Equal Employment Opportunity Statement
Siemens is an Equal Opportunity and Affirmative Action Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to their race, color, creed, religion, national origin, citizenship status, ancestry, sex, age, physical or mental disability unrelated to ability, marital status, family responsibilities, pregnancy, genetic information, sexual orientation, gender expression, gender identity, transgender, sex stereotyping, order of protection status, protected veteran or military status, or an unfavorable discharge from military service, and other categories protected by federal, state or local law.

EEO is the Law
Applicants and employees are protected under Federal law from discrimination. To learn more, Click here.

Pay Transparency Non-Discrimination Provision
Siemens follows Executive Order 11246, including the Pay Transparency Nondiscrimination Provision. To learn more, Click here.

California Privacy Notice
California residents have the right to receive additional notices about their personal information. To learn more, click here.

Can't find what you are looking for?