The Siemens PLM Innovation and Research team is looking for a passionate Cloud Operations Engineer to support the next generation of PLM software products running in the cloud. As a key member of the Digital Industries Software Organization, you will have the unique opportunity to drive, shape, build, and operate the cloud infrastructure supporting SaaS product offerings from Siemens while getting hands on experience integrating with many Siemens PLM products.
You will be part of a strong team in a fast-paced, start-up like environment where agile development is embraced, and innovation is encouraged. At Siemens, everyone can positively impact millions of customers and you will be called on to identify and realize these opportunities.
Siemens is a high growth organization working on many products and software changing the world. Be part of this fantastic new opportunity and inspiring culture of relentless innovation towards Ingenuity for Life.
The person in this role, you will be working closely with other internal teams to deploy, secure, and maintain the infrastructure systems hosting Siemens cloud services and apps. You will analyze, design, implement and validate strategies for continuous integration, build, test and deployment to Amazon Web Services or other cloud provider infrastructure while ensuring high availability on production and non-production systems.
The person in this role designs and implement automated, dynamic environments to support the needs of development teams, and collaborate with functional and technical team members to develop deployment strategies for existing and new types of development and operations services, while leading the enablement of processes and teams. The person in this role will be on an Agile Scrum team along with other Operation Engineers, and will participate a daily scrum meeting, updating story tasks, and providing daily updates to the team.
Job tasks include:
- Hands-on design, analysis, development and troubleshooting of highly distributed large-scale cloud production systems and event-driven, cloud-based services
- Primarily Linux Administration, managing a fleet of Linux and Windows VMs as part of the application solutions
- Ownership of reliability, up time, system security, cost, operations, capacity and performance-analysis
- Monitor and report on service level objectives for a given applications services. Work with the business, Technology teams and product owners to establish key service level indicators.
- Ensuring the repeatability, traceability, and transparency of our infrastructure automation
- Support on-call rotations for operational duties that have not been addressed with automation
- Create and maintain monitoring technologies and processes to improve the visibility of our applications' performance and business metrics and keep operational workload in-check.
- Partnering with security engineers and developing plans and automation to aggressively and safely respond to new risks and vulnerabilities.
- Develop, communicate, collaborate, and monitor standard processes to promote the long-term health and sustainability of operational development tasks.
- Participate in technical training events, game day scenarios, and professional conferences
Required Knowledge/Skills, Education, and Experience
- BS/MS Computer Science; MIS or related field. MBA is a plus.
- 3 years of demonstrated expertise building and managing highly scaled production infrastructure in the cloud (AWS required; GCP, Azure, OpenStack a plus)
- Highly organized and detail-oriented, with excellent, demonstrated process management skills; project and goal-oriented
- Personable, approachable, and readily accepting of change; able to work cohesively with a variety of talented individuals within the organization
Preferred Knowledge/Skills, Education, and Experience
- 3 Years of experience in system administration, application development, infrastructure development or related areas
- 3 years of reading, understanding and writing code to support the cloud systems
- AWS Certified Solutions Architect
- 2+ years of experience in Infrastructure-a-Code using Terraform and CloudFormation (plus)
- Versatility with troubleshooting diverse sets of hosting technologies is strongly desired. These include web server platforms, application platforms, operating systems, network components, virtualization technologies, storage, and database platforms
- Expertise with cloud- continuous-deployment- based software development lifecycles (e.g. CI/CD)
- Cloud database operations and deployment experience (Document Dbs, Graph Dbs, SQL Dbs), Caching operations & deployment experience (memcache, Redis)
- Expertise with Lean/Agile deployment processes (Blue/Green, ZDT, Canary, load balancers/DNS strategies A/B test, feature flagging methodologies)
- Familiarity with site and infrastructure monitoring systems (like ELK, Grafana)
- Ability to design and manage escalation response plans from monitoring, react, respond, remediate and retrospect in culturally aligned (proactive, customer-focused, collaborative, data-driven) ways
- Expertise with SDLC branching, SCM, and code deployment systems (git/gitflow, gitlab, etc.
Organization: Digital Industries
Company: Siemens Industry Software (India) Private Limited
Experience Level: Mid-level Professional
Job Type: Full-time