Data Scientist - NLP Focussed

Job ID
492123
Posted since
22-Jan-2026
Organization
Foundational Technologies
Field of work
Research & Development
Company
Siemens Technology and Services Private Limited
Experience level
Experienced Professional
Employment type
Full-time
Work mode
On-site
Contract type
Permanent
Location(s)
  • Pune - Maharashtra - India

Hello Visionary!

We empower our people to stay resilient and relevant in a constantly changing world. We’re looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future.

Does that sound like you? Then it seems like you’d make a great addition to our vibrant team.

Are you a Data Scientist skilled in NLP, semantic understanding, and turning complex data into actionable insights? Passionate about solving tough problems with cutting‑edge ML and AI, especially in building data and ontologies? If yes, we want to connect with you!

At Siemens Infrastructure – Buildings Software, we’re advancing sustainable and autonomous building technologies. We’re looking for a passionate Data Scientist to help semantically enrich building data points in close collaboration with our ontology team.

In this exciting role, you’ll work across a range of projects—from building advanced NLP models for unstructured building data to developing end‑to‑end machine learning solutions on cloud platforms. If you're driven to create real impact by bringing structure and meaning to complex datasets, and you’re eager to grow your expertise in a collaborative environment, this opportunity is perfect for you.

What You'll Do (Your Responsibilities):


As a Data Scientist, you will be instrumental in:


  • Strategic Collaboration: Working closely with stakeholders to gain a deep understanding of business objectives and translate them into effective, scalable data science strategies.
  • Model Development: Designing, developing, and implementing advanced machine learning and statistical models to address complex business challenges and deliver measurable value.
  • NLP for Semantic Enrichment: Leveraging Natural Language Processing (NLP) techniques to extract, classify, and semantically enrich unstructured building data, collaborating closely with ontology experts to ensure accuracy and consistency.
  • Data Preparation & Exploration: Conducting in-depth exploratory data analysis, meticulous data cleansing, and insightful feature engineering to prepare datasets for analysis, including text-based data.
  • Insight Generation: Exploring and using various data mining and machine learning techniques to extract valuable insights and discover hidden patterns from large datasets.
  • Predictive & Prescriptive Solutions: Developing powerful predictive and prescriptive models, algorithms, and prototypes to directly support business decision-making.
  • IoT/IIoT Expertise: Applying your solid understanding of handling IoT and IIoT data, including traditional use cases like anomaly detection for multi-sensor systems.
  • Forecasting Mastery: Applying your hands‑on experience in forecasting using both traditional machine learning techniques and advanced deep neural network models.
  • Statistical Rigor: Performing thorough statistical analysis, hypothesis testing, and A/B testing to rigorously evaluate the effectiveness of models and algorithms.
  • Cloud-Based ML Systems: Gaining hands‑on experience in building end‑to‑end machine learning systems on leading cloud platforms such as AWS or Azure.
  • Effective Communication: Clearly communicating findings and insights to both technical and non-technical team members through compelling reports, presentations, and data visualizations.
  • Deployment & MLOps: Understanding and applying CI/CD processes in product deployment, ensuring seamless delivery of solutions.
  • Technical Proficiency: Demonstrating an understanding of Dockerization and REST APIs for scalable and efficient deployments.
  • Continuous Learning: Staying abreast of the latest trends and advancements in data science, machine learning, AI, and NLP technologies, identifying opportunities to apply them to improve business outcomes.
  • Software Development: Engaging in hands-on software development, primarily with Python.

What You'll Bring (Your Qualifications):


We're looking for someone who can hit the ground running and grow with us!


  • Experience: 5+ years of experience in data science and/or data analysis with a proven track record of successfully developing and deploying ML models and algorithms.
  • NLP Expertise: Demonstrated experience with Natural Language Processing (NLP) techniques, including text classification, entity recognition, topic modeling, and semantic search.
  • Education: A Master's or Ph.D. degree in Data Science, Computer Science, Statistics, Applied Mathematics, Computational Linguistics, or a closely related quantitative field. (B.E. / M.Sc. / MCA / B.Tech in Computer Science, Applied Mathematics, or related fields with a good academic record will also be considered.)
  • Programming Prowess: Strong hands-on programming skills in Python are crucial, with experience in relevant NLP libraries (e.g., NLTK, spaCy, Hugging Face, Gensim).
  • Domain Experience: Exposure to the industrial engineering domain or building management systems, or experience working with structured/unstructured data in a complex domain, is highly preferred. Experience with ontologies or knowledge graphs is a significant plus.
  • Tool Proficiency: Proficient in the use of various data science tools and libraries.
  • Analytical Foresight: Excellent analytical and statistical skills to derive meaningful insights.
  • Collaboration & Proactivity: A collaborative, team-oriented demeanour with a proactive attitude and a strong willingness to learn new technologies.
  • Communication Skills: Excellent verbal, written, and presentation skills to articulate complex ideas clearly.

Why You'll Love Working Here:


  • Impactful Work: Contribute to projects that make a real difference in how buildings are run and understood, driving efficiency and sustainability.
  • Growth Opportunities: Continuous learning and development in a fast-paced, innovative environment, especially at the intersection of AI, NLP, and IoT.
  • Collaborative Culture: Work alongside a diverse team of talented engineers, data scientists, and ontology experts who are passionate about data science.
  • Cutting-Edge Technology: Access to the latest tools and technologies in AI, machine learning, and NLP.
 

Join us and be yourself!


This role is based in Pune, where you’ll get the chance to work with teams impacting entire cities, countries - and the shape of things to come.

We value your unique identity and perspective and are fully committed to providing equitable opportunities and building a workplace that reflects the diversity of society. Come bring your authentic self and create a better tomorrow with us.

We’re Siemens. A collection of over 312,000 minds building the future, one day at a time in over 200 countries. We're dedicated to equality, and we encourage applications that reflect the diversity of the communities we work in. All employment decisions at Siemens are based on qualifications, merit and business need. Bring your curiosity and imagination and help us shape tomorrow.

Find out more about Siemens careers at: www.siemens.com/careers