- Pune - - India
Data Scientist - NLP Focussed
Hello Visionary!
We empower our people to stay resilient and relevant in a constantly changing world. We’re looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future.
Does that sound like you? Then it seems like you’d make a great addition to our vibrant team.
Are you a passionate Data Scientist with a knack for turning complex data into actionable insights, especially when it comes to natural language processing and semantic understanding? Do you thrive on solving challenging business problems using cutting-edge machine learning and AI technologies, particularly in the realm of building data and ontologies? If so, we want to hear from you!
At Siemens Infrastructure – Buildings Software, we're at the forefront of creating sustainable pioneering autonomous building technologies. We're looking for a talented and enthusiastic Data Scientist to join our dynamic team and play a pivotal role in the semantic enrichment of building data points, working closely with our ontology experts.
In this exciting role, you'll have the opportunity to work on diverse projects, from developing advanced NLP models for unstructured building data to building end-to-end machine learning solutions on cloud platforms. If you're eager to make a tangible impact by bringing structure and meaning to complex data and grow your expertise in a collaborative environment, this is the place for you!
What You'll Do (Your Responsibilities):
As a Data Scientist, you will be instrumental in:
- Strategic Collaboration: Partnering with stakeholders to deeply understand business objectives and translate them into robust data science strategies.
- Model Development: Designing, developing, and implementing advanced machine learning and statistical models to tackle complex business challenges.
- NLP for Semantic Enrichment: Applying Natural Language Processing (NLP) techniques to extract, classify, and semantically enrich unstructured building data points, working closely with ontology experts.
- Data Preparation & Exploration: Conducting in-depth exploratory data analysis, meticulous data cleansing, and insightful feature engineering to prepare datasets for analysis, including text-based data.
- Insight Generation: Exploring and utilizing various data mining and machine learning techniques to extract valuable insights and discover hidden patterns from large datasets.
- Predictive & Prescriptive Solutions: Developing powerful predictive and prescriptive models, algorithms, and prototypes to directly support business decision-making.
- IoT/IIoT Expertise: Applying your working knowledge of handling IoT and IIoT data, including traditional use cases like anomaly detection for multi-sensor systems.
- Forecasting Mastery: Leveraging your hands-on experience in forecasting using both traditional machine learning methods and advanced Deep Neural Networks.
- Statistical Rigor: Performing thorough statistical analysis, hypothesis testing, and A/B testing to rigorously evaluate the effectiveness of models and algorithms.
- Cloud-Based ML Systems: Gaining hands-on experience in developing end-to-end machine learning systems on leading cloud-based platforms like AWS or Azure.
- Effective Communication: Clearly communicating findings and insights to both technical and non-technical stakeholders through compelling reports, presentations, and data visualizations.
- Deployment & MLOps: Understanding and applying CI/CD processes in product deployment, ensuring seamless delivery of solutions.
- Technical Proficiency: Demonstrating an understanding of Dockerization and REST APIs for scalable and efficient deployments.
- Continuous Learning: Staying abreast of the latest trends and advancements in data science, machine learning, AI, and NLP technologies, identifying opportunities to apply them to improve business outcomes.
- Software Development: Engaging in hands-on software development, primarily with Python.
What You'll Bring (Your Qualifications):
We're looking for someone who can hit the ground running and grow with us!
- Experience: 5+ years of experience in data science and/or data analysis with a proven track record of successfully developing and deploying ML models and algorithms.
- NLP Expertise: Demonstrated experience with Natural Language Processing (NLP) techniques, including text classification, entity recognition, topic modeling, and semantic search.
- Education: A Master's or Ph.D. degree in Data Science, Computer Science, Statistics, Applied Mathematics, Computational Linguistics, or a closely related quantitative field. (B.E. / M.Sc. / MCA / B. Tech in Computer Science / Applied Mathematics or related fields with a good academic record will also be considered.)
- Programming Prowess: Strong hands-on programming skills in Python are essential, with experience in relevant NLP libraries (e.g., NLTK, spaCy, Hugging Face, Gensim).
- Domain Knowledge: Exposure to the industrial engineering domain, building management systems, or experience working with structured/unstructured data in a complex domain is highly preferred. Experience with ontologies or knowledge graphs is a significant plus.
- Tool Proficiency: Proficient in the use of various data science tools and libraries.
- Analytical Acumen: Excellent analytical and statistical skills to derive meaningful insights.
- Collaboration & Proactivity: A collaborative, team-oriented attitude with a proactive mindset and a strong willingness to learn new technologies.
- Communication Skills: Excellent verbal, written, and presentation skills to articulate complex ideas clearly.
Why You'll Love Working Here:
- Impactful Work: Contribute to projects that make a real difference in how buildings are managed and understood, driving efficiency and sustainability.
- Growth Opportunities: Continuous learning and development in a fast-paced, innovative environment, especially in the intersection of AI, NLP, and IoT.
- Collaborative Culture: Work alongside a diverse team of talented professionals, including data scientists and ontology experts, who are passionate about data science.
- Cutting-Edge Technology: Access to the latest tools and technologies in AI, machine learning, and NLP.
Join us and be yourself!
At Siemens, we believe our strength comes from diversity—of thought, background, and experience. We are committed to fostering an inclusive workplace where everyone has equitable opportunities to learn, grow, and succeed. Bring your authentic self and help us create a better tomorrow.
Make your mark in our exciting world at Siemens.
This role is primarily based in Pune. You may have opportunities to visit other locations within India based on business needs. In return, you’ll get to work with teams that are actively shaping the future of smart infrastructure worldwide.
We’re Siemens—a global network of over 300,000 minds building a better future, one day at a time, across more than 200 countries.
Find out more about Siemens careers at:
https://new.siemens.com/global/en/company/jobs.html