Data Analytics Engineer (all genders)

  • Internship
  • Darmstadt, GERMANY
  • IT development

Job description




Your role: 

The Merck Life Science Analytics Center of Excellence (ACE) is responsible for designing, developing, testing, and supporting automated data analyses, models and algorithms on Life Science’s data management and analytics platform (Palantir Foundry, Hadoop and other components).

In this role, you will be part of a growing, global team of data analytics engineers who collaborate in DevOps mode to equip Merck's Life Science business with state-of-the-art technology, so that it can leverage data as an asset and make better-informed decisions. The Foundry platform comprises multiple technology stacks, which are hosted on Amazon Web Services (AWS) infrastructure or on-premises in Merck’s own data centers. This position is project-based: you may work across multiple smaller projects or on a single large project, using an agile project methodology. In addition, you will:

 

·  Extract, manipulate, and transform data from multiple sources to support such analyses, building and ontologizing new datasets in the process.
·  Review code developed by other analytics engineers and check it against platform-specific standards, cross-cutting concerns, coding and configuration standards, and the functional specification of the pipeline.
·  Work out the best possible balance between technical feasibility and business requirements (the latter can be quite strict).
·  Besides working on projects, act as third-level support for critical applications; analyze and resolve complex incidents and problems.

 

Who you are:

·  M.Sc. degree in Computer Science, Engineering, Mathematics, Physical Sciences or related fields 
·  Several years of experience in data analysis, including advanced analytics, machine learning, natural language processing, information extraction, and information retrieval
·  Ability to develop high-quality data analyses, analytical models, and algorithms that support our data analytics use cases and solve critical business problems
·  Experience with and knowledge of regression, classification, natural language processing, and named-entity recognition (among others)
·  Proficiency in Python/PySpark code and familiarity with libraries such as scikit-learn, NumPy, and BeautifulSoup (TensorFlow or Keras)
·  Experience developing and applying NLP and machine learning methods in Java, Python, or Scala
·  Experience with Python libraries and models for text data analysis and machine learning, such as NLTK, spaCy, scikit-learn, TensorFlow, Word2Vec, and BERT
·  Experience developing or applying text analytics solutions in a Hadoop data lake environment is an added asset
·  Familiarity with optical character recognition (OCR) and other methods for converting scanned documents into semi-structured formats in preparation for data analysis
·  Experience in manipulating database data using SQL, and familiarity with views, functions, stored procedures, and exception handling
·  Participate in the end-to-end project lifecycle, from requirements analysis to go-live and operation of an application
·  Act as business analyst, developing requirements for Foundry pipelines
·  Document technical work in a professional and transparent way; create high-quality technical documentation
·  Deploy applications on the Foundry platform infrastructure with clearly defined checks
·  Implement changes and bug fixes via Merck's change management framework
·  Set up DevOps projects following Agile principles (e.g. Scrum)

 

 

Job Requisition ID: 210548

Location: Darmstadt

Career Level: B - Recent University Graduate

Job Segment: Analytics, Database, Computer Science, SQL, Developer, Management, Technology
