Senior Data Engineer/Operator

Owkin

  • Nantes, Loire-Atlantique Paris
  • CDI
  • Temps-plein
  • Il y a 2 mois
Think about your LinkedIn code!About the role:You will be part of the Data Operations team and report to the Director of Data Operations.The Data Operations team's role is to ensure Owkin accesses well structured, documented & curated datasets that are fit for AI-modeling.You will be involved in the efforts of the company related to extraction and curation and alignment of multimodal data. You will work with the team whose role spans from finding the raw data in our data partners information systems to providing a fit-for-AI dataset which then can be used by data scientists. This involves many steps such as extraction, cleaning, curation, imputation,, jointure of different datasets (e.g. histology slides with patient tabular information), harmonization.Owkin has a particular focus on histology and genomics data (always complemented with patient Electronic Medical Records) but also many other modalities as our projects span various therapeutic areas and medical expertise.In particular, you will:
  • Ensure timely delivery of AI-ready datasets
  • Act as a scrum master for the data operations team
  • Act as a tech lead for the data engineering team
  • Co-found and maintain the data engineering roadmap (this includes writing user stories and specifications)
  • Ensure continuous integration of technical requirements from our data platform (interact with engineering and product teams)
  • Act as a data operator and actually download, document, describe and curate datasets for several projects
The responsibilities missions described are not an exhaustive list; additional tasks may be assigned or the scope of the job may change as necessitated by business demands.Position is based in our Paris or Nantes offices or remotely in France.About youRequired qualifications / experience:
  • Master degree in mathematics, statistics, computer science or related field
  • Previous data scientist experience on biomedical datasets including histology slides, omics, and DICOM images
  • Technical skills: Python, pandas, SQL, development, packaging and use of CLI tools (optional: javascript, cloud storage, micro-services)
  • Knowledge of healthcare Information System, main techniques and softwares
  • Knowledge of FAIR principles, data governance, Common Data Models (CDMs)
  • Experience working in AGILE environments and knowledge of AGILE roles and principles
  • Ability to handle a portfolio of projects and activities: operational/delivery v.s. development and automation
  • Organized, with a sense of urgency and priorities, business-driven
  • Interpersonal skills are required in order to collaborate effectively and autonomously internally and externally
  • Experience working with private and sensitive personal information
  • Good team player
  • Fluent in English
Please submit your CV in English#LI-MD1

Owkin