Data Scientist - Python (Mid-senior, Senior)
Pathway
- Paris
- 50 000-90 000 €/an
- CDI
- Temps-plein
- Our primary developer offering is an ultra-performant Data Processing Framework (unified streaming + batch) with a Python API, distributed Rust engine, and capabilities for data source integration & transformation at scale (Kafka, S3, databases/CDC,...).
- The single-machine version is provided on a free-to-use license (`pip install pathway`).
- Major data use cases are around event-stream data (including real-world data such as IoT), and graph data that changes over time.
- Our enterprise offering is currently used by leaders of the logistics industry, such as DB Schenker or La Poste, and tested across multiple industries. Pathway has been featured in Gartner's market guide for Event Stream Processing.
- Learn more at
- be working with spatiotemporal data with advanced schemas (time-changing graph models)/
- be designing data cross-sections, proposing analytics metrics and KPI’s in line with clients’ objectives, selecting clustering algorithms, and preparing visualizations, to enable fast data exploration and insight discovery – all within our product.
- be designing dashboards in SQL with some Python elements/extensions.
- be directly helping us with Customer Conversion and Adoption within Customer organizations, by contributing to both deployment instances and “demonstrators” of our product, performed on client data sets.
- work directly with our Product Owner and CTO to propose and implement extensions to our product, based on repetitive client needs.
- depending on your seniority, implement machine learning algorithms on spatiotemporal event streams and other geospatial data.
- Ready for hands-on contribution to the product, helping to ensure the success of demonstrators for clients, and contribution to product codebase.
- Intuitive, with good visual taste, and good common sense judgment.
- Committed to beautiful user-centered design: you know that stories are made for people, and you are willing to listen to what they have to say.
- Curious at heart and thrilled to work with real-world data, especially spatio-temporal data.
- Like trains, trucks, cranes, pythons, pandas, and other things that move.
- Not afraid to switch between the roles of data scientist, data-vis magician, statistician, engineer, and detective, at a moment’s notice.
- Have 2 years+ experience in positions related to Data Science.
- Have a very good working knowledge of Python.
- Know SQL. Are able to work with tables and other data types (arrays, json,…).
- Would be able to implement the Transit Node Routing algorithm in Python just based on reading its Wikipedia article.
- Have experience with git, build systems, and CI/CD.
- Have at least basic undergrad textbook familiarity with graph algorithms, finite automata, and text (string) search algorithms.
- Understand statistical concepts, such as correlated random variables, significance, and non-Gaussian noise.
- Prepared to be quizzed & grilled by the datasets you encounter, everyday. Here are some questions you should be able to answer off the top of your head: what can “-273.15” signify; why “65535” is a suspicious integer value; how many months does it take a containership to go around the world; and, roughly what order of g-force is attained by an astronaut in a space rocket at liftoff?
- Respectful of others
- Fluent in English
- Showing a portfolio: code on github, visualization works, a research paper or a PhD thesis with an original statistical / probabilistic analysis or experiment design,…
- Successful track-record in Data Science or algorithms contests (Kaggle, Codeforces,…)
- Experience in topics linked to logistics/moving assets.
- Familiarity with some form of GIS software.
- Familiarity with Pandas, SciPy, NetworkX, and similar tools from the Python stack.
- Experience in Data Visualization and UX.
- Some knowledge of French, Polish, or German.
- Join an intellectually stimulating work environment.
- Be a pioneer: you get to work with a new type of data processing.
- Work in one of the hottest data/AI startups in France.
- Uncover exciting career prospects.
- Make significant contribution to our success.
- Join & co-create an inclusive workplace culture.
- Type of contract: Permanent employment contract
- Preferable joining date: February 2023. The positions (at least 2) are open until filled.
- Compensation: annual salary of €50K-€70K (mid) up to €60K-€90K (senior, upper band negotiable) + Employee stock option plan.
- Location: Remote work from home. Possibility to work or meet with other team members in one of our offices:
- Paris Area – Drahi X-Novation Center, Ecole Polytechnique, Palaiseau.
- Paris – Agoranov (where Doctolib, Alan, and Criteo were born) near Saint-Placide Metro (75006).
- Wroclaw – University area.