
Senior Site Reliability Engineer
- France
- CDI
- Temps-plein
- Building for scale: Design and develop effective tooling, alerts, and responses with your team to identify and address scalability and reliability risks, utilizing your expertise in backend systems and cloud technology.
- Exciting Challenges: Take on the responsibility of maintaining the reliability and availability of our cloud platforms, tackling complex problems and driving improvements to enhance performance.
- Talend people to work with: Work closely with highly skilled professionals seeking state-of-the-art and future-proof solutions, collaborating with Architecture and Product teams to design and develop new infrastructure features and optimize cloud-related practices.
- Professional Growth: Act as a resource for fellow engineers, sharing your knowledge and expertise in cloud engineering, production service operations, incident management, and troubleshooting.
- Continuous Learning: Stay updated on the latest industry trends and technologies, contributing to adopting best practices and driving continuous improvement within our cloud environment.
- Reliability and Scalability: Ensure high reliability and availability of our cloud platforms, collaborating with cross-functional teams to implement new infrastructure features and optimize performance.
- Cloud Optimization: Define and evangelize cloud-related optimizations and best practices, driving improvements in reliability, scalability, and performance.
- Problem Solving: Analyze complex issues at the infrastructure, systems, network, and application levels, making recommendations and decisions to resolve them effectively.
- Knowledge Sharing: Share your expertise with fellow engineers, guiding cloud technologies, automation, security, and best practices.
- On-Call Support: Participate in on-call duties to maintain the availability and performance of our cloud infrastructure, providing regular updates on project status and activities.
- Bachelor's or Master’s degree in Computer Science or a relevant field.
- 2+ years of experience in management with technical Leadership experience.
- Excellent English & French communication skills, both oral and written.
- Proficiency with Cloud Networking concepts, Kubernetes and container orchestration.
- Proficiency with observability tooling such as Prometheus, Open Telemetry, distributed tracing, and SIEM such as Splunk.
- Relevant and hands-on experience with Kubernetes, Git, Docker containers, Helm, GitOps method, Infrastructure as Code (IaC) tools such as Terraform and Ansible, secret-management tools (e.g. Vault, AWS SSM).
- Comfortable with or willing to learn Github CI
- Polyvalency is a plus, and some experience working, deploying and troubleshooting with at least 2 of those technologies: Hashicorp Vault, SignalSciences Web Application, Gloo Gateway, or any similar solutions will help the team
- Ability to take a rotating on-call shift.
- Strong analytical skills for solving complex problems and driving innovative solutions.
- Team player who can support his manager finding complex solutions
- Demonstrated ability to collaborate with development teams and provide expert guidance on implementing reliability best practices, ensuring systems are robust, scalable, and highly available.
- Curiosity and the desire to learn.
- Genuine career progression pathways and mentoring programs
- Culture of innovation, technology, collaboration, and openness
- Flexible, diverse, and international work environment