Aller au contenu principal
Algolia recrutement

Manager Site Reliability Engineering H/F Algolia

Paris 8e - 75
CDI
Résumé de l'offre
  • Bac +2
  • Bac +3, Bac +4
  • Bac +5
  • Secteur informatique • ESN

Les missions du poste

As a Site Reliability Engineering Manager in the Production Engineering team of Algolia, you will lead the Fleet team of Site Reliability Engineers responsible for the provisioning and the global reliability of the Search Products at scale.

Your team will focus on creating pragmatic solutions to optimize the Search Products availability and costs at scale, depending on the needs of the customer, the Product teams, and the different engineering teams that deliver a unique Search Experience to our customers.

You will manage a team of experienced Individual Contributors who are responsible for :
- Operating and scaling the entire Search fleet, ensuring global performance and reliability.
- Reducing and maintaining the level of incidents through actionable KPIs and well-defined SLOs, while coaching and delegating Tier 3 support responsibilities.
- Running and continuously improving our in-house Edge Load Balancer.
- Building, operating, and enhancing a robust backup and restore system to ensure compliance with our SLAs.
- FinOps responsibilities, including monitoring infrastructure costs at scale and identifying optimization opportunities.

YOUR ROLE WILL CONSIST OF :
- Collaborating with senior leadership to define the overall technical direction and strategy for the organization, and ensure that the SRE team's goals and initiatives are aligned with this strategy.
- As well as building and maintaining strong relationships with stakeholders across the organization, as you represent the SRE organization in cross-functional meetings.
- You will also stay close to product and design teams to ensure that the user experience is always top of mind.
- You are expected to provide leadership, guidance and mentorship to your team members, helping them to develop their technical skills and knowledge of best practices in site reliability engineering. You will continuously evaluate and improve the performance of the SRE team, and you will identify and implement initiatives to drive operational excellence and improve overall service reliability.
- Establishing and enforcing engineering processes and best practices that ensure high-quality, reliable, and scalable systems, as well as working with other teams to promote the adoption of these processes and practices across the organization.
- You will BE responsible for defining and maintaining service level agreements (SLAs) and key performance indicators (KPIs) for your team's services, and you will work with other teams to ensure that these SLAs and KPIs are being met. As well as leading cross-functional efforts to resolve complex technical issues and mitigate operational risks across multiple teams and domains.
- Along with your team you will help design and implement monitoring, alerting, and metrics systems to ensure the availability, performance, and reliability of your team's services, and you continuously refine and improve these systems.
- Collaborating with other technical teams to identify opportunities to automate processes, as well as designing and implementing automated tools and systems to support these processes.
- As manager, you will also manage the budget for your team, ensuring that resources are being used efficiently.
- Finally, you will BE responsible for documenting your team's projects and processes, and ensuring that this documentation is up-to-date and accessible to all stakeholders.

YOU MIGHT BE A FIT IF YOU HAVE :
- 4+ years of engineering management experience
- You are fluent in Agile methodology and can lead a project from the idea to Production
- You are an excellent communicator, collaborating with Product managers, Technical Program Managers, and Individual Contributors to your team
- You are comfortable managing a large team regrouping all seniority levels, and accompanying Individual Contributors in their growth and development
- You know how to deploy an application from laptop to production, are able to fully automate IT, and you are comfortable with Production requirements (Observability, Alerting...)
- You are knowledgeable in DEVOPS principles and CI/CD pipelines
- You are knowledgeable in Configuration Management and Infrastructure as Code such as Chef and Terraform
- You are knowledgeable in at least one programming language (Python, Golang, Ruby.) and are familiar with software craftsmanship
- Full professional English proficiency
- Ability to make decisions and take ownership for them

WE'RE LOOKING FOR SOMEONE WHO CAN LIVE OUR VALUES :

GRIT - Problem-solving and perseverance capability in an ever-changing and growing environment.

TRUST - Willingness to trust our co-workers and to take ownership.

CANDOR - Ability to receive and give constructive feedback.

CARE - Genuine care about other team members, our clients and the decisions we make in the company.

HUMILITY - Aptitude for learning from others, putting ego aside.

Bienvenue chez Algolia

Algolia is set to enable every company to create world-class Search and Discovery experiences with an API-first approach. Performance and Scalability is at the heart of our mission : we power 1.5 trillion searches a year, for 10K+ customers all over the world.

Manager Site Reliability Engineering H/F
  • Paris 8e - 75
  • CDI
Publiée le 18/04/2025 - Réf : ALGOL_NAJ34Dl

Finalisez votre candidature

sur le site du recruteur

Créez votre compte pour postuler

sur le site du recruteur !

Ces offres pourraient aussi
vous intéresser

Sonepar France recrutement
Sonepar France recrutement
Aulnay-sous-Bois - 93
CDI
Voir l’offre
plus de 1 mois
Equans France recrutement
Equans France recrutement
Colombes - 92
CDI
Voir l’offre
il y a 17 jours
IDEX recrutement
IDEX recrutement
Puteaux - 92
CDI
Voir l’offre
il y a 18 jours
Voir plus d'offres
Les sites
L'emploi
  • Offres d'emploi par métier
  • Offres d'emploi par ville
  • Offres d'emploi par entreprise
  • Offres d'emploi par mots clés
L'entreprise
  • Qui sommes-nous ?
  • On recrute
  • Accès client
Les apps
Application Android (nouvelle fenêtre) Application ios (nouvelle fenêtre)
Informations légales CGU Politique de confidentialité Gérer les traceurs Aide et contact
Nous suivre sur :