Vacature Site Reliability Engineer

2 weken geleden


Amsterdam, Nederland Together AI Voltijd

As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, and mature automation to our operating environments and codebase. You specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems. Requirements ~7+ years of professional SRE or related experience ~ Bachelor's degree in Computer Science or a related field or equivalent work experience ~ Expert knowledge of Ansible (roles, playbooks), Terraform, and Kubernetes ~ Proficiency in programming/scripting languages ~ Direct experience in monitoring and observability practices ~ Advanced knowledge of cloud services ~ Ability to thrive in a collaborative environment involving different stakeholders and subject matter expertsResponsibilities Be on an on-call (PagerDuty) rotation to respond to incidents that impact availability Build and run our infrastructure with Ansible, Terraform, and Kubernetes to enable scaling to a massive number of concurrent users Build monitoring systems to ensure the highest quality service for our customers Design and implement operational processes (such as deployments and upgrades) Debug production issues across all services and levels of the stack Identify improvements for the product architecture from the reliability, performance and availability perspectives Plan the growth of Together AI’s infrastructure About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at


  • Site Reliability Engineer

    2 weken geleden


    Amsterdam, Noord-Holland, Nederland Salt Voltijd

    Salt is currently hiring a Site Reliability Engineer for a client of ours in Amsterdam.OverviewAs a Senior Site Reliability Engineer I, you'll apply software engineering principles to operations — driving reliability, scalability, and performance across systems and services. You'll design and implement robust, automated solutions that reduce operational...


  • Amsterdam, Noord-Holland, Nederland Brookwood Recruitment Ltd Voltijd

    About the Role:We are looking for a highly skilledSenior Site Reliability Engineer Ito join our team. You will focus on ensuring the reliability, performance, scalability, and efficiency of critical systems and services, while reducing operational toil through automation. This role involves designing and implementing complex technical solutions, leading...

  • Site Reliability Engineer

    2 weken geleden


    Amsterdam, Nederland Together AI Voltijd

    As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, and mature automation to our operating environments and codebase. You specialize...

  • Site Reliability Engineer

    16 uur geleden


    Amsterdam, Noord-Holland, Nederland Cognizant Voltijd

    Site Reliability Engineer (Business Platform Operations)What makes Cognizant a unique place to work? The combination of rapid growth on a global scale in an international and innovative environment This is creating a lot of opportunities for people like you — ambitious and pro-active with an entrepreneurial spirit who creates and seizes opportunities.At...


  • Amsterdam, Nederland ING Bank Voltijd

    **REQ-10101070**: - **16/09/2025**: - **IT Engineering**: - **Amsterdam, Nederland**: - **€5.485 - €8.828**ING Bank** - Details van de functie **Site Reliability Engineering Epert (enablement/consultant)** As a Tech Site Reliability Engineering Epert your position will be twofold. You are SRE epert for a specific IT Domain (60%) AND for a specific...

  • Site Reliability Engineer

    1 week geleden


    Amsterdam, Nederland Stacks Voltijd

    This role is based out of our Amsterdam office or London office Is dit de functie die u zoekt? Zo ja, lees dan verder voor meer details en zorg ervoor dat u vandaag nog solliciteert. We are an office-first company & believe great products are made when we are together. About Stacks At Stacks, we’re transforming the way finance teams approach one of their...

  • Site Reliability Engineer

    2 weken geleden


    Amsterdam, Nederland Funda Voltijd

    Do you get a thrill from building infrastructure that millions of people rely on every single day? Then you’ll feel right at home at Funda. At a platform of our scale, every tweak matters – and as a Site Reliability Engineer (SRE), you play a key role in keeping everything running smoothly. With 99.99% uptime we’re more reliable than many banks, and...

  • Site Reliability Engineer

    2 weken geleden


    Amsterdam, Noord-Holland, Nederland Funda Voltijd

    Do you get a thrill from building infrastructure that millions of people rely on every single day? Then you'll feel right at home at Funda. At a platform of our scale, every tweak matters – and as a Site Reliability Engineer (SRE), you play a key role in keeping everything running smoothly. With 99.99% uptime we're more reliable than many banks, and that's...

  • Site Reliability Engineer

    1 week geleden


    Amsterdam, Noord-Holland, Nederland Stacks Voltijd

    This role is based out of our Amsterdam office or London office We are an office-first company & believe great products are made when we are together.About StacksAt Stacks, we're transforming the way finance teams approach one of their most critical processes: the monthly close. For mid to large enterprises, the close is a painstaking, manual effort that...

  • Site Reliability Engineer

    1 week geleden


    Amsterdam, Noord-Holland, Nederland Stacks Voltijd

    This role is based out of our Amsterdam office or London office We are an office-first company & believe great products are made when we are together.About StacksAt Stacks, we're transforming the way finance teams approach one of their most critical processes: the monthly close. For mid to large enterprises, the close is a painstaking, manual effort that...