Applications for this job are now closed

ExodusPoint Capital, founded in 2017 by Michael Gelband, began managing investor capital in 2018. The firm employs a global multi-strategy investment approach, seeking to deliver compelling asymmetric returns by combining complementary liquid strategies managed by experienced investment professionals within a robust risk framework. ExodusPoint brings together an accomplished team with hands-on experience running multi-manager businesses to create an institutional investment management firm.

Job description

ExodusPoint is seeking a motivated individual to join our global Site Reliability Engineering (SRE) team of six, split between the US and UK. As a Junior SRE, you will collaborate with experienced engineers to support, automate, and optimize our infrastructure stack. This role offers an exciting opportunity to work closely with both our development/business user base and our infrastructure teams—making you a key liaison in ensuring reliability and smooth operations across the organization.

Responsibilities

  • Infrastructure Liaison & DevOps Support
      • Serve as a bridge between development/business teams and infrastructure teams.
      • Collect and translate requirements from diverse stakeholders into actionable solutions.
      • Assist in designing and building CI/CD pipelines, then hand them over to user teams.
  • Platform & Tooling Management
      • Contribute to the deployment and maintenance of technologies like Kafka, Kubernetes, GitLab, and Airflow.
      • Explore and implement various DevOps tools to streamline and optimize workflows.
      • Collaborate on integrating new and existing systems to ensure smooth interoperability.
  • Monitoring & Observability
      • Support the monitoring infrastructure by setting up, automating, and managing monitoring tools.
      • Onboard development teams onto our observability platform, enabling them to easily track and respond to system metrics and alerts.
      • Collaborate with teams to fine-tune dashboards and alerting rules for effective and proactive monitoring.
  • Automation & Reliability Engineering
      • Help drive reliability and scalability by automating repetitive tasks and integrating self-service capabilities for the user base.
      • Work with senior engineers to implement best practices for container orchestration (Kubernetes) and data streaming (Kafka).
      • Troubleshoot system issues and propose long-term fixes to enhance performance.
  • Collaboration & Continuous Improvement
      • Interact with a wide array of stakeholders, from highly technical engineers to non-technical business users.
      • Participate in team knowledge sharing to increase understanding of SRE best practices.
      • Regularly evaluate existing processes and systems, suggesting innovative improvements.

Qualifications

  • Basic Technical Foundation: Familiarity with Linux environments, containerization concepts (Kubernetes), and DevOps practices.
  • Eager to Learn: Strong desire to acquire new skills in areas like Kafka, CI/CD tools (GitLab or similar), and workflow automation (Airflow).
  • Problem-Solving Mindset: Curiosity and perseverance in tackling technical challenges, paired with the resourcefulness to find solutions.
  • Team Player: Excellent communication and collaboration skills, with a willingness to help—and learn from—others.
  • Adaptability & Ownership: A proactive attitude toward stepping into new tasks and taking responsibility for outcomes.

Why join us

  • Hands-On Experience: Work on real, production-level infrastructure, gaining invaluable experience in SRE best practices.
  • Mentorship & Growth: Be part of a small, supportive team where you’ll receive guidance and have room to grow your technical expertise.
  • Cutting-Edge Technologies: Get exposed to a wide range of modern tools—from Kubernetes and Kafka to Airflow—essential in today’s tech landscape.
  • Global Collaboration: Engage with colleagues across different time zones and backgrounds, enhancing both technical and soft skills.
  • Impactful Work: Help shape our monitoring, DevOps tooling, and reliability processes, directly influencing organizational success.