Observability Platform Engineer

  • Infrastructure Engineering
  • London

We tackle the most complex problems in quantitative finance by bringing scientific clarity to financial complexity.

From our London HQ, we unite world-class researchers and engineers in an environment that values deep exploration and methodical execution, because the best ideas take time to evolve. Together we’re building a world-class platform to amplify our teams’ most powerful ideas.

As part of our engineering team, you’ll shape the platforms and tools that drive high-impact research, designing systems that scale, accelerate discovery and support innovation across the firm.

The role

As an Engineer on the Observability Platform team, you’ll manage the critical entry and exit points to our telemetry services, ensuring engineers across the business can reliably produce and consume telemetry data for their services.

You’ll work closely with the Observability Engineering team to design and implement robust, scalable data pipelines that ingest, route and visualise telemetry in predictable and composable ways. Your work will empower engineers to gain actionable insight into their systems, enabling informed decision-making and operational efficiency.

Operating under the broader Platform Engineering department, our team also holds responsibility for enhancing the reliability of our entire High-Performance Computing (HPC) stack — from networking and storage through to compute and application platforms.

We’re looking for an engineer with deep expertise in observability stacks and a keen understanding of the unique challenges associated with managing telemetry at cloud-scale volumes. You’re passionate about building systems that give customers clear, consistent access to telemetry data, helping them run their services as effectively as possible.

Experience running large-scale observability platforms for a diverse customer base is essential. Familiarity with core Site Reliability Engineering (SRE) principles is highly beneficial.

Key responsibilities of the role include: 

  • Being a key contributor to the development of our observability and reliability platforms
  • Contributing to the roadmap for observability tooling, ensuring alignment with business goals and scalability requirements
  • Working with telemetry data at enormous scale, ingesting data from industry-leading GPU clusters
  • Working with AWS services, ensuring seamless integration with the observability platform
  • Collaborating with cross-functional engineering teams to establish observability as a core function of the development lifecycle
  • Working closely with application teams to ensure observability systems are fully integrated and providing the necessary insights
  • Enabling SRE frameworks, promoting SLAs, SLOs and SLIs, and working closely with platform teams to ensure reliability is constantly improving
  • Helping to foster a culture of continuous learning and improvement, encouraging adoption of new observability tools and techniques

Who are we looking for?

The ideal candidate will have the following skills and experience: 

  • Proven experience on observability or SRE teams in a cloud-native or hybrid-cloud environment, running platforms in production and at scale
  • Well versed in reliability engineering concepts, including different types of testing, progressive deployments, error budgets, fault-tolerant design and the role observability plays
  • Hands-on experience with modern observability tools and frameworks such as Prometheus, OpenTelemetry (OTel) and Grafana, and with enterprise SaaS observability platforms such as Datadog and Dynatrace
  • Expertise in designing, building and scaling observability solutions for distributed systems
  • Customer-focused, with an enthusiasm for providing infrastructure as a service and for applying a product lens to platform-scale problems
  • Excellent communication skills and the ability to collaborate with cross-functional teams
  • Experience with cloud platforms, such as AWS, Azure or Google Cloud
  • Familiarity with microservices architecture and containerised environments, such as Kubernetes and Docker
  • Knowledge of infrastructure as code (IaC) and automation tools, such as Terraform and Ansible

Why join us?

  • Highly competitive compensation plus annual discretionary bonus
  • Lunch provided (via Just Eat for Business) and dedicated barista bar
  • 35 days’ annual leave
  • 9% company pension contributions
  • Informal dress code and excellent work/life balance
  • Comprehensive healthcare and life assurance
  • Cycle-to-work scheme
  • Monthly company events

Neil Corporate IT Manager

"My favourite part of working for G-Research is that technology is at the heart of everything we do at the company, driving the business forward and enabling us to stay ahead of the competition."

What our people say

Mario FPGA Manager

"While some people might think working in finance may not be too exciting, at G-Research, it is, especially if you see it as a problem to solve. How do we solve this algorithm? How do we get faster? This is why I think people are really excited to work here."

Mia Software Engineer

"What I appreciate most about working in G-Research is the supportive and knowledgeable environment. Everyone is incredibly helpful and patient, which ensures there’s a good balance between being challenged and your workload."

Ross Cloud Engineering Manager

"My favourite thing about working here is the people. G-Research strives to hire not only the brightest minds, but good people, which in turn creates a brilliant collegiate and social atmosphere at the company."

Matteo Quantitative Research Intern

"One of the things that has truly stood out to me is the collaborative and welcoming culture. I hadn’t expected such a supportive environment but it’s been one of the main reasons I’ve enjoyed working here from day one."

Margot HRIS Manager

"I enjoy how dynamic the work environment at G-Research is. It keeps you busy and continuously creates opportunities to develop yourself and your career, too."

Alexander Software Engineer

"I've felt very lucky to work with teams of people across the business who are generous with their time, knowledge and ideas as we collaborate to continuously build and rebuild complex systems with lots of moving parts."

Simon Cyber Security Manager

"There are lots of people within the business that have started as a junior and progressed – which I think is testament to G-Research's belief in fostering growth and recognising potential."

Sebastian Senior Quantitative Researcher

"G-Research makes a lot of effort to have a very open culture and gives a lot of freedom to its individual researchers to pursue directions that they think are valuable, with each researcher very much driving their own research. I didn’t feel like I was losing a lot of freedom compared to academia."

Yang Quantitative Researcher

"What I like the most about my job is it’s super open. I’m able to work with a lot of folks from other teams, too, such as working closely with engineers and other quantitative researchers."

Yousuf Machine Learning Engineer

"My intern experience was really good. You get the opportunity to impact a business, which is important if you’re preparing to enter the workplace. You get to do something useful and see how it gets used; I worked on a project that is still being used now."

Interview process

Online Application

Our assessment process kicks off with our Talent Acquisition team, who will review your application and assess your fit for the role.

Stage One: Technical Interview

You will meet with a team member – or take a remote test – where your technical abilities will be put to the test.

Stage Two: Behavioural Interview

We will set aside technical skills and focus on you.

Stage Three: Further Technical Interviews

Here, we will take a deeper dive into your technical skills and competencies.

Stage Four: Management Interviews

The final stage of our interview process is where you will meet members of your team, your future manager, and functional leadership.
