Do you want to tackle the biggest questions in finance with near infinite compute power at your fingertips?
G-Research is a leading quantitative research and technology firm, with offices in London and Dallas. We are proud to employ some of the best people in their field and to nurture their talent in a dynamic, flexible and highly stimulating culture where world-beating ideas are cultivated and rewarded.
This role is based in our new Soho Place office – opened in 2023 - in the heart of Central London and home to our Research Lab.
The role
Ensuring resilience, uptime and operational efficiency is mission-critical to its success.
As a Production Software Engineer, you will play a key role in driving observability, reliability, change safety and runtime optimisation across a complex, federated engineering environment.
You will design and implement the systems, tooling and workflows that ensure the distributed platform is robust, observable and engineered for speed and safety. This includes owning the engineering infrastructure that supports deployment automation, real-time diagnostics and fault-tolerant operations.
This is a high-impact, hands-on engineering role. You’ll work closely with software engineers, infrastructure engineers, front office execution and research to streamline delivery into production, strengthen runtime behaviour and reduce operational risk. You’ll also participate in a shared production support model to reinforce stability and continuous improvement.
Engineering underpins our continued growth and success, and we are committed to recruiting and developing the world’s best Engineers.
Key Responsibilities of the role include:
Improving the resilience and efficiency of real-time distributed systems by identifying bottlenecks, reducing operational toil and limiting blast radius
Developing tooling and frameworks to support frequent, low-risk software delivery across a wide engineering estate
Working with infrastructure teams to own and evolve domain specific metrics, alerting and diagnostics infrastructure used to operate and monitor the platform
Building and maintaining core systems for deployment automation, observability, runtime environment management and release readiness
Promoting runtime engineering best practices, working with federated teams to align on standards, service ownership and fault tolerance
Participating in a shared production support rotation, helping respond to incidents, troubleshoot runtime issues and driving engineering improvements
Collaborating with application, research and execution teams to define and uphold runtime boundaries, deployment criteria and production SLAs
Who are we looking for?
Strong background in software engineering, ideally in distributed, real-time systems
Experience with containerisation and orchestration technologies, such as Kubernetes, in production environments
Familiarity with observability tooling and practices, such as Victoria Metrics, Prometheus, Grafana, OpenTelemetry and SLOs
Well-developed debugging skills with the ability to navigate unfamiliar systems, identify root causes and deliver effective solutions under time pressure
Proven track record of contributing to fault-tolerant, high-availability platforms
Proven ability to build and deploy software in resource-constrained environments, leveraging standard tooling effectively to support practices such as CI/CD systems and deployment automation
Comfort working in a federated model, engaging across multiple engineering teams and product streams
Focus on continuous improvement, reducing the need for manual intervention through engineering
Why should you apply?
Highly competitive compensation plus annual discretionary bonus
Lunch provided (via Just Eat for Business) and dedicated barista bar
35 days’ annual leave
9% company pension contributions
Informal dress code and excellent work/life balance
Comprehensive healthcare and life assurance
Cycle-to-work scheme
Monthly company events