Platform Reliability Engineer - Java-Linux-Jenkins-containers

  • Job reference: 27592
  • Location: City of London
  • Job type: Permanent
  • Start date: Not specified
  • Contact: Not specified
  • Sector: Infrastructure & Cloud Services
  • Salary: £70000 - £80000 per annum

Sorry, this vacancy has now expired.

Senior Platform Reliability Engineer

The Platform Reliability Team is a highly skilled team of software engineers focused on building performance and reliability into the client's state-of-the-art trading platform. The client's vision for the future and the evolution of its architecture and development practices constantly present new challenges. The team plays a key part in the strategy to sustain the client's technology leadership.

The team is characterized by a passion for continuous improvement, natural problem-solving ability, an in-depth understanding of reliability patterns, and a commitment to enabling a DevOps culture. Our strategy is to empower development teams to deliver reliable, high-performance software, and to develop control and platform intelligence mechanisms that ensure only high-standard components reach production.

The selected candidate will play an important role in driving the team's strategy forward and will report to a Platform Reliability Lead. The responsibilities also include:

  • Engage in and improve the whole lifecycle of services, from inception through design (software) to refinement (deployment, operations, monitoring).
  • Engage with development teams before services go live through activities such as system design consulting, developing standardized libraries and services, capacity planning and launch reviews.
  • Ensure that once services are live they have monitoring and alerting to validate overall system health. This should go beyond basic process checking and cover other aspects of engineering, such as measuring end-to-end latency and kernel-level metrics (e.g. socket overflows).
  • Discover single points of failure within a system.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and resiliency.
  • Practice sustainable incident response and blameless postmortems.

The role has an element of out-of-hours support. Each person in the team will be on escalation and will help investigate and technically coordinate critical issues with the trading platform.

Character and attitude are as important as technical capability. We are looking for someone who is energetic and positive, friendly and easy to work with, a great team player, and has a can-do attitude. They are technically very strong, passionate about technology, and excite the people around them. They enjoy troubleshooting, think analytically and logically, and communicate effectively and precisely.

Required Experience:

  • Experience developing reliable, latency-critical applications using good design patterns.
  • Work experience in environments characterized by high throughput, low latency, zero downtime and frequent production deploys.
  • Troubleshooting complex problems in a distributed environment.
  • Designing technical solutions.
  • Driving a project from inception to production and overseeing all steps involved.
  • Good communication skills and the ability to inspire colleagues.

Required Skills:

  • In-depth understanding of Java and the JVM.
  • Continuous Delivery principles.
  • Messaging Systems e.g. JMS.
  • Experience with algorithms, data structures.

Bonus:

  • Good understanding of Linux containers (e.g. Docker).
  • Concurrency and multi-threading in Java.
  • Experience in the finance or gaming industry.
