Accessibility Links

Platform Reliability Engineer - Java-Linux-Jenkins-containers

  • Job reference: 27592
  • Location: City of London
  • Job type: Permanent
  • Start date: Not specified
  • Contact: Not specified
  • Sector: Infrastructure & Cloud Services
  • Salary: £70000 - £80000 per annum

Sorry, this vacancy has now expired.

Please see our job results page to find our current open vacancies or take a look at our Specialisms below and go straight to your area.

BI, Data & Analytics, Change Management, Digital and Development, ERP and CRM Systems, Executive Leadership, Information and Cyber Security, Infrastructure & Cloud Services, Interim Management, Strategy & Architecture, or Work For La Fosse.

 

 

 

 Return to homepage.

Senior Platform Reliability Engineer

Platform Reliability Team, is a highly skilled team of software engineers focused on building performance and reliability into the clients state of the art trading platform. The clients vision for future and evolution of its architecture and development practices are constantly presenting a new set of challenges. The team is focused to play its key part in the strategy to sustain its technology leadership.

The team is characterized by passion for continuous improvement, natural problem solver, in-depth understanding of reliability patterns, and enablers of DevOps culture. Our strategy is to empower development teams to deliver reliable high-performance software, and develop control and platform intelligence mechanisms to ensure that only high standard components get as far as production.

The selected candidate will play an important role in driving the team's strategy forward and reports to a Platform Reliability Lead. The responsibilities also include:

  • Engage in and improve the whole lifecycle of services-from inception, through design (software) and refinement (deployment, operational, monitoring).
  • Engage with Development teams before services go live through activities such as system design consulting, development of libraries/services to introduce standardized libraries, capacity planning and launch reviews.
  • Ensure once services are LIVE they have monitoring and alerting to validate overall system health. This should go beyond basic process checking and touch various other aspects of engineering like measuring end-to-end latency, kernel level metrics (sockets overflow) etc.
  • Discover Single-Point-Of-Failures within a System.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and resiliency.
  • Practice sustainable incident response and blameless postmortems.

The role has an element of Out-Of-Hours support. Each person in the team will be on Escalation and will be involved to investigate & technically coordinate critical issues with the trading platform.

The character and attitude is as important as technical capability hence; We are looking for someone who is full of energy and positive, friendly and easy to work with, a great team player, and has a can-do attitude. She/he is technically very strong, passionate about technology and excites people around them too. A person who enjoys troubleshooting, is analytical and able to think logically, and can communicate effectively and precisely.

Required Experience:

  • Experience in developing reliable applications with good design patterns and latency critical.
  • Work experience in environments characterized by high throughput, low latency, zero downtime, frequent production deploys.
  • Troubleshooting complex problems in a distributed environment.
  • Design technical solutions.
  • Driving a project from inception to production and overseeing all steps involved.
  • Good communication skills and able to inspire colleagues.

Required Skills:

  • In depth understanding of Java and JVM.
  • Continuous Delivery principles.
  • Messaging Systems e.g. JMS.
  • Experience with algorithms, data structures.

Bonus:

  • Good understanding of Linux containers (i.e. Docker).
  • Concurrency and multi-threading in java.
  • Experience in finance or gaming industry is a bonus.

Related jobs
3rd Line EUC Engineer
  • Contract
  • City of London
  • £400 - £475 per day
  • Reference 34164
  • Global FTSE 100 company based in the city requires a 3rd line support engineer to take a key role within the global service delivery team. They will be focusing on 3rd line engineering for the end user compute environment. The 3rd line support engineer will have the following experience: Cloud experience (Azure...
Read more
Network Engineer
  • Permanent
  • Hinckley
  • £35000 - £45000 per annum + car allowance and benefits
  • Reference 33290
  • Role: Network Engineer Location: Hinckley, Leicester Salary: £35-45k + benefits (car allowance, 10% bonus) I'm currently working with a highly successful FTSE 250 construction company that are looking for a Network Engineer to join their growing team. They are transitioning to the cloud and have large projects...
Read more
Operations Engineer
  • Permanent
  • City of London
  • £45000 - £65000 per annum + Benefits
  • Reference 33919
  • Position - Operations Engineer Location - Central London Salary - £45,000 - £65,000 Background: We are working with a hugely forward thinking organisation making cutting edge scientific breakthroughs and working closely with the NHS. They are looking for a Operations Engineer who will be responsible for the...
Read more