Site Reliability Engineer
Location: London
Salary: Up to £700 per day
Searching for a Senior SRE / Site Reliability Engineer to join a leading market research organisation reporting directly into the Head of SRE. I am looking for a consultant who has specific experience in developing SRE principals within a team and has knowledge of containers, scaling, distributed systems, networking, cloud native and many more. The ideal profile should have come from a software development background and understand DevOps ways of working.
Key Responsibilities
- Partner with Product Owners to agree availability targets that our customers value the most.
- Document every action so your findings turn into repeatable actions-and then into automation.
- Make monitoring and alerting alert on symptoms and not on outages.
- Debug production issues across services and all levels of the stack including real user experience issues.
- Be on a standby rotation to respond to availability incidents and provide support for Product teams with customer incidents.
- Learn from your time on-call to prevent incidents from ever happening.
- Run our Infrastructure Platform with Terraform and Kubernetes.
- Use the Infrastructure Platform to run your product as a first resort and make suggestions to improve the platform as much as possible.
- Improve the deployment process to make it as boring as possible.
- Plan the growth of your product’s infrastructure.
- Design, build and maintain paved road modules that allows products to scale.
Key Skills
- Infrastructure as Code – IaC
- Terraform
- CI/CD
- Python / JavaScript
- Kubernetes – EKS
- Cloud
- AWS
Site Reliability Engineer
Location: Southampton
Salary: £650 – £690 per day
Scope
- Migrate our CI/CD infrastructure to AWS
- Enhance our existing AWS Product implementation with the latest best practices.
Requirements:
- Amazon Web Services (specifically networking/VPC and IAM)
- Terraform
- Git, Gitlab
- Experience in designing, building, and maintaining high performance, highly available, production hosting environments, preferably using AWS (VPCs, security groups, RDS, S3, EC2, EKS)
- Understanding of security engineering and security best practices
- Good knowledge of at least one high-level scripting language, such as bash
- Advanced knowledge of best practices and tooling for Continuous Integration and Continuous Deployment
- Containerisation: Docker & Kubernetes
- Experience with monitoring and tooling such as Elastic, StatusCake, PagerDuty
Desirable:
- Agile delivery
- Thorough understanding of major system components used in completing assigned tasks (i.e Linux, Networking, Storage or Databases).
- Java/Spring programming experience
Site Reliability Engineer
Location: London
Salary: £600 – £615 per day
A unicorn client of ours is seeking a highly skilled Site Reliability Engineer to join immediately on a contract basis.
Essential requirements:
- Proven track record within the SRE/DevOps space
- Advanced usage of Terraform to build AWS resources
- NewRelic
- Strong development skills across Python/Java
- Kubernetes
Desirable requirements:
Any experience across the below would be highly desirable;
- Circle CI, Jenkins, Azure DevOps
- Open Telemetry frameworks