Our client, a growing financial services firm, is seeking a Senior Service Reliability Engineer to join the team and partner in the design, deployment, and automation of application services across the organization. This position will also collaborate on provisioning software and managing cloud infrastructure for continuous integration, automation, and proactive monitoring.
The Engineer will help plan complex releases and service updates and is responsible for driving service improvement through metrics and analysis. The firm is investing heavily in cloud services such as AWS and Azure, and the SRE team is pivotal in the transition to a containerized, microservices environment. They will be responsible for developing best practices and templates that all teams can leverage. This role will be key to improving CI/CD automation and service deployment. In addition, the Engineer will join peers in Cloud Engineering on overall environment design and support.
Skills:
- 2+ years of experience with AWS services such as VPC, EC2, ECS, ELB, S3, IAM, RDS, Redshift, Neptune, OpenSearch, Route 53, Lambda, API Gateway, etc.
- 5+ years of experience working with Windows, Linux, IIS, .NET, Python, and Go environments at scale
- Understanding of containerization and experience leveraging Docker and Kubernetes at scale
- Ability to program in one or more high-level languages, such as PowerShell, Python, or Go
- Experience with configuration management, Ansible preferred
- Familiarity with APM and service telemetry environments
- Deep knowledge of Git and Azure DevOps pipelines
- Solid knowledge of cloud security and best practices
- Solid background in network, storage, and DNS services
- Understanding of project management tools, techniques, and methodologies, including Agile
- Bachelor's Degree in Computer Science or relevant work experience preferred
location: New York, New York
job type: Permanent
salary: $130,000 - $140,000 per year
work hours: 8am to 4pm
education: Bachelor's
responsibilities:
- Partner with development teams to improve services through rigorous testing and deployment procedures
- Leverage automation such as Infrastructure as Code and pipelines for efficiency and consistency
- Gather and analyze metrics from different services to ensure and improve reliability and availability
- Own application integration activities for monitoring tools such as CloudWatch, New Relic, Sumo Logic, Datadog, and PagerDuty
- Analyze software applications to identify vulnerabilities using CSPM, Xray, and SonarQube
- Design and evolve deployment systems and pipelines for reliability, security, and efficiency
- Develop deep insight into application and service performance
- Troubleshoot priority incidents
- Advise teams on industry best practices for security, deployment, and monitoring
- Gain deep understanding of supported services
qualifications:
- Experience level: Experienced
- Minimum 5 years of experience
- Education: Bachelor's (required)
Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.
For certain assignments, Covid-19 vaccination and/or testing may be required by Randstad's client or applicable federal mandate, subject to approved medical or religious accommodations. Carefully review the job posting for details on vaccine/testing requirements or ask your Randstad representative for more information.