Reliability Engineer

  • location: Glen Mills, PA
  • type: Permanent
  • salary: $120,000 - $140,000 per year
easy apply

job description

Reliability Engineer

job summary:
Technical expertise is critical in order to imagine and drive technical improvements across our database, networking, and infrastructure teams, and to partner with our application teams, implementing more robust and performant applications for our internal solutions and business solutions.

You should be someone excited with the challenge of bringing new thinking to operations and is passionate about imaginging and implementing improvements and relentlessly pursues excellence, is a deep and broad technical expert, and can build trusting relationships across teams.

 
location: Glen Mills, Pennsylvania
job type: Permanent
salary: $120,000 - 140,000 per year
work hours: 9am to 5pm
education: Bachelors
 
responsibilities:
  • Ensure user visible uptime and quality, providing operational and development expertise in making our systems fail rarely, and are fast to fix when they do fail
  • Administer daily operations of servers including log review with escalation, patch & upgrade applications, manage backup & restoration implementation and testing.
  • Install and manage windows servers and .NET applications running on windows servers.
  • Participate in architecture and design reviews to provide recommended improvements to the development teams to improve the reliability and performance of applications
  • Employee will participate in a 24/7 on-call rotation schedule providing third-level incident response with other Information Technology team members.
  • Minimize manual involvement by imagining & implementing continuous improvements that create an operating environment, including the development of new tools, dynamically monitoring, alerting, & automated self-healing & recovery
  • Identify and/or analyze problems relating to mission critical services and implement automation to prevent problem recurrence; with the goal of automating response to all non-exceptional service conditions.
  • Capable of presenting analyses and recommendations to leadership or discussing the technical merits of solutions with engineers and architects.
  • Own the day-to-day health, uptime, monitoring, and reliability of services and server infrastructure
  • Practice Agile and Scrum methodologies
 
qualifications:
  • Strong experience with software engineering
  • Experience managing Windows operating system
  • Strong experience and working knowledge of .NET applications
  • Strong experience with VSTS, or similar ALM tool
  • Working knowledge of Azure Services, especially ARM templates
  • Strong experience with PowerShell
  • Strong working knowledge of system and network architecture
  • Understanding of the concepts and principles behind DevOps, Continuous Delivery, Agile, Lean, etc.
  • Build and release management experience with Microsoft Azure DevOps
  • Use of DevOps tools to deliver and operate end-user services a plus (e.g., Chef, New Relic, Puppet, etc.)
  • Good understanding of Microsoft Azure cloud computing platform.
  • Good experience with APM, Network monitoring and load balancer tools
  • Experience and knowledge of database technologies, particularly MS SQL
  • Knowledge of virtualization and its benefits for improving reliability
  • Strong experience with instrumentation, monitoring, alerting, and responding relative to performance and availability of applications
  • Capable of technical deep dives into infrastructure, databases, and application, specifically in designing, coding, operating, and supporting high-performance, highly available services and infrastructure
  • Experience in designing for failure, including disaster recovery and business continuity planning
  • Experience operating and supporting mission-critical applications (e.g. incident and outage management)
  • Experience problem solving issues on globally distributed systems and critical product service environments
  • Knows what is possible using latest networking, infrastructure, database, and application technologies to driving automation and reliability improvements
  • Excellent at building relationships across teams
  • Firm sense of accountability and ownership
  • Desire to understand our businesses and users
  • Proficient in ITIL concepts
 
skills:
  • Strong experience with software engineering
  • Experience managing Windows operating system
  • Strong experience and working knowledge of .NET applications
  • Strong experience with VSTS, or similar ALM tool
  • Working knowledge of Azure Services, especially ARM templates
  • Strong experience with PowerShell
  • Strong working knowledge of system and network architecture
  • Understanding of the concepts and principles behind DevOps, Continuous Delivery, Agile, Lean, etc.
  • Build and release management experience with Microsoft Azure DevOps
  • Use of DevOps tools to deliver and operate end-user services a plus (e.g., Chef, New Relic, Puppet, etc.)
  • Good understanding of Microsoft Azure cloud computing platform.
  • Good experience with APM, Network monitoring and load balancer tools
  • Experience and knowledge of database technologies, particularly MS SQL
  • Knowledge of virtualization and its benefits for improving reliability
  • Strong experience with instrumentation, monitoring, alerting, and responding relative to performance and availability of applications
  • Capable of technical deep dives into infrastructure, databases, and application, specifically in designing, coding, operating, and supporting high-performance, highly available services and infrastructure
  • Experience in designing for failure, including disaster recovery and business continuity planning
  • Experience operating and supporting mission-critical applications (e.g. incident and outage management)
  • Experience problem solving issues on globally distributed systems and critical product service environments
  • Knows what is possible using latest networking, infrastructure, database, and application technologies to driving automation and reliability improvements
  • Excellent at building relationships across teams
  • Firm sense of accountability and ownership
  • Desire to understand our businesses and users
  • Proficient in ITIL concepts

Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.

easy apply

get jobs in your inbox.

sign up
{{returnMsg}}

related jobs

    Site Reliability Engineer

  • location: Conshohocken, PA
  • job type: Permanent
  • salary: $125,000 - $155,000 per year
  • date posted: 9/9/2019

    Lead Reliability Engineer

  • location: Glen Mills, PA
  • job type: Permanent
  • salary: $130,000 - $160,000 per year
  • date posted: 9/12/2019

    Sales Engineer

  • location: Philadelphia, PA
  • job type: Permanent
  • salary: $120,000 - $140,000 per year
  • date posted: 7/29/2019