DevOps (Site Reliability) Engineer

  • location: Boston, MA
  • type: Permanent
  • salary: $105,000 - $115,000 per year

job description

Candidate Description

Our client is seeking a highly motivated DevOps (Site Reliability) Engineer to join a small, highly collaborative team and help create the next generation of data products for trading and investing.

Successful candidates will be curious, independent thinkers who are excited by challenges and driven to build great products. They will play a key role in developing the client's Machine Intelligence infrastructure.

Responsibilities

  • Design and implement scalable and pluggable Machine Learning-based systems.
  • Lead automation efforts to eliminate manual work involved in building clusters, performing releases and other operational work.
  • Build and deploy software, analyze logs and telemetry data for issues.
  • Apply good software development methodology in writing automation code.
  • Provide on-call support for critical services.

Example Project

  • Build distributed services and APIs to access hundreds of real-time and batch data sources
  • Develop a data ingestion and normalization framework that can collect and process data from hundreds of sources daily (and in real time)

Background

We will consider candidates from a wide range of backgrounds. However, many of the problems the candidate will be tasked with solving require designing systems and applying good software development methodology when writing automation code. Therefore, candidates with a computer science or engineering background are preferred.

Experience

Successful candidates will have:

  • Professional experience in DevOps (Site Reliability) Engineering
  • Experience deploying applications in a production or mission-critical environment
  • Experience in financial services is a plus.

Education

  • Bachelor's Degree in Computer Science, Engineering, Physics, Mathematics, or similar quantitative discipline.
  • Master's Degree a plus.

Skills

Required

  • Understanding of distributed systems
  • Good grasp of Linux systems, networking and security
  • Thorough understanding of cloud-based architectures
  • Working experience with Java (highly desired), Python or Scala
  • Experience with orchestration systems
  • Experience with REST APIs, databases and/or key-value systems
  • Experience with infrastructure-as-code tools such as Terraform
  • Experience with CI build tools such as Jenkins
  • Strong familiarity with Big Data technologies and architectures: Hadoop, Spark, Kafka, etc.
  • Experience with containers and scalable computing platforms: Docker (ECS), Mesos
  • Experience implementing and administering logging, telemetry and monitoring tools
  • Know-how in maintaining and debugging distributed systems in a Java runtime environment
  • Knowledge of Lambda architecture
  • Excellent communication skills
