Software Engineer - Site Reliability

  • Kensho
  • Cambridge, MA, USA
  • Jul 19, 2018
Full time Data

Job Description

As a Site Reliability Engineer (SRE) at Kensho, you are a thoughtful, collaborative, and dynamic technologist who loves building the infrastructure that helps others do their jobs more effectively and efficiently. You think deeply about implications, relationships, edge cases, and failure modes, and you are passionate about correctness, uptime, and stability, so that you can spend your time writing the next thing rather than maintaining older projects.
Are you a prolific, intellectually curious technologist who appreciates code, reliability, and beauty? We are on a mission to clarify complex data through scientific, statistical, analytical, computational, and inspired study. By transforming the data, we are able to bring transparency to some of the most important issues on the planet. You will be joining a team of veterans from Google, Twitter, and Facebook, as well as academia.

What You'll Do:

    • Run Kensho’s production services that support the world’s top banks
    • Monitor, maintain, and help scale Kensho’s web-based financial applications
    • Design and build advanced automated operational and deployment frameworks alongside tooling and infrastructure to help engineering teams measure and increase their velocity
    • Use automated frameworks for unit, integration, end-to-end, smoke, dirt, and other testing approaches to detect issues early
    • Cultivate full-team participation in high-quality, thoughtful software
    • Take periodic “pager-duty” shifts (required)

What We Look For:

    • Experience running production services in a modern, containerized cloud environment
    • Expertise in automated and scalable testing, automation, and continuous integration frameworks and best practices
    • Desire to build a strong, operationally minded engineering culture
    • Practical understanding of algorithms, data structures, and design patterns
    • Effective coding, documentation, and communication habits
    • Thoughtful and collaborative code reviewer and teammate
    • 3+ years of experience

How to Really Get Our Attention:

    • Major technical contributor at a top 10 software company
    • Your open source projects show innovation and initiative
    • Hedge fund or major financial institution trading experience
    • Research, publications, and patents

Technologies We Like:

    • Postgres, Kubernetes, HAProxy, Jenkins, Git, Docker, Cypress, Prometheus, Kibana, Elasticsearch, Grafana

Benefits:


    • Medical, Dental, and Vision insurance with 100% premium covered
    • Unlimited vacation days
    • Paid Parental Leave
    • 401(k) plan with employer match
    • Free snacks and drinks
    • Dog-friendly office
    • Cardio machines and weights in the office
    • Hubway (bike sharing program) membership