Senior Site Reliability Engineer

Published date Posted on Indeed on Dec 26, 2021 (22 d ago)

Senior Site Reliability Engineer

Sight Machine is built on the shoulders of a unique, robust and highly scalable Infrastructure as Code model. This enables the creation and operation of customer instances in our ecosystem in a standardized and simplified manner. We are looking for team members to help us build, maintain, and improve the infrastructure that makes Sight Machine the leading provider of Manufacturing Data Pipelines and Analytics.

Great things happen when people can bring their authentic selves to work. We empower all of our team members to share their perspectives, passions and experiences because collectively we make a better, stronger team through always “open communications” mind.

Our team collaborates closely with peers & cross functional stakeholders throughout the business, our clients on the forefront of digital transformation, and the cutting edge of digital manufacturing thought leadership.

Although we have offices in SF and Ann Arbor, we have a remote-friendly culture with folks based all around the US and the rest of the world. For this role in particular, we are looking for someone that is willing to work in core Eastern Time zone hours to provide coverage for our Ann Arbor based teams.

The Role

In this role you will join our Site Reliability and Infrastructure Team in deploying, managing, optimizing and upgrading the systems that run Sight Machine software. You must love learning new technology, problem solving, and building automation in the Infrastructure as Code paradigm.

Success will take a blend of technical expertise, experience with deployment technology frameworks, customer-centric focus, and a team-spirited approach to solve architectural challenges supporting your peers in Application Engineering.


  • Employing DevOps principles, provide technical operational support for comprehensive cloud infrastructure operations for all customers, internal and external.

  • Troubleshoot and resolve complex systems problems that cross multiple layers of the systems stack from networking, to operating systems, to cloud resources, to databases.

  • Instrument Monitoring and Alerting infrastructure for critical services

  • Creating, revising, and testing operational runbooks and automation for maintaining Sight Machine Infrastructure

  • Design and code appropriate tools to support our internal platforms and systems

  • Participate in our on-call support schedule
  • Proactively pursue opportunities of operational innovation to improve stability, reliability, availability of the all platform components, and optimize efficiency, and propagate a security-first culture


  • Embody a Quality-first and Security-first culture in all that you do

  • You can work independently and have willingness to listen, learn and commit to team decisions

  • You value clear communication and you're empathetic and respectful of others

  • Strong Linux Fundamentals with 3+ years expertise managing cloud infrastructure

  • Expertise with a container-deployed service (Kubernetes, Docker), and management of a first-tier cloud provider (Azure, GCP, AWS)

  • Operational experience with monitoring/alerting systems such as Sentry, Opsgenie, Prometheus

  • Familiarity with networking systems, including IP addressing, TCP, UDP protocols, DNS, routing, and netmasks.

  • Deep understanding of cloud performance, and how to diagnose and resolve bottlenecks, and keep the performance at optimal levels

Nice to Haves

  • Experience some elements with our current tech stack: Kubernetes, Prometheus, Elasticsearch, Python, Java, Kafka, Postgres, and Jenkins

  • Coding experience in any of Python, Bash, Java, Go

  • Previous experience or a keen interest in industrial IoT, analytics, or manufacturing a plus


Ann Arbor or Remote (Eastern Time Zone hours)

About Sight Machine

Sight Machine strengthens manufacturers by providing the industry’s only standard data model and system-level visualization capabilities. By integrating all crucial data into a single innovative platform, everyone involved in the fabrication process can visualize, contextualize and examine data in one intuitive interface.

Sight Machine is committed and mission-driven to improve lives, strengthen communities and make the world cleaner through continuously re-envisioning manufacturing processes - making them more efficient, sustainable and absolute.

Founded in Michigan in 2011 and expanded to San Francisco in 2012, Sight Machine blends the spirit of technology innovation and the down to earth style of Detroit manufacturing. Our team includes early leadership from Yahoo, Tesla Motors and Oracle. Together, we share wide industry knowledge and a commitment to advance manufacturing to a more sustainable future.

We take pride in our self-starter culture where employees are enabled and encouraged to achieve their professional goals through leadership guidance, learning and development. Our philosophy is that careers are continuous journeys, and we dedicate time and offer resources so that employees can reach their full potential.

Sight Machine is proud to be an equal opportunity employer and considers candidates legally authorized to work in the US regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. Sight Machine also considers qualified applicants regardless of criminal histories, consistent with legal requirements.

Let us know

Help us maintain the quality of jobs posted on RemoteTechJobs and let us know if:

Error on reporting

Related jobs

DescriptionYour Role:Tenable is looking for an experienced Senior Software Engineer to join our Vulnerability Intelligence Feeds research team. This position will focus on the design, development, and maintenance of our framework of web scrapers, data normalizers, content generat

Morgan 6 Morgan 6 |
13 d ago

Morgan 6 is a fast-growing leader in the government contracting field based in Charleston, SC. Our mission is to expertly apply state of the art technology to improve the lives of Warriors – past, present, and future. Our core solutions include Software Engineering and Agil

Vivid Seats Vivid Seats |
24 d ago

Who we are: Founded in 2001, Vivid Seats (NASDAQ: SEAT) is a leading online ticket marketplace committed to becoming the ultimate partner for connecting fans to the live events, artists, and teams they love. We believe in the power of experiences and are fiercely dedicated to bui

ProFocus ProFocus |

TITLE: Software Development Engineer in Test - PythonLOCATION: REMOTEPAY: Target pay for this role is $120K-140K per year but may vary based on experienceENGAGEMENT TYPE: Direct HireWHAT YOU’LL BE DOINGThis is an experienced level Software Development Engineer in Test posit

US RemoteSophos LabsJob Requisition Number: AMLAB192Sophos is a worldwide leader in next-generation cybersecurity, protecting more than 500,000 organizations and millions of consumers in more than 150 countries from today’s most advanced cyberthreats. Powered by threat inte