Senior Site Reliability Engineer

Job description

We are looking for an experienced SRE to join our team. The ideal candidate will be able to work in a fast paced environment, operate gracefully under stress, effectively manage multiple assignments, be self driven, proactive and have great interpersonal and communication skills.

What you'll do

  • IT Infrastructure Management
  • Provisioning of infrastructure components for supporting business services
  • Developing, configuring, and deploying tools to be used by SRE/DevOps teams
  • Developing and maintenance of system documentation
  • Change Management: Determining and validating new features and updates for the managed infrastructure
  • Site Availability, Reliability and Serviceability
  • Handling support escalation issues and incident reviews
  • Availability and Performance Monitoring
  • Service Recovery and Emergency Response
  • Capacity Planning
  • Automated Infrastructure Provisioning (Infrastructure as Code)

Required Technical Skills

You have worked as an SRE or equivalent position for more than 8 years.

You can demonstrate strong knowledge and working experience on the following technology stack:

  • Linux Systems Administration (RedHat Enterprise or Ubuntu)
  • Kubernetes
  • Linux Containers (docker/podman)
  • Helm
  • Istio (or similar service mesh)
  • Terraform
  • Bash (or KSH,CSH)
  • AWS EKS and/or GCP GKE

Not mandatory, but the following are highly appreciated skills:

  • CI/CD: GitLab Pipelines, GIT, GitHub Actions
  • Cloud Providers: AWS (EC2, RDS, etc), GCP, Azure
  • Languages: Python, Go or equivalent
  • Databases: MongoDB, PostgreSQL, AWS RDS, AWS DynamoDB, AWS DocumentDB
  • Security hardening: Linux OS, Kubernetes, Istio, Containers
  • Networking: AWS LB, Ngnx, Envoy
  • Monitoring: Datadog, AWS Cloudwatch, Prometheus, Graffana


Eclypsium delivers a cloud-based enterprise device security platform for modern distributed organizations. From corporate laptops and desktops, to servers in data centers, to network infrastructure devices, Eclypsium protects the devices that organizations rely on, all the way down to firmware. Eclypsium provides comprehensive device and firmware inventory, automatically identifies and patches firmware risks, scans devices for supply chain breaches, and continuously monitors devices for persistent and stealthy firmware attacks. Eclypsium’s cloud-based solution is deployed in minutes. Protecting Fortune 100 enterprises and federal agencies, Eclypsium was named a Gartner Cool Vendor in Security Operations and Threat Intelligence, a TAG Cyber Distinguished Vendor, one of the World’s 10 Most Innovative Security Companies by Fast Company, a CNBC Upstart 100, a CB Insights Cyber Defender, and an RSAC Innovation Sandbox finalist. For more information, visit


Eclypsium is headquartered in Portland, OR with distributed remote employees and global teams in Argentina and the Bay Area. We offer competitive compensation and benefits packages and are committed to the wellbeing of our employees and their families.

Benefits & Perks include:

  • Competitive compensation & startup equity
  • Comprehensive medical, dental, vision coverage
  • Life insurance, short term and long term disability coverage
  • Flexible time off
  • Employee assistance program
  • Paid parental leave
  • Paid sabbatical
  • Access to learning & development platforms
  • Home office support for remote employees
  • Regular events and celebrations


Eclypsium is an equal opportunity employer. We believe in the importance of diverse teams and value candidates of all backgrounds. We do not discriminate on the basis of age, ancestry, citizenship, color, ethnicity, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or invisible disability status, political affiliation, veteran status, race, religion, or sexual orientation.


Enterprise device security platform