Site Reliability Engineer

Burlington, VT

Post Date: 10/03/2017 Job ID: 10155248
  • Design and assist in the authoring of software tools that reliably manage application delivery
  • Design and assist in the setup and maintenance of the build/release infrastructure
  • Embed with specific development teams to ensure best practices are implemented
  • Improve predictability and reliability of software releases
  • Reduce application deployment windows by leading the company toward a Continuous Deployment environment
  • Fluent in at least one scripting language in addition to Bash (Python/Perl/PHP/Ruby), or demonstrated ability to write programs in a high-level programming language such as C++, Java, or Ruby
  • Linux (CentOS/RHEL/Amazon Linux) system engineering expertise
  • Configuration management systems (Puppet, Ansible, and Docker knowledge preferred)
  • Networking knowledge (AWS VPC experience is a plus)
  • Experience using metrics and monitoring to ensure customer SLA objectives are met
  • Experience operating Cloud Computing platforms (e.g. Amazon AWS, Google Compute, Azure) and their PaaS-based components (Elastic Beanstalk, CloudFront, S3, RDS, etc.)
  • Containerization platforms (Docker, Rancher, Kubernetes)
  • Agile development, testing, and deployment expertise
  • Experience in Java including Spring Boot
  • Distributed version control system experience (Git preferred)
  • Database operations at scale (MySQL, MongoDB, Dynamo, RDS)
  • Maven, Gradle, and Jenkins
  • Experience with application telemetry tools such as InfluxDB, Prometheus, Grafana, Datadog, or New Relic
  • Experience with log aggregation and anomaly detection platforms such as Splunk, Sumo Logic, Graphite, CloudWatch, or ELK stack
  • Big data platforms such as Cloudera, Vertica, Hadoop, Amazon Redshift, or Elastic MapReduce
  • Package management platforms such as npm, pip, RubyGems, rpm, and others
