DevOpsTW / jobs


Closed [Faria (US company) Hiring] - Site Reliability Engineer #23

FariaHR opened 5 years ago

[DESCRIPTION]

Faria Education Group is a world leader in international education SaaS systems and services, serving more than 3 million students across 10,000 schools in over 130 countries through our 3 integrated systems: Atlas for curriculum planning, OpenApply for admissions and enrolment, and ManageBac for learning management. This role will focus specifically on supporting our OpenApply product. We develop on Ruby on Rails with excellent tools, full technical control, and clear product demand.

Site Reliability Engineers are hybrid systems and software engineers who are responsible for and take ownership of reliability, automation, and other issues related to “keeping the lights on” across Faria’s multi-product SaaS systems stack.

SREs are integrated within the Technical Operations team and work under the Head of Technical Operations and with the CTO and Principal Developers. We are looking for engineers who want to be a part of developing infrastructure software, maintaining it and scaling it.

[LOCATION] Taipei, Taiwan

[RESPONSIBILITIES]

[REQUIREMENTS]

HARD SKILLS:

  1. MUST HAVE — At least 5 years of system administration experience for a high-usage, web-based software service, ideally built using open-source software components

  2. MUST HAVE — Knowledge of AWS services and APIs, including EC2, S3, VPC, and IAM

  3. MUST HAVE — Knowledge of and familiarity with alerting and monitoring tools and system management tools for Linux environments (including DataDog, Nginx, NewRelic, CloudFlare, MySQL/PostgreSQL, Apache, IPTables, and the ELK stack)

  4. NICE TO HAVE — Knowledge of and familiarity with configuration management tools, including Ansible, Chef, or Puppet

  5. NICE TO HAVE — Knowledge of deploying / troubleshooting / tuning Ruby on Rails applications (Passenger, Capistrano, Sidekiq, Bundler)

SOFT SKILLS:

  1. MUST HAVE — Strong communication skills in English, with the ability to coordinate incident response with urgency. Fluency in Mandarin is a plus

  2. MUST HAVE — Proper remote presence & etiquette (acknowledging requests in a timely fashion over Slack and never leaving requests unacknowledged)

  3. MUST HAVE — Tagging the appropriate person and persistently reminding them every 24 hours until a full resolution is achieved (not having things fall through the cracks)

  4. MUST HAVE — Effective adherence to operating procedures (organizing day-to-day work and large-scale tasks in a calm manner with priority-driven sequencing)

  5. MUST HAVE — Experience working in China

[BENEFITS]

Please apply here if you're the one we're looking for: https://www.workable.com/j/E9BE41F5DD or email hiring@managebac.com