Site Reliability Engineer

Job Description

  • Lead technical design, implementation, configuration, development/customization, delivery, and support of application environments consisting of complex IT infrastructure on a variety of platforms
  • Organize work, task assignment, and progress tracking and reporting for a team of engineers
  • Design and implement tools to automate the creation and management of application environments, including tasks such as deployment, configuration, monitoring, and management
  • Apply sound release management and configuration management principles to ensure the stability of production environments
  • Work closely with application development, project management, operations, and systems development to produce technical designs and implementation plans, and to identify key milestones and project timelines, from the design phase through deployment and support
  • Develop and implement tools for gathering metrics, monitoring, and logging
  • Diagnose issues in production, identify root cause, and implement fixes
  • Optimize backend performance to maximize speed and scale
  • Provide input on application architecture strategies, standards, and technology decisions

Key Qualifications

  • Experience with monitoring systems such as Open-Falcon, Zabbix, etc.
  • Extensive experience in designing, configuring, and delivering large-scale application technical infrastructure
  • Experience as a project lead, supporting multiple simultaneous projects, in a high-scale environment
  • Strong coding and scripting ability (Java, C, C++, Python, Perl)
  • Strong experience with database technologies (Oracle, Mongo preferred)
  • Experience and knowledge in applying best practices to build secure platforms
  • Excellent analytical and creative problem-solving skills
  • Must be highly collaborative and able to work with different teams
  • A strong sense of focus and excellent attention to detail while working in a very fast-paced environment
  • Ability to learn new technologies in a short time
  • Strong communication skills and ability to articulate complex solutions well

Education

BS in Computer Science or a related technical field, or equivalent practical experience.