About the Role
Job Details
Senior Site Reliability Engineer (Infrastructure)
The candidate should be self-motivated, production sensitive, and technically talented to support Integral’s 24x7 FX trading environment. In this role, the candidate will provide monitoring and support for our hosted applications on Linux and Windows based systems and will also be responsible for the maintenance of our Linux and windows systems.
Responsibilities
Work in a demanding, fast-paced atmosphere.
Actively monitor the stability and performance of Integral’s FX trading platform in 4 global data centers
Support a 24x7 distributed enterprise environment.
Ensure standards and SLAs are met, including response time, follow up, ticket updates, and resolution
Escalate events as required by the documented procedures with the proper level of urgency and follow-through
Interface with engineering teams to coordinate next actions
Perform standard systems troubleshooting - diagnose troubles detected by our systems and work quickly to resolve issues
Apply applications specific updates and fixes.
Automate existing operating procedures.
Problem determination, workaround resolution, root cause analysis, major incident management
Installing and upgrading Linux and windows system software.
Detecting and troubleshooting software and hardware issues.
Requirements
Computer Science related Bachelor's Degree or equivalent experience.
Previous experience dealing with support cases or requests via e-mail, telephone, ticketing system.
5+ years of equivalent experience, including experience with Linux systems administration, networking and database technologies
Must have a good understanding of distributed computing and solid understanding of networking and UNIX system concepts
Working knowledge of at least one of the following scripting languages is also preferred Perl, Python , Shell , JavaScript
Understanding of Dockers or containers is desirable.
Experience administering and deploying development CI,CD tools such as, Jenkins required
Understanding of Configuration Management and Deployment tools like salt, Puppet will be an added advantage
Understanding of change and release management.
Working knowledge with SSL certificates installation and management is required
Basic knowledge of networking principles including routing, subnets, TCP, IP, VLANs, multicast and UDP.
Must possess excellent written and verbal communication skills and be able to interact effectively and professionally with customers and engineers throughout the world
An addiction to working in a super-fast-paced environment and resolving multiple interrupt-driven priorities simultaneously is a must
Always curious and wanting to learn new things
Dependable and strong work ethic
Strong customer service mindset