About the Role
You’ll be responsible for monitoring of general uptime and availability for all applications owned by NOBITEX. The SRE role is embedded within the cross-functional with teams and DevOps and Security team. You’ll have the opportunity to design systems and solutions that best support the needs of your team and use best practices to defend against cyber-attacks within a large-scale business
Responsibilities:
- Identify sources of instability in large-scale distributed systems and drive operational excellence
- Analyze complex systems from a reliability and resilience perspective
- Improve reliability and drive down the burden of toil with tooling and automation
- Implement and continually improve application and system monitoring.
- Resolve complex technical issues as necessary
- Use modern tools to streamline configuration management
- Diagnose complex system performance problems using dumps, traces, or other diagnostics aids
- Third party Integrations
- Incident Response
- Implement and continually improve application and system monitoring.
- Participate in on-call rotations
- Task automation