NOC Engineer
This is a full-time NOC Engineer role. The NOC Engineer will be responsible for monitoring and maintaining network systems and servers, identifying and troubleshooting issues. You would be working closely with the Incident Management, IT teams and 3rd party service providers to ensure the availability of all systems & applications to our internal users & external members.
Skills and Responsibilities:
- Supporting NOC operations for continuous monitoring of Global IT Infrastructure (24x7) and services.
- Understanding on Cisco Routers and Switches, for gather diagnostics and do initial troubleshooting
- Expertise in troubleshooting Interface and routing issues
- Understanding of GobalProtect Operations, ability to gather and analyze user logs and troubleshooting
- Understanding of Palo Alto Operations, ability to check routing and firewall rules, ability to analyze traffic logs
- Ability to log into Windows Servers, gather diagnostic's and do initial troubleshooting
- Ability to do basic DNS troubleshooting
- Ability to work on issues with Windows/Linux servers
- Ability to aid IT Teams as or when needed and work closely with the vendors.
- Comfortable in working 24X7 environment.
- Comfortable in working in rotational shifts.
- Understanding of Active Directory.
- Monitor all the Consoles and Ensure every Alerts is picked for investigation.
- Immediate response to alerts generated from the monitoring tools.
- Basic Understanding of Monitoring Tools like SolarWinds/PingDom/Datadog/CloudWatch etc.
- Basic Understanding on AWS Cloud and how are they monitored using AWS CloudWatch.
- Basic Understanding of polling methods like SNMP/ICMP/WMI/Agents etc.
- Understanding of IT Infrastructure to be able to effectively escalate issues to the right team/individual
- Understanding of Jira Service Management for ticketing of alerts.
- Ability to multitask and handle multiple alerts
- Ability to contextualize alerts and identify criticality of alerts
- Excellent verbal and written skills
- Ability to Collaborate across departments and work as part of a team.
- Average experience of 5 years
Good To have
- Knowledge on Configuring, Monitoring and tracking Alerts generated by different Monitoring Systems (Solarwinds, Pingdom, AppDynamics, Datadog).
- Operational knowledge of Confluence for knowledge base and articles(Atlassian – Jira).
- Knowledge on observability Tools and their concepts.
- Knowledge of Application Performance Monitoring (APM).
- Working Knowledge on any of the Observability Tool (like Datadog).