CloudShare delivers the world's most advanced environments for software
development, testing and training in the cloud. CloudShare gives enterprise customers
the best of what cloud computing can offer. With our advanced automation,
intuitive self-service, and full visibility and control, we enable our customers to
quickly and easily use cloud computing to accelerate their businesses.
CloudShare is looking for a talented System Administrator to join our IT team and
work closely with our software developers and core teams to design, scale and maintain
our production infrastructure. In this capacity, you will be responsible for the most
critical infrastructure and development clusters at CloudShare, working with the latest
in storage, hardware, virtualization and cloud technologies to ensure that our
operations run smoothly.
Responsibilities:
• Solve complex UNIX/Linux issues in production/colo, lab, and customer environments
• Deploy and scale production systems with minimal downtime
• Secure/harden operating systems against compromise
• Assist in network and systems infrastructure design
• Develop creative solutions for complex technological challenges
• Identify disaster scenarios; plan and test recovery
• Install, configure, and manage network routers, switches, load balancers, firewalls,
IPS, VPN, console servers, and other network devices
• Assist with management of the virtualization environment and storage
• Explore new tools that can improve system management, such as Docker and Ansible
• Work directly with onshore and offshore technical teams to quickly detect, diagnose,
and resolve issues within client network infrastructure; own the escalation process
24x7.
• Respond to and provide guidance during observed and declared operational events
• Investigate, analyze and troubleshoot customer environment emergencies: establish a
plan of resolution, execute the plan while mitigating risk or disruption to the customer
environment, and determine the root cause
Requirements:
• 5+ years of experience with UNIX/Linux system administration
• Advanced knowledge of TCP/IP and networking
• Network Administration and design (Cisco, Linux firewalling / routing)
• Virtualization experience (VMware)
• Advanced scripting experience using Python, Bash, PowerShell, etc.
• Experience in operating a 24/7 high-availability IT infrastructure
• Experience with monitoring solutions and services (Nagios/Icinga, PagerDuty, etc.)
• Willingness to participate in a 24x7 on-call rotation
• Ability to work independently with minimal supervision
• Fun, enthusiastic and outgoing team player