Cloud Operations Specialist
Our Digital & Technology team wakes up every day with one goal in mind – to connect Canadians to the people and things that matter most. Collectively, we’re proud to support 30 million Canadians each month.
We manage a robust portfolio that champions the leading edge of technology and media. We drive projects that expand connectivity to underserved communities from coast-to-coast-to-coast; build and enhance our fixed broadband network to provide high-speed Internet, TV and Smart Home Monitoring; and support our world class wireless network, offering our customers Canada’s largest and most reliable 5G network. As the Digital & Technology team, we are building our tomorrow, today.
Come play a key role in building the future of innovation in Canada, Let’s make your possible.
Do you enjoy working on high-scale, complex, and high visibility projects and programs? If yes, consider the following opportunity:
Rogers has an exciting opportunity if you are passionate about technology. Be a part of a dynamic team responsible for driving forward our portfolio while championing Public Cloud and DevOps collaboration across the organization. Aside from your technical skills, you’re a leader that’s passionate about SRE / DevOps, enjoy coaching others and spreading the culture. As a Sr. Cloud Operations Specialist, you will mentor other team members and collaborate with cross-functional teams across IT to plan for and maintain a cost effective, high performing, highly available and scalable cloud infrastructure. This is a unique opportunity to work with leading-edge technologies in a fast-paced environment within a large organization.
RESPONSIBILITIES:
- System and application management in production and non-production environments.
- 24/7/365 rotational on-call escalation support to ensure delivery of first-class service and support to customers.
- Establish best practice standards for a cloud adoption & automation framework.
- Full stack monitoring, alerting and troubleshooting including network, storage, OS, compute, services & applications.
- Incident management including root cause analysis and remediation.
- Security and compliance management with centralized logging, threat analysis and remediation.
- Proactive system analysis, tuning and optimization.
- Support Architects in the development of the architectural direction and governing policies.
- Establishing necessary testing requirements to evaluate proposed designs and validate technology platforms
QUALIFICATIONS:
- Passion for emerging technologies and love keeping up with the latest technology trends.
- Customer centric, and realize the value of providing excellent customer service.
- Strong written and oral communications skills and the ability to communicate with all levels of users/management
- Strong organization skills, interpersonal skills and excellent attention to details
- Strong understanding of Agile methodologies & Scrum framework
- Ability to understand and apply new technologies with a strong desire to learn and improve
- Ability to effectively prioritize, make decisions, solve problems and execute tasks in a high-pressure environment
- Ability to work independently and within a team on multiple simultaneous tasks within a challenging fast paced customer-oriented environment.
- Thorough understanding of Internet architectures;
- Experience with System Administration / DevOps experience in a large-scale, multi-OS (Linux/Windows) high-availability environment required with knowledge of infrastructure automation, infrastructure as code, security governance & compliance, patch, cost and performance management
- Experience with configuration management tools (Ansible, Chef, Saltstack or Puppet) and continuous integration tools (Jenkins, Bamboo, CircleCI, Capistrano)
- Experience working with programming and scripting languages such as (Bash, Python, Go, PHP, Perl)
- Experience with cloud computing and application platforms; AWS, Azure, OCI
- Experience working with databases, including installation, configuration, monitoring, backups and tuning (MySQL, Cassandra)
- Experience with containerization & orchestration: AKS, EKS, Docker, Kubernetes,
- Experience setting up and regularly using monitoring, trending and logging tools (Graphite/Grafana, Zabbix, CollectD, StatsD, Sensu, Nagios, ELK, Librato, Graylog, Cloudwatch, Azure OMS, Newrelic, Dynatrace)
- Experience with revision control systems and usage for managing systems configuration; SVN, Git, etc.
- Experience with managing, operating and monitoring backup systems
- Familiarity with fundamental networking/distributed computing environment concepts; can configure NFS, key-based SSH; DNS; familiar with basic networking concepts: switching, routing, OSI network layers, firewalls, load-balancing at various layers, SMTP, HTTP, SSL, SSH, LDAP
- A solid understanding of a UNIX-based operating system; understand paging and swapping, inter-process communication, devices and what device drivers do, file system concepts (LVM, clustering, logical partitions) as well as familiarity with Windows-based operating systems; IIS configuration, services, authentication, policies. RDP.
Schedule: Full time
Brampton, ON, CA
Job Segment:
Cloud, Operations Manager, Systems Analyst, Performance Management, Technology, Operations, Human Resources