DE Jobs

Search from over 2 Million Available Jobs, No Extra Steps, No Extra Forms, Just DirectEmployers

Job Information

UTAH VALLEY UNIVERSITY Site Reliability Engineer I - Operations (PT, Variable Hour) in OREM, Utah

Site Reliability Engineer I - Operations (Part-Time, Variable Hour)

 

Salary: $19.19 - $23.99 Hourly

 

Job Type: Part-Time Staff

 

Job Number: FY2402617

 

Closing: 5/6/2024 11:59 PM Mountain

 

Location: 800 W University Parkway, Orem

 

Division: Office of Information Technology

 

Position Announcement

 

With guidance from Senior Administrators, Operate and control all computing equipment relating to infrastructure and applications, including servers, mainframes, databases, and software. Install and set up equipment, troubleshoot and resolve errors, perform preventative maintenance, and plan down-time procedures. Back up data and ensure system security. Oversee maintenance of room environment (e.g., cleaning, cooling, and power).

 

Work with Atlassian products such as Confluence, Opsgenie, Insight, and Jira for responsibilities like documentation of operational processes, monitoring and alerting, and managing and automating organizational data. They will participate in business hours on-call rotations by validating and communicating alerts publicly and internally from integrated systems across the organization according to the Operational standard procedures. Design, implement, and maintain site reliability processes and systems that increase efficiency, eliminate downtime, and maintain performance at scale across platforms. Develop tools to ensure that the organization's services (internally critical and/or externally visible systems) have reliability and uptime appropriate to users' needs. Diagnose, resolve, and escalate service-impacting issues. Use CI/CD pipeline to support automated testing and deployments. Develop monitoring and alerting platforms to detect and resolve performance-impacting issues. Collaborate on complex, cross-functional technical projects.

 

Summary of Responsibilities

 

• With Supervision epic plan and execute projects related to the three pillars of IT operations: operational processes, change incident problem, and Ops Readiness.? Assist in execution of monitoring systems and alerting configurations so that operation knows about outages before users do.

• Create, facilitate, and integrate documentation including installation steps, standard operating procedures, incident runbook, and disaster recovery documentation into a curated change/incident/problem management library. Assists Network, Application, Database, and Systems Administrators with the enforcement of standard procedures, act as remote hands within a secure data center, and maintain all required supplies and tooling for the deployment of physical enterprise equipment.

• As an incident commander, participate in business hour on-call rotation, evaluate incoming alerts for validity, and dispatch the appropriate SME to resolve the issues. Execute public communications in accordance with Operational standard procedures informing stakeholders of possible service disruptions. Maintain integrity of Runbooks.

• Perform other job-related duties as assigned.

 

Minimum Qualifications

 

Requires an associate's degree with 2 years of experience or 4+ years of combined education and relevant experience.

 

Knowledge, Skill, and Abilities

 

• Knowledge of Linux and Windows Operating systems, TCP/IP fundamentals, firewall management, and anti-virus software.

• Knowledge of best practices for securing operating systems, data center maintenance, and network setup.

• Knowledge of various Monitoring solutions such as Prometheus, PRTG, Site24x7, TestCafe, Selenium, Splunk, NewRelic, Azure Monitor, and AWS CloudWatch.

• Knowledge of storage technologies such as SAN or NAS.

• Knowledge of A zure Active Directory, Active Directory, and LDAP.

• Knowledge of load balancing, clustering, and enterprise server architecture.

• Knowledge of Relational Database principles and databases/languages such as PL/SQL, MySQL, SQL Server, Oracle, Microsoft SQL, or M Access.

• Knowledge of the Atlassian Suite, including Jira, Confluence, Status Page, and Opsgenie.

• Knowledge of Scrum/Agile principles as applicable to a DevOps Team.

• Communicate effectively in normal and high-pressure situations verbally and through written mediums.

• Perform basic server, system, and application procedures such as managing user access, performing maintenance, and troubleshooting.

• Skills in troubleshooting hardware and software problems and researching technical issues.

• Experience using basic CLI tools in Windows and Linux operating systems to troubleshoot and gather information.

• Skills in customer service and interpersonal communication, both verbally and written.

• Basic scripting and programming skills in languages such as Python, JavaScript, JSON, SQL, Bash, TestCafe, and Selenium.

• Experience with instant communication and team collaboration platforms like MS Teams, Slack, or Jitsi. Skills in working in an ITSM solution such as Jira, ServiceNow, Asana.

• Ability to identify, research, troubleshoot, and implement solutions for hardware and software problems.

• Ability to work in a customer service, team-oriented, collaborative, Scrum/Agile environment.

• Highly self motivated with the ability to learn quickly and accept feedback from peers.

• Ability to learn the implement process, and maintenance procedures for new technologies, equipment, hardware, and software such as operating systems, ITSM tools, monitoring solutions, and data center management.

• Ability to act as an ?on-call? incident commander for communicating outages between customers, subject matter experts, teams, and leaders.

• Ability to create proposals in visually-pleasing and user-friendly language.

• Ability to think critically and solve complex problems.

• Ability to perform tasks in a timely and professional manner.

 

jeid-0769124f02939641b3c783bffeb7a48b

 

Copyright ©2024 Jobelephant.com Inc. All rights reserved.

 

https://www.jobelephant.com/ 

#OtherSectors

DirectEmployers