Site Reliability Engineer (SRE) Job at Hirekeyz Inc, Omaha, NE

c0xNSlZZWk1TK1E1aXY0RnZnT0l5V0YyK2c9PQ==
  • Hirekeyz Inc
  • Omaha, NE

Job Description

Title: Site Reliability Engineer (SRE)

Location: Omaha, NE / Dallas, TX

Job Type: Full Time

Job Summary :

Seasoned Site Reliability Engineer (SRE) with 5+ years of experience in supporting complex, large-scale distributed systems. Highly skilled in managing production failures, conducting root cause analysis, and driving effective remediation. Strong communicator with expertise in ing, monitoring, and release management, complemented by automation proficiency and a keen ability to learn quickly.

This role involves providing 24/7 support as part of the SRE team, ensuring the reliability and performance of mission-critical Java, .NET, and Batch applications deployed across GCP, PCF, and on-premise environments.

Years of experience needed

Candidate experience 5+ Years

Technical Skills:

Expertise in understanding large scale production systems and technologies, for example load balancing, monitoring, distributed systems, microservices, and configuration management.

Should have solid hands-on experience in troubleshooting and fixing application failures, application Performance degradation, Code issues, cloud platform issues, Batch Failures, Infra failures, DB failures, Network failures.

Hands-on experience in performing Production deployments using CI/CD and exposure to deployment strategies.

Experience in troubleshooting of Linux/Unix.

Monitor the application/Services/batch availability.

Act quickly on the application s(Performance, Availability) and Batch Job failures

Perform the required analysis (Code/Log) and escalate to the Engineering team as required.

Initiate and drive the Techlines in case of outages/major incidents/Batch abends and ensure Service Restoration in the least time possible.

Effectively handle the Incident, Problem, Release and Change management.

Own and deliver the user stories assigned as part of the sprint.

o The user stories range from application code Debugging, Issue analysis, Code fix, Knowledge base creation, documentation of SOP's, Production Deployments, Pre & Post Patching/Maintenance activities, Service Requests.

o Build monitoring solutions using APM tools like Splunk, Appdynamics, Thousand Eyes, ITRS, AppMetrics, MoogSoft, Kafka etc.

o Automate of day-day operational tasks.

o Be part of the Exit reviews to ensure the best practices are followed to have the right code deployed to Production systems

o Provide feedback/recommend improvements to the system which would enable highly stable systems.

Strong understanding of Networking Concepts (TCP/IP, SSL/TLS, IPSec, VPN etc), Firewall and Load Balancers.

Experience in Scripting Shell/Powershell/Python

Strong Experience in working with any Cloud-based infrastructure (PCF, GCP, AWS, Azure Cloud or others)

Certifications Needed:

As per industry standards

Skills

PRIMARY COMPETENCY : Production Support PRIMARY SKILL : Production Support PRIMARY SKILL PERCENTAGE : 51 SECONDARY COMPETENCY : Unix SECONDARY SKILL : Linux Administration SECONDARY SKILL PERCENTAGE : 25 TERTIARY COMPETENCY : Tools TERTIARY SKILL : Splunk TERTIARY SKILL PERCENTAGE : 24

Job Tags

Full time,

Similar Jobs

Marvel Technologies Inc

Databricks Consultant Job at Marvel Technologies Inc

 ...Databricks Consultant Location: Baltimore M.D (Remote) Duration: thru Dec 2025 Client: One Main Financials Job Summary: Sogeti is looking for a skilled and experienced Data Consultant to join our team. The ideal candidate will have a strong background... 

CaribbeanCatalyst Inc.

Chief Executive Officer Job at CaribbeanCatalyst Inc.

Career Opportunity Chief Executive OfficerOur Client, is seeking to recruit a dynamic leader for its operations in Barbados as its Chief Executive Officer (CEO). This position will report to the Board of Directors (Managers).The RoleReporting to the Managers, the... 

AO SOUTH - Lisa Cassidy

Remote Customer Service Representative (Entry Level) Job at AO SOUTH - Lisa Cassidy

&##127775; Ready to Revolutionize Your Work-Life Balance While Achieving Remarkable Success? Position: Virtual Insurance Specialist Location: 100% Remote | Work From Home Join our fully virtual team and experience a career that offers both extraordinary income... 

Mountainside (MTN)

Pediatric Licensed Professional Nurse (LPN) - FT Day - Mountainside, NJ Job at Mountainside (MTN)

 ...Job Overview: As a Pediatric LPN, you will provide rehabilitative nursing care for pediatric patients and their families under supervision in a long-term care facility. Qualifications: Required: Graduate of an accredited school of practical nursing NJ... 

Prospere Companies

Business Broker / M&A Advisors (Colorado - Durango) Job at Prospere Companies

 ...Are you ready to join a renowned business brokerage firm with over 40 years of experience? Look no further! We're expanding our team and...  .../Fort Worth, Waco, and Las Vegas Southoffers four business broker positions to help us further expand our presence and dominance...