SantaAnaRecruiter Since 2001
the smart solution for Santa Ana jobs

Sr Site Reliability Engineer (SRE)

Company: Jobleads
Location: Santa Ana
Posted on: May 3, 2021

Job Description:

Work Location: Santa Ana , CA Secondary Location: Phoenix, AZ, US; Los Angeles, CA, US; San Diego, CA, US; Portland, OR, US; Seattle, WA, US; Employment Category: Full Time - Regular Join our team! As a global leader in providing title insurance, settlement services and risk solutions for real estate transactions, First American (NYSE: FAF) is an ideal place to build your career. We have been entrusted with helping our customers achieve and protect their dream of homeownership since 1889. We believe that our people are the key to the company's continued success, and we invest in diverse talents and backgrounds and empower our teams to achieve more than they could anywhere else. First American has created an award-winning culture and has been named to the Fortune 100 Best Companies to Work For list for the sixth consecutive year and to more than 50 regional Best Places to Work lists. For more information, please visit Job Summary Work from Home! ***This role can be remote, with preference to West Coast states / Pacific time zone*** The team is on an exciting transformation journey and moving from the classic support model to the site reliability engineering model. We are looking for a person who has prior Site Reliability Engineering (SRE ) experience on the on-premise and cloud technologies. We are in search of an engineer who has lived and thrived during the transformation journey by not just following the work directed, but by the virtue of thinking out-of-the-box and working on developing monitoring applications and taking up deep dives into issues. We prefer candidates who have prior experience in supporting applications in the AWS cloud. Essential Functions Efficiently handle live production incidents, debug/troubleshoot application and infrastructure issues, follow and implement SRE best practices. Monitor and analyze production data/telemetry to pinpoint performance issues or errors, and best ensure production stability Provide frequent and clear communication to application stakeholders during outages and issues. Understand application code, deep dive, and conduct root cause analysis of application problems to prevent future occurrence. Recommend architecture to increase resiliency in application, with special focus on 3rd party dependencies to prevent cascade failures. Further collaborate with development and operations team to ensure availability and reliability of the application and infrastructure. Serve as an escalation point for other Systems Administrators, Engineers, and other technology teams in the resolution of server and system problems. Fine tune existing tools develop new custom internal tools for application monitoring and automate. Maintain effective knowledge base and runbooks to bring faster resolution to production issues. Provide weekend on-call rotation for production support. Constantly update personal technical and business knowledge and skills and mentor others to increase the knowledge and skills of the team. Provide stellar organizational support, customer support, and self-manage project initiatives. Requirements Bachelor's degree in computer science or equivalent combination of education and experience. 4+ years of hands-on experience in application and technical support role in live production environment following Development, DevOps, and SRE best practices. Experience developing and supporting web-based applications, webservices, and database driven applications using Microsoft Technologies C#, .Net 4.5, Azure DevOps, & SQL Server 2016. Deep experience in monitoring and analyzing infrastructure performance using standard performance monitoring tools - Perfmon, PerfView, ProcDump, DebugDiag, etc. Extensive experience with configuring and monitoring via tools such as Cloudwatch, AppDynamics, ELK, Splunk, AppDynamics, Microsoft SCOM, Windows Processes, etc. PREFERRED SKILLS Experience in API management tools like Mulesoft, Mashery, Akana a plus. Experience with automation using PowerShell, Python scripting or similar technologies a plus. Experience with AWS cloud products and services preferred #LI-JD1 First American invests in its employees' development and well-being, empowers them to provide superior customer service and encourages them to serve the communities where they live and work. First American is committed to diversity and inclusion. We are an equal opportunity employer. Based on eligibility, First American offers a comprehensive benefits package including medical, dental, vision, 401k, PTO/paid sick leave and other great benefits like an employee stock purchase plan. Returning users, access your profile or edit/update your resume. First American Title Insurance Company makes no express or implied warranty respecting the information presented and assumes no responsibility for errors or omissions. First American, the eagle logo, , and First American Title are registered trademarks or trademarks of First American Financial Corporation and/or its affiliates.

Keywords: Jobleads, Santa Ana , Sr Site Reliability Engineer (SRE), Other , Santa Ana, California

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category

Log In or Create An Account

Get the latest California jobs by following @recnetCA on Twitter!

Santa Ana RSS job feeds