SantaAnaRecruiter Since 2001
the smart solution for Santa Ana jobs

BTS Site Reliability Engineer - Open to Remote!

Company: First American Financial Corporation
Location: Santa Ana
Posted on: May 3, 2021

Job Description:

CompanySummaryJoin our team! As a global leader in providing title insurance, settlement services and risk solutions for real estate transactions, First American (NYSE: FAF) is an ideal place to build your career. We have been entrusted with helping our customers achieve and protect their dream of homeownership since 1889. We believe that our people are the key to the company's continued success, and we invest in diverse talents and backgrounds and empower our teams to achieve more than they could anywhere else. First American has created an award-winning culture and has been named to the Fortune 100 Best Companies to Work For list for the sixth consecutive year and to more than 50 regional Best Places to Work lists. For more information, please visit JobSummaryWe are well known for our excellence in Title Insurance and Escrow Services. We are less known for our excellence in technology and we're on a mission to change that. We're continually investing in our future, how we work and what we build. Be part of a transformative team that will shape the future as we envision, design and develop our next generation solutions. We're looking for an experienced Site Reliability Engineer to help us migrate from an old technology platform. While doing so we want to envision the best possible future and solve the problems that matter most. Join us if you are driven by curiosity and have passion for creating the future.*Remote candidates encouraged to apply!The Site Reliability Engineer applies strong engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges. The role is responsible for the availability, reliability, integrity, and efficient operation of critical platform services and applications, ensuring they meet the requirements of internal and external users. This is achieved by monitoring, maintaining, automating and developing solutions that focus on uninterrupted delivery of applications throughout the software lifecycle.Essential FunctionsMeasure and monitor availability and overall system and environment health.Build and implement monitoring and recovery tools to provide optimum delivery and resilience.Implement orchestration and tooling solutions to ensure that repetitive administration tasks are performed at a high level of efficiency and free of defect.Provide primary operational support and engineering for multiple large distributed software applicationsBuilds permanent solutions to mitigate common system failures and bugs.Partners with development and engineering teams to automate and optimize service availability, scalability, performance, monitoring and alerting. May perform complex technical tasks with guidance from senior technical personnel. Provides standard problem identification and resolution support services.Uses infrastructure management tools to perform performance monitoring and capacity planning functions.Contributes to the design, development and implementation of departmental workflow processes and procedures.Required to provide on-call support during off-duty hours on weekdays, weekends and holidays on a scheduled/rotating basis.Required to perform duties outside of normal work hours based on business needs.Knowledge and Skills/Technology UsedCloud Platform:Good working knowledge of cloud services and architecture. (AWS, Azure)Distributed Systems. (Architectures, micro-services, high availability)Good working knowledge of distributed message bus.Good working knowledge of container computing. (Docker, Kubernetes, Service Mesh)Build and configure Azure, AWS services. (LAMBDA, Azure Functions)Proxies and Load Balancing. (Nginx, HAProxu, Envoy)Monitoring and Tools:Knowledge of ServiceNow integrations.Expertise with log event aggregation, metric collection and application monitoring and event handling. (Elastic, SCOM, AppD, Uptrends, AppInsights, Cloudwatch)Basic working knowledge of Windows and UNIX/Linux technologies.Basic working knowledge network triaging, packet loss and routing. Understands Service Level Objectives (SLO), Service Level Indicators (SLI), Error Budgeting and Burn Rates.Development:Applies "everything as code" methodologies across configuration, infrastructure and orchestration. Experience with programming languages. (.Net, C#, C++, Python)Experience with continuous integration tools. (Chef, Ansible, Jenkins, Stash/Git)Experience with configuration management tools. (Puppet, Hiera, Terraform, Terragrunt, Asnsible)Knowledge of scripting languages or other tools to enable workflow automation.Related:Partners with development or engineering teams to automate and optimize service troduce and test new technologies, tools and systems that enable fast and safe code deployment.Administrative/Interpersonal Skills (All):Good organization skills to balance and prioritize work assignments.Good verbal and written communication skills.Ability to work as a member of a multi-cultural, multi-location, team.Typical Education Generally requires BS Degree or equivalent work experienceTypical Range of Experience Employees entering this job typically have 5+ years hands-on technical experience in system support and application developmentLicense or CertificationDesired Certifications:Cloud foundation (Azure, AWS, GCP, OCI)Infrastructure Management ToolsServer and storage technologiesNetworking#techreferralFirst American invests in its employees' development and well-being, empowers them to provide superior customer service and encourages them to serve the communities where they live and work. First American is committed to diversity and inclusion. We are an equal opportunity employer.Based on eligibility, First American offers a comprehensive benefits package including medical, dental, vision, 401k, PTO/paid sick leave and other great benefits like an employee stock purchase plan.Job InfoType:Full timeLocation: USA, California, Santa Ana, USA, California, Los Angeles, USA, Washington, Seattle, USA, Utah, Salt Lake City, USA, Nevada, Las Vegas, USA, California, San Francisco, USA, Oregon, Portland, USA, California, San Diego, USA, Idaho, Boise, USA, Arizona, Phoenix, USA, Texas, Irving, USA, California, Irvine, USA, Colorado, Denver, USA, California, SacramentoSDL2017

Keywords: First American Financial Corporation, Santa Ana , BTS Site Reliability Engineer - Open to Remote!, Other , Santa Ana, California

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category
within


Log In or Create An Account

Get the latest California jobs by following @recnetCA on Twitter!

Santa Ana RSS job feeds