UWM

Observability Engineer I

Business Unit
United Wholesale Mortgage
Location
US-MI-Pontiac

 

The Observability Engineer plays a critical role in making the internal state infrastructure and services visible to stakeholders for troubleshooting, performance analysis, capacity planning, and reporting. An Observability engineer develops platforms and tooling to enable developers and operators to efficiently trace performance problems to their source, and to map their application performance to business objective.

 

Attention to detail is important as identifying trends in our monitoring tools and reporting allows the team to remain proactive. This position requires an uptime-focused, technical team member. This team member must have a good understanding of the relationships between infrastructure and application, and the ability to analyze technical issues in all layers of our environment.

WHAT YOU WILL BE DOING

  • Designing and Implementing Observability Solutions:

    • Architecting Observability Pipelines: Design and implement robust pipelines for collecting, processing, and analyzing telemetry data (metrics, logs, traces) from various sources within the IT ecosystem.
    • Instrumenting Applications and Infrastructure: Ensure proper instrumentation of applications and infrastructure components (servers, networks, databases, cloud services) to collect relevant data using agents and APIs where appropriate.
    • Integrating Tools: Set up Dynatrace for deep application performance monitoring, user experience monitoring, and full-stack observability. Configure SolarWinds for robust network and infrastructure monitoring, traffic analysis, and configuration management.
    • Managing Logs: Utilize Cribl and Dynatrace for efficient log management, including the collection, processing, and analysis of log data to enhance observability and troubleshooting capabilities.

    Monitoring and Analyzing System Performance:

    • Real-time Monitoring: Continuously monitor the health and performance of applications and infrastructure components by creating real-time dashboards and visualizations within both Dynatrace and SolarWinds. Empower other team members to interpret/create similar visualizations.
    • Performance Analysis: Analyze performance metrics like response times, error rates, resource utilization, and network traffic patterns to identify bottlenecks and areas for improvement.
    • Troubleshooting and Root Cause Analysis: Utilize Dynatrace's AI-powered root cause analysis to pinpoint the source of application-related problems. Leverage SolarWinds' advanced network diagnostics to troubleshoot network issues and identify their root causes.  
    • Proactive Monitoring: Provide proactive monitoring data to cross-functional teams and assist in the interpretation of the data to resolve issues in the environment.

    Alerting and Incident Management:

    • Configuring Alerts: Set up customizable alerts and notifications in primarily Dynatrace and SolarWinds to detect anomalies and performance threshold breaches, ensuring timely awareness of potential issues.
    • Automating Alerting and Remediation: Strive to automate alerting and remediation processes to reduce the mean time to resolution (MTTR) and improve system uptime.
    • Supporting Incident Management: Collaborate with development and operations teams to support incident management, ensuring efficient handling of IT incidents and minimizing impact on users.

WHAT WE NEED FROM YOU

Must Have Qualifications:

  • Experience: Proven experience in designing and implementing observability solutions, including telemetry data pipelines, application and infrastructure instrumentation.
  • Tool Proficiency: Strong proficiency with Dynatrace (or equivalent) for application performance monitoring and log management.
  • Monitoring and Analysis Skills: Expertise in real-time monitoring, performance analysis, and root cause analysis using tools like Dynatrace and SolarWinds.
  • Technical Knowledge: Solid understanding of IT infrastructure components such as servers, networks, databases, and cloud services.
  • Collaboration Skills: Ability to work effectively with cross-functional teams to provide proactive monitoring data and assist in issue resolution.
  • Problem-Solving Skills: Strong analytical and troubleshooting skills to identify and resolve performance bottlenecks and system issues.
  • Support: Ability to provide on-call support on a rotating basis, reaching out to other teams as necessary

Nice to Have Qualifications:

  • Certifications: Relevant certifications in Dynatrace, SolarWinds, Cribl or other observability tools.
  • Scripting and Automation: Knowledge of scripting languages (e.g., Javascript, C#) and experience in automation frameworks.
  • Technology Experience: Experience with cloud platforms such as AWS, Azure, or Google Cloud, and familiarity with the .NET framework.
  • DevOps Practices: Familiarity with DevOps practices and tools, including CI/CD pipelines.
  • Alerting and Incident Management: Experience in configuring alerts, automating alerting and remediation processes, and supporting incident management.

THE PLACE & THE PERKS

Ready to join thousands of talented team members who are making the dream of home ownership possible for more Americans? It’s all happening on UWM’s campus, where our award-winning workplace packs plenty of perks and amenities that keep the atmosphere buzzing with energy and excitement.

 

It’s no wonder that out of our six pillars, People Are Our Greatest Asset is number one. It’s at the very heart of how we treat each other, our clients and our community. Whether it’s providing elite client service or continuously striving to improve, our pillars provide a pathway to a more successful personal and professional life.

 

From the team member that holds a door open to the one that helps guide your career, you’ll feel the encouragement and support on day one. No matter your race, creed, gender, age, sexual orientation and ethnicity, you’ll be welcomed here. Accepted here. And empowered to Be You Here.

 

More reasons you’ll love working here include:

  • Paid Time Off (PTO) after just 30 days
  • Additional parental and maternity leave benefits after 12 months
  • Adoption reimbursement program
  • Paid volunteer hours
  • Paid training and career development
  • Medical, dental, vision and life insurance
  • 401k with employer match
  • Mortgage discount and area business discounts
  • Free membership to our large, state-of-the-art fitness center, including exercise classes such as yoga and Zumba, various sports leagues and a full-size basketball court
  • Wellness area, including an in-house primary-care physician’s office, full-time massage therapist and hair salon 
  • Gourmet cafeteria featuring homemade breakfast and lunch
  • Convenience store featuring healthy grab-and-go snacks
  • In-house Starbucks and Dunkin
  • Indoor/outdoor café with Wi-Fi

DISCLAIMER

All the above duties and responsibilities are essential job functions subject to reasonable accommodation and change. All job requirements listed indicate the minimum level of knowledge, skills and/or ability deemed necessary to perform the job proficiently. Team members may be required to perform other or different job-related duties as requested by their team lead, subject to reasonable accommodation. This document does not create an employment contract, implied or otherwise. Employment with UWM is "at-will." UWM is an Equal Opportunity Employer. By selecting “Apply for this job online” you provide consent to UWM to record phone call conversations between you and UWM to be used for quality control purposes.

Options

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed

Connect With UWM!

Not sure what to apply for? Connect with us to speak with a Recruiter and explore opportunities!