JerseyCityRecruiter Since 2001
the smart solution for Jersey City jobs

Senior Site Reliability Engineer (SRE) - Platform Services

Company: JPMorgan Chase & Co.
Location: Jersey City
Posted on: May 31, 2021

Job Description:

As part of the Platforms Services Site Reliability Engineer (SRE) team you will be charged with developing and building solutions to support our internal DevOps teams. Key responsibilities of the SRE team are:

  • Develop best in class monitoring and observability frameworks to accomplish end to end flow monitoring and noiseless alerting with proper telemetry
  • Provide oversight for the creation and maintenance of Service Level Objectives, root cause analysis, stakeholder management and communication
  • Identify key customer user journeys, agree on availability calculations and ensure the accuracy via automation
  • Facilitate the definition of SLOs and deliver solutions for efficient, sustainable achievements of them
  • Drive a customer focused, team-based culture that is agile and proactive
  • Learn, integrate and evangelize industry best practices, patterns for troubleshooting and blameless post-mortems
  • Partner with development teams throughout the life cycle for building reliable system from the start
  • Provide guidelines/patterns and establishes proper metrics for building highly scalable, reliable, high performing systems
  • Be a participant and an escalation point in rotational support
  • Coach and mentor teams on Software Development, Agile/DevOps & SRE Practices

Requirements & Qualifications

This role requires a wide variety of strengths and capabilities, including:

  • BS/BA degree or equivalent experience
  • Expert practitioner in multiple technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm
  • Experience with one or more cloud platforms like Cloud Foundry, Mesosphere, Kubernetes, AWS, GCP, Azure
  • Expert knowledge in one or more of the infrastructure components. (E.g. Routers, load balancers, cloud products, container systems, compute, storage and networks)
  • Understanding of Network and Cloud connectivity technologies, i.e. Security, Load Balancing, and Network Routing Protocols
  • Hands-on experience with cloud deployment, monitoring, and ops analysis tools such as Prometheus, Elasticsearch, Grafana, Kibana, Splunk, DynaTrace, etc.
  • Experience in building event monitoring solutions using tools like FluentD, Kafka, etc.
  • Expert in multiple technology stacks with designing, coding, testing, delivering software
  • Software development experience in one or more general purpose programming languages: Python, Java, C, C++, Go, AngularJS
  • Experience with developing frameworks that helps increasing developer and release velocity, improving code health and technical standards
  • Provide governance around adoption, and influence software engineering teams on roadmaps and designs
  • Strong understanding of business processes and how they map to the technology stack (application layer down to physical layer)
  • Strong customer focus and ability to build teamwork and partnership across stakeholder teams
  • Considered as leader in the firm/LOB for SRE practices
  • Understanding of agile and lean philosophies and proficient in Continuous Integration and Continuous Delivery

Keywords: JPMorgan Chase & Co., Jersey City , Senior Site Reliability Engineer (SRE) - Platform Services, Other , Jersey City, New Jersey

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category

Log In or Create An Account

Get the latest New Jersey jobs by following @recnetNJ on Twitter!

Jersey City RSS job feeds