Service Operations (SO) is responsible for service support and delivery for JPMorgan Chase's (JPMC) Global Technology Infrastructure. Through its Global Service Desk and Infrastructure Operations Centers, SO provides global, coordinated diagnostic and support services, while its Production Assurance and Support functions leverage and execute industry leading infrastructure management and support processes that are designed to minimize customer outages and impacts.
The Digital Reliability Operations (DRO) Infrastructure developer is responsible for the development and operation of the infrastructure supporting the Digital Channel across the Development, UAT, Performance, DR, and Production environments (24 x7). In this exciting and high energy position, you will have a mission-critical role in managing the infrastructure that powers the chase*.com and mobile consumer and business channels. The Engineer will work closely with Digital Application Development and Application Support teams in all regions to ensure high standards of support are maintained and reliability goals achieved. The DRO Engineers are expected to drive standards and improvement through automation focusing on efficiency, reduction of risk, and ensuring high availability and sustained resiliency through configuration management.
•The DRO Infrastructure Developer is the Primary contact point for Digital infrastructure Development and service activities. The function provides 24x7 coverage
•Deliver Service Improvement Activities across the Digital infrastructure through automation, analytics and API development
•Work as part of an Agile team to deliver milestones and target end products through well defined sprints
•Develop software architecture with scalable and resilient implementation
•Resolving high priority incident tickets, with potential customer impacts and work on tactical & Strategic fixes.
•Environmental oversight and root cause analysis for stability and reliability issues and timely resolution or escalation
•Maintain operational support documentation
•Plan and perform scheduled hygiene and maintenance of the infrastructure
•Ensure environmental testing and High Availability/Sustained Resiliency tests are completed successfully
•Track and report service metrics as defined
•Understands and articulates the impact and importance of the Digital channel across our GSO Partners.
•Displays a sense of urgency around critical issues and presents the correct impact review in terms of business value and impact during technical and non-technical reviews and understands when to escalate issues
•Participate in analysis and design of future services and release management
•At least 9 years experience and a high level of proficiency in Red Hat Enterprise Linux and AIX (preferable in financial services)
•Strong expertise with Python, Perl or shell scripting
•Strong experience in delivering applications using Agile methodology
•Experience with production support of highly available applications
•Focus on customers, ownership, operations and the ability to deliver results
•Position requires active participation in design review, specific to a tailored solution in order to meet a specific client request in the production estate
•Frequently anticipates problems and analyzes ways to mitigate the risk.
•Strong verbal and written communication skills
•Documents small-to medium-scale projects and delivers presentations with minimal supervision.
•Dissects complex situations and refocuses on critical technology tasks.
•Must have a high degree of technical expertise/professional mastery to recommend process improvements
•Is often consulted by peers and seen as the informal leader on tactical problems
•Familiarity with SAN, TCP/IP, DNS , VMWare, HMC, VIO client partitions, NIM, yum, is highly desirable
•Must be able to effectively and efficiently troubleshoot issues with AIX and Red Hat systems to ensure the highest availability and the lowest mean time to restore
•Familiarity with automation tools, such as Autosys or Control-M, CFEngine, Puppet, Ansible is a plus
•Experience in project work, incident management and resolution, change management, patching, release support are a plus
•Spearheads complex programs that span multiple inter-organizational units and interfaces with more experienced management
•Knowledge of system performance monitoring and operational capacity management
•Must be available to be in an on-call rotation
Not ready to apply? Leave your information with us and we will keep you up to date with new career opportunities.
Sign in to our application system to continue your job search.
Current employees sign in here.
You can also apply using your LinkedIn® profile. It may save you some time because your information will be automatically transferred into our system. Just click on the LinkedIn logo when you get to the application screen and follow the directions.
During the application process, be sure you have an up-to-date copy of your Résumé, your cover letter and any other documentation you would like to submit.