Rackspace Senior Manager, Site Reliability Engineering in San Antonio, Texas
Overview & Responsibilities
As the Senior Manager for TES Site Reliability Engineering, you’ll lead two teams comprised of Systems and Network Engineers that build solutions to enhance availability, performance and stability of our internal platforms for teams across Rackspace. You'll define processes and lead your team to respond effectively to alerts, tickets, calls and in addition manage ongoing project work. Your team will be working in both production and non-production environments focusing on the SRE core tenants. The best person for this role is someone who has strong leadership experience, understands in depth engineering and networking concepts and has a very collaborative spirit. Your able to manage a project from vision to implementation with little to no guidance. You take a proactive approach to resolve issues before they even exist, ensuring that your internal customers and partners can do their job without issue. You love partnering with developers, engineers and operations teams to drive solutions for your customers.
In this role you will:
Work with an awesomely talented passionate group of Rackers
Create and meet roadmap deliverables for designing and building an internal Platform as a Service solution to support internal Rackspace development and application teams using technologies such as containers, Kubernetes, application pods, VMware, etc.
Drive the vision for a state of the art multi cloud platform solution and provide monthly costs and investment ROI data
Support internal teams on multiple levels that will utilize your platforms including VMware, AWS and OpenShift.
Drive your teams to identify opportunities to automate infrastructure and application deployment processes for internal Rackspace developers
Drive whiteboarding sessions to lead the team in architecting and developing full stack solutions, from whiteboard to green SLA’s
Own end-to-end availability and performance of mission critical services and plan / prioritize building automation to prevent problem recurrence; automate response to all non-exceptional service conditions.
Keep in close contact with your customers and partners to ensure your roadmap objectives provide solutions, align to the business objectives and most of all are improving their abilities to provide custom tools for our customers.
Educate on best practices in terms of redundant architecture and application deployment workflows
Lead by example, care for your team and establish credibility with the quality of your and your team's technical execution.
Work closely with your peers that oversee other technical teams ensuring that cross training and knowledge sharing is happening to eliminate single point of failures and silo’s
Manage employees around the globe including on-call rotations, running incidents, 1:1’s, team meetings, quarterly review, etc.
BA/BS degree in Computer Science or related technical field, or equivalent practical experience
7-10 years Technical leadership experience. Includes understanding of SDLC and systems infrastructure principles and how they interrelate
Strong driving and collaboration/coordination skills. Experience facilitating across large diverse cross-functional teams
Strong facilitative leadership skills; able to effectively sell your ideas and convince others to follow based on persuasion rather than authority
Strong analytical skills to understand issues and work collaboratively to identify root cause
Effective communication & liaising across a wide range of audiences from engineers to executives
Proven track record of managing large, complex, multidisciplinary programs
Strong organizational skills, planning, and attention to detail are also required
Internally motivated, self-starter with ability to plan, organize and establish priorities to meet goals and achieve results
Must work well under pressure, balancing multiple priorities and objectives. Handles conflict well
Demonstrated leadership working in a broad cross-functional environment
Experience working with SaaS/cloud applications and enterprise technology Preferred qualifications
Hands-on technical experience combined with strong management and communication skills
Capable of technical deep-dives into code, deployment architecture, networking, operating systems and storage
Demonstrated expertise in recruiting and managing a team of bright, experienced engineers/project managers/analysts on large scale projects
Expertise in problem solving and analyzing global scale distributed systems Key Skills/Competencies
Complete Ownership and Accountability Mindset
Experience in running complex large scale distributed systems
Passionate about uptime and resiliency and operational excellence
Thought leadership, Problem Solving and creative thinker
Ability to make smart trade-offs, say no when relevant
Grow talent/raise the bar, talent optimization, lead strong engineers/personalities
Communication, ability to run effective interference/framing with Directors and above
Req # 40065
Category Leadership, Networking, System Administration / Engineering