Minimum qualifications:
- Bachelor's degree or equivalent practical experience.
- 8 years of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, JavaScript).
3 years of experience in a technical leadership role; overseeing projects, with 2 years of experience in a people management, supervision/team leadership role.
- Experience with Cloud Services, Telemetry Systems and Incident Response.
- Ability to work non-standard hours in on-call rotation.
Preferred qualifications:
- Master's degree or PhD in Computer Science, or a related technical field.
- Experience in a customer focused role or as a Cloud Customer.
- Ability to work across organizational boundaries.
About the job
Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers have not only the technical expertise to take on and provide technical leadership to major projects, but also manage a team of Engineers. You not only optimize your own code but make sure Engineers are able to optimize theirs. As a Software Engineering Manager you manage your project goals, contribute to product strategy and help develop your team. Teams work all across the company, in areas such as information retrieval, artificial intelligence, natural language processing, distributed computing, large-scale system design, networking, security, data compression, user interface design; the list goes on and is growing every day. Operating with scale and speed, our exceptional software engineers are just getting started -- and as a manager, you guide the way.
With technical and leadership expertise, you manage engineers across multiple teams and locations, a large product budget and oversee the deployment of large-scale projects across multiple sites internationally.
The Cloud Incident Response team has a mission to create a dependable experience for GCP customers. The team responds to and helps coordinate, mitigate, or resolve major incidents across all of GCP. The core team builds the processes, systems, and tooling necessary to deliver customer focused mitigations to critical GCP incidents. This will involve working on tooling, systems, and processes.
Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
Responsibilities
- Participate in on-call rotation supporting Critical Incident Response.
- Focus on high-quality customer outcomes and continuous collaboration across GCP teams.
- Create training and processes for incident management life-cycle and partner with Cloud SRE UTLs, and Cloud Support leadership team.
- Build systems and tooling to support the team, improve visibility, detection of issues, communications to customers, stakeholders, and customer facing teams.
- Define and escalate risks in Cloud, reduce incident probabilities with strategic and tactical/pragmatic approaches as needed.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also
Google's EEO Policy and
EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our
Accommodations for Applicants form.