Senior Software Engineer, Machine Learning, Infrastructure - Google
Sunnyvale, CA
About the Job
Minimum qualifications:
- Bachelor’s degree or equivalent practical experience.
- 5 years of experience with software development in one or more programming languages, and with data structures/algorithms.
- 3 years of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.
- 3 years of experience with developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage or hardware architecture.
Preferred qualifications:
- Master's degree or PhD in Computer Science, or in a related technical field.
- 1 year of experience in a technical leadership role.
- Experience developing accessible technologies.
- Experience in Graphics Processing Units (GPU), Tensor Processing Units (TPU) or other hardware accelerators.
- Experience authoring technical documentation, developer facing content, Application Programming Interfaces (API), networking protocols, or network architecture.
About the job
Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
This team in the Network Infrastructure Group is responsible for delivering Google’s network capacity: turning up and upgrading our intra-cluster and inter-cluster networks, including specialized networking for Machine Learning (ML)-specific hardware like Tensor Processing Units (TPU) and Graphics Processing Units (GPU).In this role, you will be responsible for building models of Google's network, specifically, allowing clients to plan and initiate changes to Google's network. These models are used for all network operations including: ordering networking components, building and installing actual hardware, connecting it to the production network, tuning efficiency in real-time, monitoring and more.
Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
Responsibilities
- Write and test product or system development code.
- Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
- Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
- Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
- Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.