Solutions Architect - AI and HPC Cloud
Company: NVIDIA
Location: Santa Clara
Posted on: April 24, 2024
Job Description:
NVIDIA is looking for a Solutions Architect to work in IPP's
(Infrastructure, Planning and Process) Cloud Infrastructure Team.
IPP is a global organization within NVIDIA. This group works with
various other groups within NVIDIA such as Graphics Processors,
Mobile Processors, Deep Learning, Artificial Intelligence and
Driverless Cars to cater to their infrastructure needs. Its cloud
services run almost half a million automated jobs per day on
thousands of servers, supporting the productivity of thousands of
NVIDIA's software engineers worldwide. The cloud hosts a
heterogeneous mix of machines and devices with various operating
systems (Windows/Linux/Android) and a multitude of hardware platforms,
including both NVIDIA GPUs and Tegra processors. Are you passionate about
distributed infrastructure and looking for sophisticated, critical
issues to solve? Are you ready to build the next generation of cloud
services, design creative solutions, and mine through data to uncover
real problems and fix them?
What you'll be doing:
Work with NVIDIA Product Teams to understand new product
requirements including HPC and AI/ML Products.
Find optimal solutions to deploy these products in a datacenter
or a lab environment using sophisticated design techniques,
services and tools.
Assist in roll-out and deployment of new development features aimed
at supporting the latest NVIDIA hardware and technologies.
Work closely with world-class engineers, architects, technical
product managers and application developers setting the best
strategies in place for a product launch.
Define and implement full-scale solutions for product
onboarding into our hosted and private cloud environments.
Solve sophisticated problems involving multi-site deployments of
NVIDIA products.
Collaborate with multi-functional teams, including system
engineering, software engineering, mechanical/thermal engineering,
operations, data center teams, external vendors, and other partners
to successfully deliver a reliable and robust platform from concept
to prototype to deployments.
Directly contribute to the overall quality of deployments and
improve time to market for next-generation products.
Integrate and optimize cluster deployment methods and manage software
stack deployments, including provisioning these services into the
cloud.
What we need to see:
Bachelor's or Master's Degree in Computer Science or Software
Engineering, or equivalent experience.
10+ years of relevant experience.
5+ years of Linux and scripting experience.
Solid background in OS kernels and systems engineering.
A track record of quickly understanding new technologies outside of
your domain expertise and deploying systems in sophisticated
configurations from hardware through multiple layers of software in
a fast-paced environment.
Strong technical skills and understanding of embedded systems,
orchestration & automation systems, data centers and cloud
architecture, as well as excellent communication and planning
skills.
Strong problem-solving ability and experience in product
engineering/failure analysis and debug/HW or test design.
Understanding of dense datacenter design, including compute, storage
and networking.
Ways to stand out from the crowd:
Understanding of software engineering principles and enterprise
system architecture with an automate-and-scale approach.
Experience with compute cluster administration and automation, as
well as with productivity tools and process automation,
is a big plus.
Experience in large-scale QA environments for product
bring-ups.
Special skills in large-scale and cluster computing (MPI) and in
data center design, including high-speed interconnects (InfiniBand),
cluster storage, and scheduling-related design and/or
management experience.
Strong background in Windows & Linux administration.
NVIDIA is leading the way in groundbreaking developments in
Artificial Intelligence, High-Performance Computing and
Visualization. The GPU, our invention, serves as the visual cortex
of modern computers and is at the heart of our products and
services. Our work opens new universes to explore, enables
outstanding creativity and discovery, and powers what were once
science fiction inventions from artificial intelligence to
autonomous cars. NVIDIA is looking for phenomenal people like you
to help us accelerate the next wave of artificial intelligence.
We are widely considered to be one of the technology world's most
desirable employers, and we have some of the most forward-thinking and
hardworking people in the world working for us. If you're creative
and passionate about new technologies, we want you on our team!
The base salary range is 208,000 USD - 391,000 USD. Your base
salary will be determined based on your location, experience, and
the pay of employees in similar positions.
You will also be eligible for equity and benefits
(https://www.nvidia.com/en-us/benefits/). NVIDIA accepts
applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and
proud to be an equal opportunity employer. As we highly value
diversity in our current and future employees, we do not
discriminate (including in our hiring and promotion practices) on
the basis of race, religion, color, national origin, gender, gender
expression, sexual orientation, age, marital status, veteran
status, disability status or any other characteristic protected by
law.