NVIDIA is looking for a Senior Software Engineer to join NSV tools (Network Solutions Validation) group.
As a senior team member, you will be part of a development effort of high-performing software automation systems for NVIDIA's Data Center environments.
You will interact with NIC, OS, Switch, HCA, CPU and GPU compute as well as architects, network engineers, and developers.
We drive the data growth of the world’s biggest companies.
With talented engineers around the globe, the work environment is dynamic, meaningful, and fast-paced.
Are you ready for the challenge? What you’ll be doing: Design and develop an automation platform used to provision, configure, and monitor HPC data centers.
Implement scalable, reliable, and maintainable services that enhance cluster visibility and improve operational efficiency.
Improve stability and performance across the provisioning pipeline through architectural enhancements and code optimizations.
Troubleshoot issues in distributed environments and contribute to system observability and reliability improvements.
Work cross-functionally with architects, DevOps engineers, product managers and stakeholders to ensure high-quality releases.
Participate in code reviews, technical design discussions, and continuous improvement activities within the team.
What we need to see: B.
in Computer Science, Engineering, or a related field (or equivalent practical experience).
5+ years of strong hands-on experience on Linux-based platforms.
Proficient scripting and automation.
Background in DevOps and Network Engineering practices.
Hands-on experience with large-scale network architectures, switches/routers, OVS, SR-IOV, and network operating/management systems.
Networking expertise: Ethernet, VLANs, TCP/UDP/IP, QoS, L2/L3 protocols, BGP, EVPN/VXLAN, and common network topologies.