We are seeking a Senior Network Engineer to join our AI datacenter development team. This role involves testing, validating, and scaling advanced network infrastructure across multi-vendor environments. You will be responsible for replicating real-world AI datacenter topologies, validating performance under stress, and ensuring network solutions deliver uncompromising reliability, scalability, and performance.
Key Responsibilities
Define and execute regression, functional, performance, and scale test suites for network infrastructure
Investigate complex performance and scaling bottlenecks in AI datacenters
Design and build customer-scale testbeds emulating diverse network architectures
Write and debug automation scripts (Python, Bash) to drive traffic generators and manipulate test environments
Analyze large-scale test data to identify root causes of hardware/software issues across multi-vendor platforms
Collaborate with R&D and architecture teams to validate ASIC and micro-architectural behaviors
Support POC efforts, innovation initiatives, and customer use-case reproductions
Generate comprehensive test reports highlighting results, insights, and recommendations
Validate network protocols and hardware-software interactions in AI workloads.
Key Responsibilities
Define and execute regression, functional, performance, and scale test suites for network infrastructure
Investigate complex performance and scaling bottlenecks in AI datacenters
Design and build customer-scale testbeds emulating diverse network architectures
Write and debug automation scripts (Python, Bash) to drive traffic generators and manipulate test environments
Analyze large-scale test data to identify root causes of hardware/software issues across multi-vendor platforms
Collaborate with R&D and architecture teams to validate ASIC and micro-architectural behaviors
Support POC efforts, innovation initiatives, and customer use-case reproductions
Generate comprehensive test reports highlighting results, insights, and recommendations
Validate network protocols and hardware-software interactions in AI workloads.
Requirements:
Required Qualifications
8+ years of experience in system validation, performance testing, or troubleshooting in networking environments
Strong understanding of network protocols (L2/L3) and hardware-software interactions
Hands-on experience with congestion control, collective communication frameworks, and AI-scale workloads
Proficiency in Linux-based systems with strong Python scripting and automation skills
Practical experience with network traffic generators for performance and stress testing
Ability to debug across multi-vendor platforms
Excellent troubleshooting skills, curiosity, and problem-solving mindset
Strong communication and collaboration skills with ownership over end-to-end testing
Preferred Qualifications
Industry certifications (CCNP) or equivalent expertise
Knowledge of AI/ML workloads and distributed training
Experience with high-speed networking (InfiniBand, Ethernet, RDMA)
Familiarity with containerization and orchestration.
Required Qualifications
8+ years of experience in system validation, performance testing, or troubleshooting in networking environments
Strong understanding of network protocols (L2/L3) and hardware-software interactions
Hands-on experience with congestion control, collective communication frameworks, and AI-scale workloads
Proficiency in Linux-based systems with strong Python scripting and automation skills
Practical experience with network traffic generators for performance and stress testing
Ability to debug across multi-vendor platforms
Excellent troubleshooting skills, curiosity, and problem-solving mindset
Strong communication and collaboration skills with ownership over end-to-end testing
Preferred Qualifications
Industry certifications (CCNP) or equivalent expertise
Knowledge of AI/ML workloads and distributed training
Experience with high-speed networking (InfiniBand, Ethernet, RDMA)
Familiarity with containerization and orchestration.
This position is open to all candidates.







