Principal Network Engineer

TLDR

Own the design and implementation of front-end network architecture for large-scale AI and GPU-accelerated infrastructure, ensuring platform scalability and performance.

Our mission at Tensorwave Cloud is to build seamless, secure, reliable, and resilient AI infrastructure at scale, eliminating barriers and challenging the status quo to empower builders and support AI innovation.

About the role

We are seeking a Network Principal to own the front-end network architecture for large-scale AI and GPU-accelerated infrastructure. This role is responsible for the design, implementation, and evolution of the customer-facing, service, and control-plane networks that interface with large GPU and storage systems.

This role operates as a peer to the Back End Network Principal, with shared responsibility for end-to-end platform scalability, reliability, and performance.

Responsibilities

  • Own front-end network architecture: DCI, edge, ingress/egress, and control-plane networks

  • Architect and operate large edge and service networks

  • Design scalable Ethernet architectures

  • Define routing, segmentation, and isolation strategies

  • Lead hands-on deployment, validation, and troubleshooting in new data centers

  • Define and maintain reference architectures, standards, and long-term growth models

  • Own relationships with network carriers and service providers

  • Collaborate with the platform team to design and deliver network solutions for Kubernetes-centric use cases

  • Partner closely with the Back End Network Principal to define clean interface boundaries between front-end and RDMA back-end fabrics

Required Experience

  • Bachelor of Science in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience

  • 10+ years of data center networking experience

  • Proven experience with very large Ethernet fabrics and large-scale edge networks

  • Strong hands-on experience with BGP, traffic engineering, and high-availability designs

  • Experience with 100G–400G+ Ethernet environments

  • Familiarity with optical standards and transceiver types (e.g., 100G/400G/800G, SR/LR/ER, DWDM)

  • Demonstrated experience working with carriers and service providers to deliver production connectivity

  • Multi-vendor experience: Juniper, Cisco, Arista, whitebox

  • NOS experience: Junos, IOS/IOS-XE, NX-OS, EOS, SONiC

Preferred Experience

  • Automation or scripting experience in Python, Go, Bash, or equivalent

  • AI platform or GPU cluster environments

  • Multi-tenant or customer-facing platforms

  • Strong familiarity with Kubernetes networking concepts

  • Exposure to network automation and programmability

  • 100G+ environments

  • AI, GPU, or HPC exposure

What We Bring

  • Mission-driven company

  • Competitive Salary

  • Stock Options

  • 100% paid Medical, Dental, and Vision insurance

  • Flexible PTO

  • Paid Holidays

  • 401(k)

  • Paid Parental Leave

  • Flexible Spending Account

  • Short-Term Disability Insurance

  • Life and Voluntary Supplemental Insurance

  • Mental Health Benefits through Spring Health

We’re looking for resilient, adaptable people to join our team: people who believe in the mission and think at massive scale. The solutions that worked on a handful of devices will not work at exascale. Be prepared to be pushed daily, to learn a lot, and to literally build the future.

Tensorwave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, national origin, or veteran status.

TensorWave delivers a high-performance cloud computing platform that leverages AMD Instinct™ GPUs to supercharge AI research and advanced workloads. Tailored for developers and researchers in the AI space, our platform removes infrastructure hurdles, enabling innovators to focus on pushing the boundaries of technology.
