Lead the architecture and design of large-scale distributed systems for Alluxio, focusing on scalability, performance, and contributing to open-source projects.
As a Sr. Staff Software Engineer – Distributed Systems at Alluxio, you will lead the end-to-end architecture and technical evolution of our next-generation distributed data platform. You will drive system-level design decisions that enable Alluxio to scale to thousands of nodes and exabytes of data, while maintaining performance, reliability, and simplicity for users.
In this role, you will operate as a technical architect and hands-on engineering leader, partnering closely with engineering teams and product management to translate complex requirements into scalable distributed system designs.
What You Will Be Able To Build and Owner:
1.Lead the end-to-end architecture and design of large-scale distributed systems powering the Alluxio platform.
2.Drive technical strategy and architectural direction across multiple teams and components.
3.Design systems that support high scalability, fault tolerance, performance optimization, and data durability.
4.Provide hands-on development and deep technical guidance in critical areas of the system.
5.Lead complex system design reviews and mentor senior engineers on distributed systems design.
6.Identify and resolve system-level performance bottlenecks and reliability challenges.
7.Collaborate with product management and engineering leadership to translate product goals into technical solutions.
8.Influence the broader technical ecosystem through open-source contributions and architectural thought leadership.
Who We Are Looking For:
1.Master or BS degree in Computer Science or related technical field, or equivalent practical experience.
2.Proven experience of 2+ years in a technical leadership or architect role, driving system-level design and guiding engineering teams.
3.Strong hands-on software development experience in one or more general-purpose programming languages, including but not limited to Java, C/C++, or Go.
4.Deep architecting expertise in at least two of the following areas:
1)Distributed and parallel systems
2)Distributed storage systems
3)Architecting large-scale software systems
5.Demonstrated ability to design and implement high-quality, stable, and scalable end-to-end system architectures in production environments.
6.Strong analytical thinking and complex problem-solving skills.
7.Excellent communication skills and ability to influence technical direction across teams.
We Would Especially Appreciate If You Have:
1.PhD in Computer Science, Distributed Systems, or related fields.
2.Deep understanding of consensus algorithms, storage engines, or large-scale data systems.
3.Experience building or operating cloud-native infrastructure platforms.
4.Experience contributing to or maintaining open-source distributed systems projects.
5.Track record of designing systems that operate at massive scale (thousands of nodes or higher).
6.Passion for building high-performance infrastructure software.
7.Contributions to Alluxio open-source community.