Astera Labs is a global leader in purpose-built connectivity solutions that unlock the full potential of AI and cloud infrastructure. Our Intelligent Connectivity Platform integrates PCIe®, CXL®, and Ethernet semiconductor-based solutions and the COSMOS software suite of system management and optimization tools to deliver a software-defined architecture that is both scalable and customizable. Inspired by trusted relationships with hyperscalers and the data center ecosystem, we are an innovation leader delivering products that are flexible and interoperable. Discover how we are transforming modern data-driven applications at www.asteralabs.com.
As a Principal Cloud Infrastructure Engineer with a focus on AWS cloud technologies, you will play a pivotal role in driving our organization’s transformation to a cloud-based high-performance computing (HPC) platform using AWS. Your responsibilities will include implementing and maintaining the architecture of a highly available, high-performance computing platform that supports our ASIC and hardware engineers. You will work closely with the Engineering, DevOps, and IT Teams to develop and deploy critical infrastructure while enhancing automation and security. This role will be instrumental in executing the cloud strategy of the company with the ability to implement cutting-edge platforms supporting our internal business units and customers.
Based in Santa Clara, this position requires an in-person presence, offering a unique opportunity to impact our global operations directly.
Basic Qualifications
- Strong academic and technical background in information technology, preferably in computer/electrical engineering. A Bachelor’s degree is required, and a Master’s is preferred.
- ≥8 years’ related experience as an infrastructure, operations, DevOps, SRE, and/or security engineer.
- Hands-on experience working with AWS services and tools such as EC2, S3, VPC, Route 53, IAM, CloudTrail, CloudWatch, Security Hub, Guard Duty, Inspector, Shield, WAF, KMS, HSM, etc.
- Ability to choose appropriate AWS services based on specific use cases and requirements.
- Strong understanding of cloud security frameworks, standards, and best practices.
- Understanding of virtual networks and network architecture in AWS including VPC (Virtual Private Cloud) configuration, subnets, security groups, NACL, NAT Gateway and routing tables.
- Experience with AWS CloudFormation or other infrastructure-as-code tools.
- Ability to set up monitoring and logging.
- Understanding of backup and recovery mechanisms using AWS services.
- Should know the software development life cycle and be familiar with DevOps working practices and tooling including Terraform/CloudFormation, Jenkins, Bitbucket, JIRA, and Ansible.
- Collaborate with hardware, ASIC, software engineers, and other stakeholders.
- AWS Certification(s) such as Solutions Architect Pro, DevOps Engineer Pro, SysOps Admin, and Security
- Professional attitude with the ability to prioritize a dynamic list of multiple tasks and to work with minimal guidance and Entrepreneurial, open-mind behavior and can-do attitude
- Authorized to work in the US
Required Experience
- Strong experience in design, implementation, and deployment of secure cloud systems to meet business needs.
- Design secure network architectures, including firewalls, intrusion detection/prevention systems, VPNs, and other security technologies.
- System administration of high-performance infrastructure like NFS file system, license servers, AD/LDAP servers, proxy servers, and dynamic compute nodes
- Automate ways to keep our high-performance computing grid functioning smoothly by monitoring queues, nodes, services, and infrastructure for errors, latency issues, and traffic problems.
- Proactively fix known issues and prevent future downtime, and maintain communication between grid, scheduler, and teams.
- Ability to work with external partners, and with Cloud and Datacenter services, and manage SSL Certificate, Secure File Transfer, VPC networking, Security Groups/Firewall, and DNS.
- Experience with scripting languages and using them for semiconductor EDA automation (primarily Python, Bash and TCL, Java/C/C++ a plus)
- Develop, implement, and manage security measures and controls to protect cloud-based systems and infrastructure.
- Design and maintain system backup solutions for system recovery and disaster recovery
- Provide technical support to ensure the reliable operation of cloud production
- Experience in collaborating with cross-functional teams.
- Diagnose and troubleshoot complex issues related to cloud computing systems.
- Developing cloud native CI/CD workflows using Jenkins and Atlassian tools
Preferred Experience
- Proven experience in the semiconductor industry or related fields.
- Strong understanding of high-performance computing (HPC), parallel processing, and distributed systems.
- Familiarity with CAD systems and their integration into cloud environments.
- Experience with ASIC design, simulation, and verification workflows.
- Managing user job submissions over Slurm, PBS Pro, or similar job schedulers
- Deploying scalable EDA workloads in AWS
- Installing and managing EDA tools from companies like Synopsys, Cadence, Ansys, etc.
The base salary range is $160,000 - $240,000. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
We know that creativity and innovation happen more often when teams include diverse ideas, backgrounds, and experiences, and we actively encourage everyone with relevant experience to apply, including people of color, LGBTQ+ and non-binary people, veterans, parents, and individuals with disabilities.