Own the reliability of our cloud infrastructure, design proactive solutions to prevent incidents, and build automation that enhances stability and performance of our systems.
Lucidya is an AI-native platform for customer experience (CX) intelligence that manages entire customer lifecycles autonomously, from initial engagement through retention and growth.
Unlike platforms that only surface insights and leave the action to you, Lucidya closes the loop with proprietary NLU technology built in-house and trained on millions of multilingual conversations. This enables marketing, support, CX, and research teams to deliver personalized experiences that drive measurable improvements in customer satisfaction, retention, and lifetime value.
As we continue scaling globally, the reliability, performance, and resilience of our infrastructure become mission-critical to everything we do.
Why this role matters
At Lucidya, our platform processes massive volumes of real-time customer data. Any downtime, latency, or instability directly impacts our customers’ ability to make decisions and serve their own users.
This role exists to make sure that doesn’t happen.
As a Site Reliability Engineer, you’ll sit at the heart of our platform’s stability, owning the reliability of our cloud infrastructure and ensuring it scales seamlessly as we grow. You won’t just react to issues; you’ll anticipate them, design systems that prevent them, and build automation that removes them entirely.
If you enjoy solving complex infrastructure challenges, eliminating inefficiencies, and building systems that “just work” - this is where you’ll thrive.
You’ll be responsible for outcomes, not just tasks. Here’s what success looks like in this role:
You’ll make reliability the default
You’ll own and optimize our cloud environments
You’ll run and improve Kubernetes in production
You’ll build strong observability and respond to incidents
You’ll automate everything that shouldn’t be manual
You’ll collaborate to improve the entire system
First 30 days:
By 90 days:
Requirements
This is what will make you successful in this role:
Technically, you likely:
When it comes to monitoring:
What sets you apart:
Nice to Have (but not required)
What the hiring process will look like
Lucidya is an AI-native platform that transforms how brands manage customer experiences across their entire lifecycle. By leveraging advanced Machine Learning and proprietary NLU capabilities, we empower businesses to engage with customers personally and effectively, driving meaningful improvements in satisfaction and retention at scale.
Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!
Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.
Site Reliability Engineer Q&A's