We're looking for a highly skilled Senior Data Engineer to join our team. This critical role involves managing massive volumes of high-velocity, high-cardinality data generated by real-time processing systems, primarily in the Computer Vision domain. The ideal candidate possesses deep expertise in TimescaleDB for efficient storage and querying, proficiency with cloud-based object storage, and familiarity with the specialized NVIDIA ecosystem, including DeepStream, GPUs, and Trident, to ensure data integrity and flow from the edge to the analytics platform.
Responsibilities:
Data Architecture & Storage
Design, build, and optimize high-throughput data pipelines using modern tools to ingest streaming data from various sources into our core data platform.
Serve as the subject matter expert for TimescaleDB, managing schema design, performance tuning, compression policies, and data retention strategies for petabytes of time-series data.
Architect and manage the tiered storage strategy, leveraging cloud-based storage solutions (e.g., S3, GCS) for cold storage and archival of raw and processed data.
Ensure data models are scalable and optimized for both real-time operational queries and large-scale analytical processing.
Real-Time & Computer Vision Integration
Collaborate with ML and Computer Vision teams to integrate the data pipeline with NVIDIA DeepStream applications, managing metadata and telemetry extracted from video streams.
Develop solutions that utilize NVIDIA GPUs effectively, particularly concerning how derived data is ingested and processed immediately after the visual inferencing stage.
Familiarity with NVIDIA Trident storage orchestration is desirable for managing persistent volumes in Kubernetes environments hosting vision applications.
Implement data quality checks and validation processes to ensure the high integrity of timestamps and measurement data from the edge devices.
Engineering Excellence & Collaboration
Apply expert-level proficiency in a major programming language (Python or Scala preferred) for ETL/ELT pipeline development and tooling.
Drive the adoption of best practices, including Infrastructure as Code (IaC) and comprehensive monitoring (e.g., Prometheus/Grafana) for the data platform components.
Provide technical guidance and mentorship to junior team members, fostering a culture of high performance and technical rigor.
Requirements:
Minimum of 5 years of professional experience in Data Engineering, focusing on high-volume data platforms or distributed systems.
Expert proficiency with TimescaleDB (PostgreSQL), including experience managing production instances, hypertable partitioning, and continuous aggregates.
Demonstrated experience designing and managing large-scale data lakes or warehouses utilizing cloud-based object storage (AWS S3, Azure Blob Storage, or GCP Cloud Storage).
Deep experience with streaming platforms (e.g., Apache Kafka, Flink) and real-time data ingestion patterns.
Proficiency in modern programming languages (e.g., Python, Scala, or Go) for data processing and pipeline orchestration.
Preferred Skills & Domain Knowledge
Familiarity with the NVIDIA Computer Vision stack, including concepts related to DeepStream, NVIDIA GPUs, or edge-to-cloud data flow.
Experience or strong understanding of the requirements for storing and retrieving high-dimensional time-series data (e.g., sensor data, telemetry, and machine learning metadata).
Experience with containerization and orchestration (Docker, Kubernetes) in the context of data processing jobs.
Familiarity with database security protocols and compliance requirements for sensitive data.
Perks you enjoy at KMS Mexico
KMS Technology was established in 2009 as a U.S.-based software services company. With development centers in Vietnam and Mexico, we are trusted globally for the quality of our software consulting and development services, technology solutions, and engineers' expertise. We pride ourselves on creating brilliant solutions for our clients by leveraging deep expertise, advanced technologies, and delivery excellence for a shared success where everyone can reach their fullest potential.
We have three Business Lines:
KMS Software: Leverage software domain expertise to help clients make better business decisions in technology platforms, increase speed-to-market, and gain critical development support through innovative technology solutions.
KMS Solutions: Empower BFSI businesses to embrace the digital finance revolution and expedite clients’ journey towards complete digitalization through technology consulting, data analytics, software development, and software quality.
KMS Healthcare: Build transformative next-gen technologies to solve healthcare’s most challenging problems, providing innovative tools and expertise to providers, payers, life sciences, and medical technology vendors.