Invent and deploy state-of-the-art generative AI capabilities and tools for millions of users, ensuring impactful visual expression and collaboration across various teams.
At Picsart, we bring the wonder of creativity to the world and make it easy. As a Senior ML Scientist on our Generative Computer Vision team, you’ll help invent and deploy breakthrough generative AI capabilities for millions of creators globally. Your work will shape the future of visual expression by building state-of-the-art tools at the intersection of research and real-world impact.
Invent and publish advanced diffusion, transformer, and multimodal text-to-image/video models, including high-resolution generation (up to 16K).
Build innovative features for image/video retouching, effects, quality enhancement, and avatar generation.
Develop and optimize training and inference pipelines with use of data parallelism, quantization, distillation, TensorRT.
Design, build, and maintain training and evaluation workflows using PyTorch 2.x, DataBricks, and SLURM.
Create human-in-the-loop evaluations including side-by-side visual comparisons and technical metrics calculation.
Collaborate closely with product, design, and engineering teams to ship research-driven features at scale.
Ph.D. (or equivalent research experience) in Computer Science, Electrical Engineering, Mathematics, or a related field.
5+ years of experience delivering computer vision and generative AI models in production.
First-author publications or patents in top-tier venues such as CVPR, ICCV, ECCV, SIGGRAPH, NeurIPS, or ICLR.
Deep expertise in PyTorch and CUDA.
Hands-on experience with diffusion models, GANs, and vision-language architectures.
Strong foundations in linear algebra, probability, optimization, and large-scale model training techniques (e.g., mixed precision, gradient checkpointing).
Contributions to open-source ML/CV projects or toolkits.
#LI-MS1
For Applicants Based in California - California Job Applicant Privacy Notice (https://rb.gy/lqu5mv)
Picsart is the largest digital creation platform where users create, remix, and share billions of visual stories using intuitive editing tools. With one of the largest open-source content collections, including free photos, stickers, and templates, it caters to a global community across various devices in 30 languages.
Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.
Scientist Q&A's