About Docsumo:
Docsumo is your go-to Document AI solution for streamlining business operations. We turn complex documents like bank statements, policies, and financial statements into valuable, actionable data. Our cutting-edge technology helps businesses make smarter decisions faster. We are backed by marquee investors such as Sequoia, Barclays, Fifth Wall, Common Ocean, and Techstars.
At Docsumo, we're on a mission to revolutionize how businesses handle data. We empower companies to:
Boost efficiency by 6-10 times
Make quick, accurate decisions from unstructured information
Scale operations effortlessly through innovative technology
The opportunity as Senior Machine Learning Engineer:
We are seeking senior professionals with over 4 years of experience in the field. This is a role for candidates with a proven track record in machine learning, deep learning, NLP, and computer vision.
If you have led and managed a team of ML scientists and engineers and have a strong foundation in deploying end-to-end ML solutions, this opportunity is for you.
This role offers the chance to rapidly advance into leadership positions, such as Lead ML Engineer, where you'll spearhead critical projects with creative autonomy.
You'll work closely with our CTO, Data Science, and Engineering teams, shaping the future of intelligent document processing for our expanding client base in the US.
This is a full-time role with flexible options, including hybrid work in Kathmandu or remote work from India.
Working hours are 10:00 am to 7:00 pm IST, with a 1-hour lunch break.
You will report directly to the Data Science Lead or the CTO, collaborating with a talented team to deliver innovative ML solutions that drive customer success.
Key Responsibilities
Collaborate with cross-functional teams of scientists and engineers to design, develop, and implement advanced machine learning systems that transform innovative ideas from conceptual stages into operational APIs.
Conduct cutting-edge research in machine learning, focusing on the application and fine-tuning of Large Language Models (LLMs) to develop robust, scalable solutions for intelligent document processing.
Lead a team of data scientists and machine learning engineers, providing mentorship and fostering a culture of collaboration and continuous learning to achieve high performance and innovative outcomes.
Plan, manage, and oversee the full lifecycle of ML projects, ensuring alignment with business goals and timely delivery of high-quality solutions. Develop and apply sophisticated machine learning algorithms to address complex business challenges, particularly those involving the processing and analysis of unstructured data.
Engage in Agile development processes including regular standups, sprint planning, and retrospectives to facilitate iterative progress and maintain high standards of output.
Ensure the documentation of machine learning methodologies, model development processes, and maintain rigorous testing standards to ensure reliability and efficiency of models in production.
Drive the integration and optimization of LLMs and other advanced models to enhance performance and operational efficiency, continuously seeking improvements.
Need to Have:
At least 4 years of industry experience in machine learning, deep learning, NLP, and computer vision, ideally within tech companies, product startups, or R&D environments.
1-2 years of experience leading teams of 4-5+ ML scientists and engineers, demonstrating effective leadership and project management skills. Proficiency in PyTorch and TensorFlow, with experience in training deep neural networks, implementing transfer learning, and optimizing models.
Strong skills in classification and regression techniques, with hands-on experience in Scikit-learn, Numpy, Pandas, and Scipy. Practical experience with Transformers, such as BERT and GPT, for tasks like text classification, entity recognition, and sentiment analysis.
Proficiency in Python and strong understanding of Object-Oriented Programming (OOP) principles.
Experience with version control using Git, cloud platforms like AWS and Google Cloud, and containerization technologies such as Docker and Kubernetes.
Ability to work effectively in a team, demonstrating motivation, resourcefulness, and a growth mindset.
Bachelor’s degree in Computer Science, Statistics, Machine Learning, Physics, Mathematics, or a related field.
Nice to Have:
Experience with Large Language Models, including techniques like PEFT, prompt engineering, few-shot learning, and RLHF.
Knowledge of computer vision applications like OCR, OpenCV, CNNs, and multimodal AI.
Experience with advanced data visualization libraries.
Familiarity with Agile development practices and sprint management.
A track record of innovative research contributions or publications in the machine learning domain.