About the Role
We are seeking a talented AI Engineer to join our cutting-edge team. In this role, you'll tackle complex challenges in large language models (LLMs), optical character recognition (OCR), and model scaling. You'll be at the forefront of developing and optimizing AI systems that push the boundaries of what's possible in machine learning.
Key Responsibilities
- Lead research initiatives to improve OCR accuracy across diverse document types and languages
- Train and fine-tune LLMs using domain-specific data to enhance performance in specialized contexts
- Develop techniques to scale LLMs efficiently for high-volume production environments
- Design and implement novel approaches to model optimization and evaluation
- Collaborate with cross-functional teams to integrate AI solutions into production systems
- Stay current with the latest research and incorporate state-of-the-art techniques
- Document methodologies, experiments, and findings for both technical and non-technical audiences
Required Qualifications
- Masters or PhD in Computer Science, Machine Learning, AI, or a related field
- Minimum 8+ yr of working experience in the relevant skills and technologies
- Strong understanding of deep learning architectures, particularly transformer-based models
- Experience with OCR systems and techniques for improving text recognition accuracy
- Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, or JAX)
- Demonstrated ability to implement and adapt research papers into working code
- Excellent problem-solving skills with a methodical approach to experimentation
- Strong communication skills to explain complex technical concepts clearly
Preferred Qualifications
- Research focus during PhD in areas relevant to our work (NLP, computer vision, multimodal learning)
- Familiarity with distributed training systems for large-scale models
- Experience with model quantization, pruning, and other efficiency techniques
- Understanding of evaluation methodologies for assessing model performance
- Knowledge of MLOps practices and tools for model deployment
- Publications at top-tier ML conferences (NeurIPS, ICML, ACL, CVPR, etc.)
About FourKites
FourKites®, the leader in AI-driven supply chain transformation for global enterprises and pioneer of advanced real-time visibility, turns supply chain data into automated action. FourKites’ Intelligent Control Tower™ breaks down enterprise silos by creating a real-time digital twin of orders, shipments, inventory and assets. This comprehensive view, combined with AI-powered digital workers, enables companies to prevent disruptions, automate routine tasks, and optimize performance across their supply chain. FourKites processes over 3.2 million supply chain events daily — from purchase orders to final delivery — helping 1,600+ global brands prevent disruptions, make faster decisions and move from reactive tracking to proactive supply chain orchestration.
Working at FourKitesWe provide competitive compensation with stock options, outstanding benefits and a collaborative culture for all employees around the globe, including:
- 5 global recharge days, in addition to standard holidays, and a hybrid, flexible approach to work.
- Parental leave for all parents, an annual wellness stipend and volunteer days also provide you with time and resources for self care and to care for others.
- Opportunities throughout the year to learn and celebrate diversity.
- Access to leading AI tools and foundation models, with the freedom to experiment and find creative ways to be more effective in your role
And we're always listening for new ways to support everyone in and out of the office.