We are looking for an AI Engineer with expertise in generative AI and text-to-speech (TTS) technologies. The role requires proficiency in TTS, knowledge of diffusion models and LLMs, advanced prompting skills for LLMs and for image and video generators, and hands-on experience deploying and integrating AI solutions in production.
Key Responsibilities:
- Use, train, and experiment with TTS models.
- Utilize and experiment with diffusion models (e.g., Flux dev, Flux schnell).
- Develop prompts for LLMs and optimize inputs for image and video generators.
- Implement and deploy generative AI models using Docker and other containerization tools.
- Integrate APIs, including Claude and OpenAI, into scalable applications.
- Work with cloud platforms (AWS/GCP) for AI model deployment and optimization.
Requirements
- Technical Skills:
- Proficiency in Python and widely used AI libraries: PyTorch, Diffusers, Accelerate.
- Familiarity with TTS technologies, both open-source and proprietary solutions.
- Familiarity with RVC, Coqui TTS, and other TTS-focused tools.
- Strong experience in containerization and deployment (Docker, Kubernetes).
- Familiarity with cloud services (AWS/GCP) for hosting and scaling AI solutions.
- Expertise in API integration, especially Claude, OpenAI, and ElevenLabs or similar TTS providers.
- Basics of prompt engineering for LLMs and advanced prompting for image and video generation.
- Familiarity with the Hugging Face ecosystem.
- Experience:
- Proven track record of training and deploying TTS and other generative AI models.
- Strong experience in API integration and cloud-based deployments.
- Willingness to work in a fast-paced and challenging environment.
Preferred Qualifications (Optional):
- Experience with complex multi-modal AI solutions.
- Keeping up with the newest models, trends, and solutions on the market.
Benefits
- Fully Remote Position.
- Additional bonuses may apply.