AI Engineer - Machine Learning and Model Deployment Specialist
Location:
San Francisco, CA / Remote
Overview:
We are looking for an experienced AI Engineer to join our team, focusing on machine learning tasks including dataset engineering, model fine-tuning, deployment, and scaling. You will work on projects spanning text generation chat models (Mistral, Mixtral), text-to-speech (TTS) models, moderation tools, and other open-source models for text generation and transcription.
Key Responsibilities:
- Dataset Engineering: Develop and manage datasets for training and testing AI models. Ensure the quality and relevance of data used for different projects.
- Model Fine-Tuning: Fine-tune machine learning models, including our text generation chat models (Mistral, Mixtral) and other function/tool models that produce structured output.
- Deployment and Scaling: Oversee the deployment of AI models into production and manage their scaling. Ensure efficient and robust model performance in live environments.
- Pipeline Management: Design and maintain efficient pipelines for data processing, model training, and inference. Ensure seamless integration of different components of the AI system.
- Model Serving and Inference: Implement and optimize model serving solutions for real-time and batch inference scenarios.
- Handling Open-Source Models: Work with open-source models for applications such as summarization and speech-to-text (e.g., Whisper), and adapt and integrate them into our ecosystem.
Qualifications:
- Strong experience in machine learning, data engineering, and model deployment.
- Proficiency in fine-tuning and scaling AI models.
- Experience with AI/ML tools and frameworks, specifically PyTorch.
- Familiarity with cloud services and deployment platforms.
- Ability to work with open-source models and quickly adapt them to specific use cases.
What We Offer:
- A chance to work on exciting and innovative AI projects.
- Extremely competitive salary, equity, and benefits.