AI Engineer - Machine Learning and Model Deployment Specialist

Location:

San Francisco, CA / Remote

Overview:

We are looking for an experienced AI Engineer to join our team and focus on machine learning work spanning dataset engineering, model fine-tuning, deployment, and scaling. You will work on cutting-edge projects including text generation chat models (Mistral, Mixtral), text-to-speech (TTS) models, moderation tools, and other open-source models for text generation and transcription.

Key Responsibilities:

Dataset Engineering: Develop and manage datasets for training and testing AI models. Ensure the quality and relevance of data used for different projects.
Model Fine-Tuning: Fine-tune various machine learning models, including our text generation chat models (Mistral, Mixtral) and other functional/tool models with structured output.
Deployment and Scaling: Oversee the deployment of AI models into production and manage their scaling. Ensure efficient and robust model performance in live environments.
Pipeline Management: Design and maintain efficient pipelines for data processing, model training, and inference. Ensure seamless integration of different components of the AI system.
Model Serving and Inference: Implement and optimize model serving solutions for real-time and batch inference scenarios.
Handling Open-Source Models: Work with open-source models for applications such as summarization and speech-to-text (e.g., Whisper). Adapt and integrate these models into our ecosystem.

Qualifications:

Strong experience in machine learning, data engineering, and model deployment.
Proficiency in fine-tuning and scaling AI models.
Experience with AI/ML tools and frameworks, specifically PyTorch.
Familiarity with cloud services and deployment platforms.
Ability to handle open-source models and adapt them to specific use cases quickly.

What We Offer:

A chance to work on exciting and innovative AI projects.
Extremely competitive salary, equity, and benefits.