LLMOps Engineer - Cloud/Gen AI

Manifest Solutions - Westerville, OH, United States

Manifest Solutions is currently seeking an LLMOps Engineer - Cloud/Gen AI for an onsite position in Westerville, OH.

  • Conceptualize, develop, and execute Machine Learning (ML)/LLM pipelines specifically for Large Language Models, including data acquisition, pre-processing, model training/tuning, deployment, and monitoring.
  • Utilize automation tools such as GitOps, CI/CD pipelines, and containerization technologies (Docker, Kubernetes) to streamline ML/LLM tasks across the Large Language Model lifecycle.
  • Establish robust monitoring and alerting systems to track Large Language Model performance, data drift, and other key metrics, proactively identifying and resolving issues.
  • Perform truth analysis to assess the accuracy and effectiveness of Large Language Model outputs, comparing them to known, accurate data.
  • Collaborate closely with infrastructure, DevOps teams, and Generative AI Architects to optimize model performance and resource utilization.
  • Oversee and maintain cloud infrastructure (e.g., AWS, Azure) specifically for Large Language Model workloads, ensuring cost-efficiency and scalability.
  • Stay current with the latest advancements in ML/LLM Ops, integrating these developments into generative AI platforms and processes.
  • Communicate effectively with both technical and non-technical stakeholders, providing updates on the performance and status of Large Language Models.

Requirements:

  • Bachelor’s or Master’s degree in Computer Science, Engineering or similar.
  • At least 5 years of experience as an ML engineer within public cloud platforms.
  • Strong programming skills in Python and/or other languages.
  • Expertise in cloud platforms (e.g., AWS, Azure) for ML workloads, MLOps, DevOps, or Data Engineering.
  • Proven experience in MLOps, LLMOps, or related roles, with hands-on experience deploying and managing machine learning and large language model pipelines.
  • Familiarity with generative AI applications and domains such as content creation, data augmentation, style transfer.
  • Strong knowledge of Generative AI architectures and methods, including chunking, vectorization, context-based retrieval and search, working with Large Language Models such as Open AI GPT 3.5/4.0, Llama2, Llama3, Mistral, etc.

Preferred Requirements:

  • Strong understanding of cybersecurity principles and best practices to ensure the integrity, security, and confidentiality of data.
  • Knowledge of AI ethics and understanding how to apply Trustworthy AI to ensure safe, responsible, and ethical use of AI technology.
  • Experience with data engineering and data visualization tools and techniques.
  • Passion for learning and exploring new generative AI technologies and methods.


Posted On: Monday, December 16, 2024



Apply to this job

or