Cloud InfrastructureComputer ScienceArtificial IntelligenceComputational LinguisticsContainerizationData ScienceLarge Language ModelsNatural Language ProcessingPattern RecognitionPredictive AnalyticsAWS Experience In The Life/Health Sciences Industry Published Work In The AI Field Production-Level Development Skills Leadership Abilities Familiarity With Model Registry Tools Knowledge Of Distributed Computing Platforms
Senior (5-8 years) - Expert (9+ years)
UK
The Industry Cloud for Life Sciences
8465+ employees
SaaS
Role
Who you are
4+ years of data science experience (or 2+ years with a Ph.D.)
Master’s or Ph.D. in Computer Science, AI, Computational Linguistics, or a related field
Strong foundation in Natural Language Processing (NLP) and Machine Learning (ML)
Experience with Reinforcement Learning from Human Feedback (RLHF) methods like Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO) for training LLMs based on human preferences
Hands-on experience with large language models and transformers (e.g., GPT, BERT)
Proficient in Python and NLP libraries (e.g., NLTK, SpaCy, Hugging Face)
Familiarity with Big Data frameworks (e.g., Ray, Spark) and Deep Learning frameworks (e.g., PyTorch, JAX)
Experience with cloud infrastructure, containerization (Docker, Kubernetes)
Excellent collaboration and communication skills for cross-functional teamwork
Desirables
AWS experience
Experience in the life/health sciences industry, especially pharma
Published work in the AI field
Production-level development skills
Leadership abilities with a strong network for team growth and hiring
Familiarity with model registry tools (e.g., MLflow)
Knowledge of distributed computing platforms (e.g., Ray, Spark)
What the job involves
Develop LLM-based agents specialized in extracting detailed information about Key Opinion Leaders (KOLs) in healthcare
Build an end-to-end pipeline for analyzing unstructured websites and medical documents, enabling semantic searches for KOL data across various languages
Create models for information extraction using cloud infrastructure and collaborate with software developers for deployment
Train ML models with input from 2000+ curators to ensure quality and scalability across different regions, languages, and specialties
Veeva Systems Inc. is committed to revolutionizing the life sciences industry by providing innovative cloud-based software solutions that accelerate the delivery of therapies to patients globally. With a focus on innovation, product excellence, and customer success, Veeva aims to make a positive impact on its customers, employees, and communities.