Your contacts at Teradata (95)
+ 86 more
Why You're a Fit
Job Description
Our Company:
At Teradata, we believe that people thrive when empowered with better information. That’s why we built the most complete cloud analytics and data platform for AI. By delivering harmonized data, trusted AI, and faster innovation, we uplift and empower our customers—and our customers’ customers—to make better, more confident decisions. The world’s top companies across every major industry trust Teradata to improve business performance, enrich customer experiences, and fully integrate data across the enterprise.
What You'll Do: Shape the Way the World Understands Data
We’re seeking a Sr. AI Data Scientist to drive the training and fine-tuning of small and large language models. This role covers the full lifecycle — from data collection and preprocessing through training, evaluation, optimization, and deployment into production. You’ll work on building models that excel in domain-specific tasks, instruction following, tool/structured output reliability, and efficient inference. You will:
- Train and fine-tune open-source and proprietary language models (SLMs/LLMs) for domain-specific use cases, instruction following, and task-oriented outputs.
- Build, preprocess, and curate high-quality training datasets, including instruction data, synthetic data generation, and quality/contamination filtering.
- Develop evaluation frameworks and benchmarks to measure model quality, accuracy, hallucination reduction, and performance metrics.
- Apply parameter-efficient fine-tuning strategies (e.g., LoRA/QLoRA, adapters) and other adaptation techniques (SFT, DPO/RL-based alignment).
- Optimize models for inference speed, latency, and cost using quantization/distillation and serving frameworks (e.g., vLLM, TensorRT-LLM, ONNX).
- Collaborate with engineering teams to deploy models into production and monitor performance over time.
- Stay current with research and industry best practices; contribute new ideas to improve training workflows.
Who You'll Work With: Join Forces with the Best
You’ll collaborate with a world-class team of AI architects, ML engineers, and domain experts at Silicon Valley, working together to build the next generation of enterprise AI systems.
You’ll also work cross-functionally with:
- Product managers and UX designers to craft agentic workflows that are intuitive and impactful.
- Domain specialists to ensure solutions align with real-world business problems in regulated industries.
Who You'll Work With: Join Forces with the Best
Collaborate with a talented team of AI architects, ML engineers, and domain experts in Silicon Valley, all working together to advance the future of enterprise AI. Work closely with infrastructure teams to scale AI workloads globally. This is a rare opportunity to shape cutting-edge AI capabilities within a dynamic, data-driven company, where innovation thrives, and your ideas help define the next generation of data interaction.
Your Qualifications and Qualities
Required:
- B.S./M.S./Ph.D. in Computer Science, Machine Learning, AI, or a related technical field.
- Strong proficiency in Python and PyTorch.
- Solid understanding of Transformer architectures, tokenization, and training objectives
- Familiar with state-of-the-art techniques for preparing AI training data.
Preferred:
- 3+ years of experience in ML / NLP engineering, with hands-on LLM or SLM training or fine-tuning
- Experience with Hugging Face ecosystem and PEFT methods (LoRA / QLoRA)
- Hands-on experience with fine-tuning methods, including full parameter updates and PEFT (LoRA/QLoRA).
- Familiarity with distributed training (DeepSpeed, Accelerate) and GPU workflows.
- Experience deploying models into production environments.
#LI-PG1