Product Do Direct Preference Optimization (DPO) with Arcee AI's training platform Direct Preference Optimization (DPO) is one of the top methods for fine-tuning LLMs... It's available on our model training platform - and today, we bring you support for DPO on our training APIs.
Product Train, Merge, & Domain-Adapt Llama-3.1 with Arcee AI Get Llama-3.1 but better – customize the OS model for all your needs, using Arcee AI's training, merging, and adaptation techniques and tools. Our team created this guide to get you started.