Now in public preview, W&B Training offers serverless post-training for large language models (LLMs), including both reinforcement learning (RL) and supervised fine-tuning (SFT).Documentation Index
Fetch the complete documentation index at: https://wb-21fd5541-weave-caching.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
- Serverless RL: Improve model reliability performing multi-turn, agentic tasks while increasing speed and reducing costs. RL is a training technique where models learn to improve their behavior through feedback on their outputs.
- Serverless SFT: Fine-tune models using curated datasets for distillation, teaching output style and format, or warming up before RL.
- ART, a flexible fine-tuning framework.
- RULER, a universal verifier.
- A fully-managed backend on CoreWeave Cloud.