Dataset Engineering Index
Designing and curating training data for adaptation.
Notes
- Instruction Data Design — Formats, quality dimensions, collection strategies, and diversity analysis for SFT data.
- Synthetic Data Generation — Self-Instruct, Evol-Instruct, verified synthetic pipelines, and data flywheel architecture.
Navigation
← Prev ← Fine-tuning | Next → Inference Optimization →