Model Registry: HuggingFace Hub Private Repo + MLflow + S3 Layout + Versioning
How do you manage 50+ fine-tuned (FT) model versions in production? This lesson uses a hybrid of a HuggingFace Hub private repo, the MLflow Model Registry, and S3 (chunked artifacts). It covers the versioning convention (semver + lineage), tags (production/canary/archive), the retention policy, and the Cookbook's model card template (LoRA adapter + base + recipe).
Şükrü Yusuf KAYA
28 min read
Advanced

1. Cookbook Model Registry Hierarchy
HuggingFace Hub (private repo): kompanyam/llm-models
├── llama-3.1-8b-tr-instruct-v1.0/            # Stable baseline
├── llama-3.1-8b-tr-instruct-v1.1/            # Minor improvement
├── llama-3.1-8b-tr-instruct-v2.0/            # Major retrain
├── llama-3.1-8b-tr-customer-support-v1.0/    # Domain variant
└── ...

Inside each repo:
- adapter_model.safetensors               # LoRA weights
- adapter_config.json                     # PEFT config
- tokenizer.json / tokenizer_config.json
- README.md                               # model card (required)
- eval_results.json                       # benchmark results
- training_config.yaml                    # for reproducibility
- WANDB_RUN_URL                           # full training telemetry
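The naming convention and file list above can be checked mechanically before anything is pushed. The following is a minimal sketch, not the Cookbook's own tooling: the helper names (`parse_model_dir`, `missing_files`, `ModelVersion`) are hypothetical, the pattern assumes every directory ends in `-v<major>.<minor>` as in the tree above, and it assumes both tokenizer files are required and that `WANDB_RUN_URL` is stored as a plain file.

```python
import re
from pathlib import Path
from typing import NamedTuple

# Files from the layout above. Assumption: all of them are required,
# and WANDB_RUN_URL is a plain file holding the run URL.
REQUIRED_FILES = (
    "adapter_model.safetensors",
    "adapter_config.json",
    "tokenizer.json",
    "tokenizer_config.json",
    "README.md",
    "eval_results.json",
    "training_config.yaml",
    "WANDB_RUN_URL",
)

# <model-name>-v<major>.<minor>, e.g. llama-3.1-8b-tr-instruct-v1.0
NAME_RE = re.compile(r"^(?P<name>.+)-v(?P<major>\d+)\.(?P<minor>\d+)$")


class ModelVersion(NamedTuple):
    name: str
    major: int
    minor: int


def parse_model_dir(dirname: str) -> ModelVersion:
    """Split a directory name into (name, major, minor); raise if it breaks the convention."""
    match = NAME_RE.match(dirname.rstrip("/"))
    if match is None:
        raise ValueError(f"not a <name>-v<major>.<minor> directory: {dirname!r}")
    return ModelVersion(match["name"], int(match["major"]), int(match["minor"]))


def missing_files(model_dir: Path) -> list[str]:
    """Return the required files that are absent from a local model directory."""
    return [f for f in REQUIRED_FILES if not (model_dir / f).is_file()]
```

Because `ModelVersion` is a tuple, versions of the same model compare in the expected order, so `max()` over the parsed entries of one model name picks the newest version.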
Lineage triple (Part 0, Lesson 0.5):
- _git_sha: code version
- _data_sha256: dataset version
- _wandb_run_id: training run ID

With this triple, a training run can still be reproduced six months later.
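One way to assemble the triple is sketched below, under stated assumptions: the dataset is a single local file, the git SHA is passed in by the caller (for example the output of `git rev-parse HEAD`), and the W&B run ID is passed in as a string (for example `run.id` from an active run). The function names `sha256_of_file` and `build_lineage` are hypothetical; only the three key names come from the list above.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in 1 MiB chunks so large datasets never sit fully in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_lineage(git_sha: str, dataset_path: Path, wandb_run_id: str) -> dict:
    """Build the lineage triple; key names follow the list above."""
    return {
        "_git_sha": git_sha,                            # code version
        "_data_sha256": sha256_of_file(dataset_path),   # dataset version
        "_wandb_run_id": wandb_run_id,                  # training run ID
    }
```

Storing the resulting dictionary next to the adapter (for example inside `training_config.yaml` or the model card) keeps the three identifiers attached to the artifact they describe.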
✅ Deliverable
1) Open a private repo on the HF Hub.
2) Push one FT model, following the full convention.
3) Next lesson: 16.2: A/B + Shadow Traffic.
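Steps 1 and 2 can be sketched with the `huggingface_hub` client. This is a hedged example, not the Cookbook's reference script: `push_model_version` is a hypothetical helper, the `api` argument is expected to be a `huggingface_hub.HfApi` instance, and it assumes each version lives as a subfolder of one private repo, as the tree in section 1 suggests. If each version is its own repo instead, drop `path_in_repo` and pass the per-version repo ID.

```python
def push_model_version(api, repo_id: str, local_dir: str, version_dir: str) -> None:
    """Create the private repo if needed, then upload one model version folder.

    `api` is expected to be a huggingface_hub.HfApi instance (assumption:
    the caller is already authenticated, e.g. via an HF token).
    """
    # Step 1: private repo; exist_ok makes the call safe to repeat.
    api.create_repo(repo_id=repo_id, private=True, exist_ok=True)

    # Step 2: upload the local adapter folder under its versioned directory.
    api.upload_folder(
        repo_id=repo_id,
        folder_path=local_dir,
        path_in_repo=version_dir,
        commit_message=f"Add {version_dir}",
    )


# Usage (requires `pip install huggingface_hub` and network access):
# from huggingface_hub import HfApi
# push_model_version(
#     HfApi(),
#     "kompanyam/llm-models",
#     "outputs/adapter",                      # hypothetical local path
#     "llama-3.1-8b-tr-instruct-v1.0",
# )
```

Passing the client in as an argument keeps the function easy to exercise with a stub object, without touching the network.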