Data License Chain: CC-BY-SA Viral Effect + Common Crawl ToS + GitHub Permissive Filter
How does training dataset's license affect FT model? CC-BY-SA viral (derivative must be same license), Common Crawl ToS (research only), GitHub permissive filter (MIT/Apache/BSD only — no GPL). Can model trained on Wikipedia (CC-BY-SA) be CC-BY-SA? Legal gray area.
Şükrü Yusuf KAYA
22 min read
Advanced✅ Teslim
- Training data'ndaki tüm kaynakların lisansını sırala. 2) Hukuki risk değerlendir. 3) Sonraki ders: 18.5 — Model Card + Datasheet.
Yorumlar & Soru-Cevap
(0)Yorum yazmak için giriş yap.
Yorumlar yükleniyor...
Related Content
Part 0 — Engineering Foundations
Welcome to the Fine-Tuning Cookbook: System, Stage Taxonomy, and the Reproducibility Contract
Start LearningPart 0 — Engineering Foundations
Reproducibility Stack: Seeds, cuDNN Flags, and Deterministic CUDA — End the 'Works on My Machine' Problem
Start LearningPart 0 — Engineering Foundations