# Data License Chain: CC-BY-SA Viral Effect + Common Crawl ToS + GitHub Permissive Filter

> Source: https://sukruyusufkaya.com/en/learn/fine-tuning-cookbook/ftc-data-license-chain
> Updated: 2026-05-14T14:43:03.631Z
> Category: Fine-Tuning Cookbook (Model-by-Model)
> Module: Part XVIII — Compliance, Governance & Red-Teaming
**TLDR:** How does training dataset's license affect FT model? CC-BY-SA viral (derivative must be same license), Common Crawl ToS (research only), GitHub permissive filter (MIT/Apache/BSD only — no GPL). Can model trained on Wikipedia (CC-BY-SA) be CC-BY-SA? Legal gray area.

