You can divide the recent history of LLM data scraping into a few phases. For years there was an experimental period, when ethical and legal considerations about where and how to acquire training data ...