Not a supporter of Copilot, but I think it's pretty easy to access the same data through BigQuery:
>The Google BigQuery Public Datasets program now offers a full snapshot of the content of more than 2.8 million open source GitHub repositories in BigQuery. Thanks to our new collaboration with GitHub, you'll have access to analyze the source code of almost 2 billion files with a simple (or complex) SQL query.
There is a distinction between being able to access the source code, and a tool giving it to you without any context of the underlying license it is governed by.
GP was saying that GitHub has an unfair advantage in that they have instant access to all GitHub code, whereas everyone else is rate limited.
I'm pointing out that this limitation is not meaningful because everyone can access all GitHub hosted source code through BigQuery, where they won't be rate limited.
>The Google BigQuery Public Datasets program now offers a full snapshot of the content of more than 2.8 million open source GitHub repositories in BigQuery. Thanks to our new collaboration with GitHub, you'll have access to analyze the source code of almost 2 billion files with a simple (or complex) SQL query.
https://cloud.google.com/blog/topics/public-datasets/github-...