GitHub releases data on 2.8 million open source repositories through Google BigQuery
GitHub today announced that it’s releasing activity data for 2.8 million open source code repositories and making it available for people to analyze with the Google BigQuery cloud-based data warehousing tool.
The data set is free to explore. (With BigQuery you get to process up to one terabyte each month free of charge.)
This new 3TB data set includes information on “more than 145 million unique commits, over 2 billion different file paths and the contents of the latest revision for
Read more »