Hello moles! I've released a new set of Google Code project data to our own downloads page (on Google Code, no less!) - the datasource_id is 226.
This data took over a month to collect. Included are the following:
--project names (info)
--project license, code and content (info)
--project summary (info)
--project description (info)
--project activity level (info)
--who works on what project and what their role is (people)
--what blogs are listed for each project (blogs)
--what links are listed for each project (links)
--what labels are used to describe each project (labels)
May 2010 data is released for some forges.
-Freshmeat (datasource 218)
-Rubyforge (datasource 219)
-ObjectWeb (datasource 220)
-Free Software Fntn (datasource 221)
-Google Code (datasource 222) - list of projects only
Our collectors for Savannah, Sourceforge, Github, Tigris, Launchpad are all undergoing maintenance at the moment.
UPDATE May 28, 2010
-Savannah data has been released (datasource 224)
Link to download the FLOSSmole data on Google Code.
Lots of new data for you to peruse out on our FLOSSmole Data Downloads Page.
Here's what's out there, recently added:
Google Code, March 2010 (GC) - list of all GC projects donated by Audris Mockus (HUGE THANK YOU TO AUDRIS FOR THIS!!)
Freshmeat, February 2010 (FM)
Objectweb, February 2010 (OW)
Rubyforge, February 2010 (RF)
Github, February 2010 (GH)
Free Software Foundation, February 2010 (FSF)
Savannah, February 2010 (SV)
and Sourceforge from December 2009 (SF)
We have another set of bugs to fix with Sourceforge collection this year, 2010, but those are forthcoming. I'm running a collection now. Hopefully the data will be good. We may even have stats this time. Hallelujah.
Also, thanks to my phenomenal undergraduate superstar Steven Norris, Tigris is coming soon!! and Debian after that. We are rocking the repository collection...