Submitted by megan on September 12, 2013 - 8:21am
We here at FLOSSmole have been gathering data about how free, libre, and open source software is made for about 10 years now, actually a little more.
In that time, a lot has changed in the forge landscape, both with the players and with the tools.
Just for fun, I decided to run a few quick queries to show the ascendance of Github and the concurrent decline of some smaller forges. These two graphs show the rate of new project creation (called 'registration' on Rubyforge and 'creation' at Github - and yes, the Github numbers do include forks).
Submitted by megan on May 23, 2012 - 11:43am
Data files have been released for May 2012. Go check it out on our Google Code downloads page.
Re-writes:
--Free Software Foundation has been re-written from scratch to match their new layout.
--Google Code collector has been re-written to fix a few bugs (still running)
--Launchpad has been finished and will be re-written for June to fix bugs
--Alioth is being re-written to fix bugs
Submitted by megan on March 1, 2012 - 9:23am
February data has been released for Github.
Get the data here from our Google Code downloads page or request direct database access here.
Included with Github data are the following values:
project name
developer name
description
private yes/no
fork number
homepage
number of watchers
open issues
...and all the xml values that these fields are based on!
Have fun!
Submitted by megan on January 18, 2012 - 1:34pm
We're cruising ahead with January 2012 releases. Grab the data from Google Code site or from the teragrid.
Freecode - done (formerly known as Freshmeat)
Savannah - done
Tigris - done
Rubyforge - done
Objectweb - done
Launchpad - done
Submitted by megan on November 2, 2011 - 12:49pm
Here is the status of the November 2011 collection:
done & ready to download on Google Code or query in Teragrid...
============
RUBYFORGE
OBJECTWEB
TIGRIS
LAUNCHPAD
SAVANNAH
ALIOTH
GITHUB
still collecting...
============
GOOGLE
Submitted by megan on June 20, 2011 - 5:00am
Summer is a beautiful thing. Moles, we've got a huge Google Code release for you (ds=271), and the re-vamped Launchpad (ds=272), and also Github (ds=273).
Get your FRESH June data on our Google Code Downloads Page or LIVE on the Teragrid.
Tigris is fixed and is running right now. We're also writing a new collector for Alioth! Lots of new stuff.
Submitted by megan on June 6, 2010 - 11:11am
Submitted by megan on March 9, 2010 - 7:41pm
Lots of new data for you to peruse out on our FLOSSmole Data Downloads Page.
Here's what's out there, recently added:
Google Code, March 2010 (GC) - list of all GC projects donated by Audris Mockus (HUGE THANK YOU TO AUDRIS FOR THIS!!)
Freshmeat, February 2010 (FM)
Objectweb, February 2010 (OW)
Rubyforge, February 2010 (RF)
Github, February 2010 (GH)
Free Software Foundation, February 2010 (FSF)
Savannah, February 2010 (SV)
and Sourceforge from December 2009 (SF)
Submitted by megan on December 9, 2009 - 11:49am
December data has been released for the following forges:
(datasource-abbreviation-full name)
200-fm-freshmeat
201-rf-rubyforge
202-ow-objectweb
203-fsf-free software foundation
204-sv-savannah
205-gh-github
Sourceforge is in progress... it will be datasource_id=206.
Get the data here:
http://code.google.com/p/flossmole/downloads/list
Remember that the files marked "DM" are SQL files (mysql) but the files marked .txt are flat text files (delimited)
Submitted by megan on November 19, 2009 - 9:28am
This month we have data from Freshmeat, Rubyforge, Objectweb, Savannah, Github, Free Software Foundation.
Downloads available at Google Code
Remember, the SQL is available in the datamart*.sql.bz files, the flat (delimited) data is available in the other files.
We're still working on getting our Sourceforge scraper back up and running, and we thank you for your patience.
Pages