free software foundation

February 2010 Data Released

Lots of new data for you to peruse out on our FLOSSmole Data Downloads Page.

Here's what's out there, recently added:

Google Code, March 2010 (GC) - list of all GC projects donated by Audris Mockus (HUGE THANK YOU TO AUDRIS FOR THIS!!)
Freshmeat, February 2010 (FM)
Objectweb, February 2010 (OW)
Rubyforge, February 2010 (RF)
Github, February 2010 (GH)
Free Software Foundation, February 2010 (FSF)
Savannah, February 2010 (SV)
and Sourceforge from December 2009 (SF)

We have another set of bugs to fix with Sourceforge collection this year, 2010, but those are forthcoming. I'm running a collection now. Hopefully the data will be good. We may even have stats this time. Hallelujah.

Also, thanks to my phenomenal undergraduate superstar Steven Norris, Tigris is coming soon!! and Debian after that. We are rocking the repository collection...

December 2009 data released

December data has been released for the following forges:

(datasource-abbreviation-full name)
200-fm-freshmeat
201-rf-rubyforge
202-ow-objectweb
203-fsf-free software foundation
204-sv-savannah
205-gh-github

Sourceforge is in progress... it will be datasource_id=206.

Get the data here:
http://code.google.com/p/flossmole/downloads/list

Remember that the files marked "DM" are SQL files (mysql) but the files marked .txt are flat text files (delimited)

November 2009 data released

This month we have data from Freshmeat, Rubyforge, Objectweb, Savannah, Github, Free Software Foundation.

Downloads available at Google Code

Remember, the SQL is available in the datamart*.sql.bz files, the flat (delimited) data is available in the other files.

We're still working on getting our Sourceforge scraper back up and running, and we thank you for your patience.

October 2009 data released

October 2009 data has been released. Here are the forges we have this month:
Freshmeat
Rubyforge
ObjectWeb
Free Software Foundation directory
Savannah (new)
GitHub (new)

FLOSSmole Downloads

Sourceforge is undergoing a re-write, still, but we will be collecting again from there soon. In the meantime, don't forget that the June 2009 data is available, and also there is the Notre Dame data if you find that helps at all.

Enjoy!

July 2009 data

Hello moles, our July 2009 data has been released: this month we have Objectweb, Freshmeat, Rubyforge, Free Software Foundation directory.

Go to our Google Code pages to download the data.

The most recent datasource_ids are:
178-fm-July2009
179-rf-July2009
180-ow-July2009
181-fsf-July2009

How many projects of each team size are listed in Free Software Foundation?

Description

This chart shows the number of projects of each team size listed in Free Software Foundation.

Visualization

Projects listed as having 0 developers were disregarded (53 projects).

Free Software Foundation Developer Count Chart

SQL Script

SELECT DISTINCT calc_dev_count, count(DISTINCT proj_num) AS count
FROM fsf_projects
WHERE datasource_id= <current>
GROUP BY calc_dev_count
ORDER BY count DESC, calc_dev_count;

How many projects at each repository share URL's?

Description

This chart shows the number of projects at each repository that share URL's.

Visualization

Number of Projects at each Repository that List a Home Page at Another Repository

How many projects at each repository share identical short project names?

Description

This chart shows the number of projects at each repository that share project names.

Visualization

Number of Projects at each Repository that Share an Identical Short Project Name

Shared Short Names Table

Shared Short Names Chart

This graph shows the number of short project names shared in common between each pair of projects. For instance, starfish is a project listed on both Sourceforge and Rubyforge. On Rubyforge, it is described as a "tool to make programming ridiculously easy", but on Sourceforge the starfish project is described as a password management application. There are 1367 projects with shared names on Rubyforge and Sourceforge.

For more information on matching project names and URLs, see:

Squire, M. (2009). Integrating projects from multiple open source code forges. International Journal of Open Source Software & Processes, 1(1). January-March 2009. pp. 46-57.

As of June 2009, how many projects are listed in each repository?

Description

This chart shows the number of projects that FLOSSmole most recently collected from each repository.

Visualization

Project Count Chart

SQL Script

SELECT count(DISTINCT proj_unixname) FROM projects
WHERE datasource_id= <current>;


SELECT count(DISTINCT project_id) FROM fm_projects
WHERE datasource_id= <current>;


SELECT count(DISTINCT proj_unixname) FROM rf_projects
WHERE datasource_id= <current>;


SELECT count(DISTINCT proj_unixname) FROM ow_projects
WHERE datasource_id= <current>;


SELECT count(DISTINCT proj_num) FROM fsf_projects
WHERE datasource_id= <current>;

Syndicate content