free software foundation

What packages are required most by Free Software Foundation projects?

Description

This chart shows the packages that are required most frequently in FSF projects. There is a total of 8,024 requirements.

Visualization



SQL Script

SELECT requirement AS Requirement, COUNT(proj_num) AS Count
FROM fsf_project_requirements
WHERE datasource_id = <current>
GROUP BY Requirement
ORDER BY Count DESC;

How many projects of each team size are listed in Free Software Foundation?

Description

This chart shows the number of projects of each team size listed in Free Software Foundation.

Visualization

Projects listed as having 0 developers were disregarded (57 projects).

SQL Script

SELECT DISTINCT calc_dev_count, COUNT( DISTINCT proj_num ) AS count
FROM fsf_projects
WHERE datasource_id= <current>
GROUP BY calc_dev_count
ORDER BY count DESC , calc_dev_count;

How many projects at each repository share identical short project names?

Description

This chart shows the number of projects at each repository that share project names as of May 2011.

Visualization

project-names

SQL Script

RF-FM

SELECT COUNT( f.projectname_short_fixed )
FROM fm_projects f, rf_projects r
WHERE f.projectname_short_fixed = r.proj_unixname
AND f.datasource_id = <current>
AND r.datasource_id = <current>;

RF-OW

SELECT COUNT( r.proj_unixname )
FROM rf_projects r, ow_projects o
WHERE r.proj_unixname = o.proj_unixname
AND r.datasource_id = <current>
AND o.datasource_id = <current>;

FM-OW

SELECT COUNT( f.projectname_short_fixed )
FROM fm_projects f, ow_projects o
WHERE f.projectname_short_fixed = o.proj_unixname
AND f.datasource_id = <current>
AND o.datasource_id = <current>;

RF-FSF

SELECT COUNT( f.proj_unixname )
FROM fsf_projects f, rf_projects r
WHERE f.proj_unixname = r.proj_unixname
AND f.datasource_id = <current>
AND r.datasource_id = <current>;

FM-FSF

SELECT COUNT( f.proj_unixname )
FROM fsf_projects f, fm_projects fm
WHERE f.proj_unixname = fm.projectname_short_fixed
AND f.datasource_id = <current>
AND fm.datasource_id = <current>;

OW-FSF

SELECT COUNT( f.proj_unixname )
FROM fsf_projects f, ow_projects o
WHERE f.proj_unixname = o.proj_unixname
AND f.datasource_id = <current>
AND o.datasource_id = <current>;

How many projects are listed in each repository?

Description

This chart shows the number of projects that FLOSSmole most recently collected from each repository.

Visualization

Project Count Chart


SQL Script


SELECT COUNT( DISTINCT proj_name )
FROM gc_projects
WHERE datasource_id = <current>;


SELECT COUNT( DISTINCT project_id )
FROM fm_projects
WHERE datasource_id = <current>;


SELECT COUNT( DISTINCT project_name )
FROM lp_projects
WHERE datasource_id = <current>;


SELECT COUNT( DISTINCT proj_unixname )
FROM rf_projects
WHERE datasource_id= <current>;


SELECT COUNT( DISTINCT proj_num )
FROM fsf_projects
WHERE datasource_id= <current>;


SELECT COUNT( DISTINCT project_name )
FROM sv_projects
WHERE datasource_id= <current>;


SELECT COUNT( DISTINCT unixname )
FROM tg_projects
WHERE datasource_id = <current>;


SELECT COUNT( DISTINCT proj_unixname )
FROM ow_projects
WHERE datasource_id= <current>;

May 2011 Data Released

May 2011 data has been released to Google Code and uploaded into Data Central at Teragrid.

Datasources:
263 2011-Mar UDD bugfix replaces 262
264 2011-Mar UDD bugfix replaces 263
265 2011-May UDD May 2011 UDD donation
266 Rubyforge 2011-May Rubyforge 2011-May
267 Objectweb 2011-May Objectweb 2011-May
268 FSF 2011-May Free Software Foundation 2011-May
269 Savannah 2011-May Savannah 2011-May
270 2011-May FM May 2011 Freshmeat

Status of other collectors:
Launchpad - parsing problem
Tigris - mailing list collector problem
Github - collection problem
Google Code - still running (it will be about a month until these are out)

Link to FLOSSmole files on Google Code
Link to instructions for how to access FLOSSmole db at Teragrid

January file releases

Just released data files for the following forges. You can head over to the FLOSSmole data downloads page at Google Code to download any of these files, or wait for them to be released to the Teragrid for live querying (shortly!)

datasource_id, forge_id, abbreviation, name
237 2 FM Freshmeat
238 3 RF Rubyforge
239 4 OW ObjectWeb
240 5 FSF Free Software Foundation
241 10 SV Savannah
243 12 GC Google Code
244 13 TG Tigris

Still running...
245 14 LP Launchpad

Broken...
242 11 GH Github

May 2010 Data released

May 2010 data is released for some forges.

-Freshmeat (datasource 218)
-Rubyforge (datasource 219)
-ObjectWeb (datasource 220)
-Free Software Fntn (datasource 221)
-Google Code (datasource 222) - list of projects only

Our collectors for Savannah, Sourceforge, Github, Tigris, Launchpad are all undergoing maintenance at the moment.

UPDATE May 28, 2010
-Savannah data has been released (datasource 224)

Link to download the FLOSSmole data on Google Code.

February 2010 Data Released

Lots of new data for you to peruse out on our FLOSSmole Data Downloads Page.

Here's what's out there, recently added:

Google Code, March 2010 (GC) - list of all GC projects donated by Audris Mockus (HUGE THANK YOU TO AUDRIS FOR THIS!!)
Freshmeat, February 2010 (FM)
Objectweb, February 2010 (OW)
Rubyforge, February 2010 (RF)
Github, February 2010 (GH)
Free Software Foundation, February 2010 (FSF)
Savannah, February 2010 (SV)
and Sourceforge from December 2009 (SF)

We have another set of bugs to fix with Sourceforge collection this year, 2010, but those are forthcoming. I'm running a collection now. Hopefully the data will be good. We may even have stats this time. Hallelujah.

Also, thanks to my phenomenal undergraduate superstar Steven Norris, Tigris is coming soon!! and Debian after that. We are rocking the repository collection...

December 2009 data released

December data has been released for the following forges:

(datasource-abbreviation-full name)
200-fm-freshmeat
201-rf-rubyforge
202-ow-objectweb
203-fsf-free software foundation
204-sv-savannah
205-gh-github

Sourceforge is in progress... it will be datasource_id=206.

Get the data here:
http://code.google.com/p/flossmole/downloads/list

Remember that the files marked "DM" are SQL files (mysql) but the files marked .txt are flat text files (delimited)

November 2009 data released

This month we have data from Freshmeat, Rubyforge, Objectweb, Savannah, Github, Free Software Foundation.

Downloads available at Google Code

Remember, the SQL is available in the datamart*.sql.bz files, the flat (delimited) data is available in the other files.

We're still working on getting our Sourceforge scraper back up and running, and we thank you for your patience.

Syndicate content