warning: Creating default object from empty value in /var/www/drupal/modules/taxonomy/taxonomy.pages.inc on line 33.

free software foundation

FSF Network of Required Projects, July 2012

FSF allows each project to state what other packages are required by this project.

Here is the SQL code to generate the data showing which software packages are most commonly required by other projects.

Notes: (1)We also have data on the TYPE of requirement (use requirement, build requirement, etc), but this column is not used in this query. (2) The required project does not have to be listed on the FSF directory to be included in the list. (For example, Perl is the most commonly required package, but this is not a FSF-directory listed project.)

SELECT requirement, count( * )
FROM fsf_project_requirements
WHERE datasource_id =319
GROUP BY 1
ORDER BY 2 DESC

FSF Network of Related Projects, July 2012

The Free Software Foundation directory (FSF) allows each project to list other FSF projects that are related to it.

We could construct a social network showing what projects are related. Here is the SQL code needed to make the list of what projects are related to what other projects:

SELECT pr.proj_num, p.proj_unixname, pr.related_project_name
FROM fsf_project_related pr
INNER JOIN fsf_projects p ON p.proj_num = pr.proj_num
WHERE p.datasource_id=[current datasource_id]
AND pr.datasource_id=[current datasource_id];

You can also just see the most commonly related projects by counting as follows:

SELECT related_project_name, count( * )
FROM `fsf_project_related`
WHERE datasource_id = [current datasource_id]
GROUP BY 1
ORDER BY 2 DESC

FSF Count of Projects by License, July 2012

FSF requires projects to list a license under which the project is released. This chart shows the top 25 licenses. Note that there is an "Other" category (211 projects) that includes ALL other licenses AND projects that chose "Other" as their actual license choice. This chart was generated using the licenses listed for projects in July 2012 (datasource_id=319).

SQL Code to generate the data set to build this chart:

SELECT license, count(*)
FROM `fsf_project_licenses`
WHERE datasource_id=[current data source]
GROUP BY 1
ORDER BY 2 DESC;

FSF release dates, by month and year (as of July 2012)

The Free Software Foundation directory lists free software projects and pertinent information about those projects.

One of the items we collect is "released on" date. This information refers to the last release date for the software, as reported in the FSF Directory.

This graph shows the aggregated release dates for the 6855 software packages listed in the FSF directory during the month of July 2012 (datasource_id=319).

Code to collect the number of projects:
SELECT count( * )
FROM `fsf_projects`
WHERE datasource_id = [current datasource];

Code to collect data to fill the chart:
SELECT MONTH( released_on ) , YEAR( released_on ) , COUNT( DISTINCT proj_num )
FROM fsf_projects
WHERE datasource_id =[current datasource_id]
GROUP BY 2 , 1
ORDER BY 2 , 1;

What packages are required most by Free Software Foundation projects? (07-2011)

Description

This chart shows the packages that are required most frequently in FSF projects. There is a total of 8,024 requirements.

Visualization



SQL Script

SELECT requirement AS Requirement, COUNT(proj_num) AS Count
FROM fsf_project_requirements
WHERE datasource_id = <current>
GROUP BY Requirement
ORDER BY Count DESC;

How many projects of each team size are listed in Free Software Foundation? (05-2011)

Description

This chart shows the number of projects of each team size listed in Free Software Foundation.

Visualization

Projects listed as having 0 developers were disregarded (57 projects).

SQL Script

SELECT DISTINCT calc_dev_count, COUNT( DISTINCT proj_num ) AS count
FROM fsf_projects
WHERE datasource_id= <current>
GROUP BY calc_dev_count
ORDER BY count DESC , calc_dev_count;

How many projects at each repository share identical short project names? (05-2011)

Description

This chart shows the number of projects at each repository that share project names as of May 2011.

Visualization

project-names

SQL Script

RF-FM

SELECT COUNT( f.projectname_short_fixed )
FROM fm_projects f, rf_projects r
WHERE f.projectname_short_fixed = r.proj_unixname
AND f.datasource_id = <current>
AND r.datasource_id = <current>;

RF-OW

SELECT COUNT( r.proj_unixname )
FROM rf_projects r, ow_projects o
WHERE r.proj_unixname = o.proj_unixname
AND r.datasource_id = <current>
AND o.datasource_id = <current>;

FM-OW

SELECT COUNT( f.projectname_short_fixed )
FROM fm_projects f, ow_projects o
WHERE f.projectname_short_fixed = o.proj_unixname
AND f.datasource_id = <current>
AND o.datasource_id = <current>;

RF-FSF

SELECT COUNT( f.proj_unixname )
FROM fsf_projects f, rf_projects r
WHERE f.proj_unixname = r.proj_unixname
AND f.datasource_id = <current>
AND r.datasource_id = <current>;

FM-FSF

SELECT COUNT( f.proj_unixname )
FROM fsf_projects f, fm_projects fm
WHERE f.proj_unixname = fm.projectname_short_fixed
AND f.datasource_id = <current>
AND fm.datasource_id = <current>;

OW-FSF

SELECT COUNT( f.proj_unixname )
FROM fsf_projects f, ow_projects o
WHERE f.proj_unixname = o.proj_unixname
AND f.datasource_id = <current>
AND o.datasource_id = <current>;

How many projects are listed in each repository? (05-2011)

Description

This chart shows the number of projects that FLOSSmole most recently collected from each repository.

Visualization

Project Count Chart


SQL Script


SELECT COUNT( DISTINCT proj_name )
FROM gc_projects
WHERE datasource_id = <current>;


SELECT COUNT( DISTINCT project_id )
FROM fm_projects
WHERE datasource_id = <current>;


SELECT COUNT( DISTINCT project_name )
FROM lp_projects
WHERE datasource_id = <current>;


SELECT COUNT( DISTINCT proj_unixname )
FROM rf_projects
WHERE datasource_id= <current>;


SELECT COUNT( DISTINCT proj_num )
FROM fsf_projects
WHERE datasource_id= <current>;


SELECT COUNT( DISTINCT project_name )
FROM sv_projects
WHERE datasource_id= <current>;


SELECT COUNT( DISTINCT unixname )
FROM tg_projects
WHERE datasource_id = <current>;


SELECT COUNT( DISTINCT proj_unixname )
FROM ow_projects
WHERE datasource_id= <current>;

May 2011 Data Released

May 2011 data has been released to Google Code and uploaded into Data Central at Teragrid.

Datasources:
263 2011-Mar UDD bugfix replaces 262
264 2011-Mar UDD bugfix replaces 263
265 2011-May UDD May 2011 UDD donation
266 Rubyforge 2011-May Rubyforge 2011-May
267 Objectweb 2011-May Objectweb 2011-May
268 FSF 2011-May Free Software Foundation 2011-May
269 Savannah 2011-May Savannah 2011-May
270 2011-May FM May 2011 Freshmeat

Status of other collectors:
Launchpad - parsing problem
Tigris - mailing list collector problem
Github - collection problem
Google Code - still running (it will be about a month until these are out)

Link to FLOSSmole files on Google Code
Link to instructions for how to access FLOSSmole db at Teragrid

January file releases

Just released data files for the following forges. You can head over to the FLOSSmole data downloads page at Google Code to download any of these files, or wait for them to be released to the Teragrid for live querying (shortly!)

datasource_id, forge_id, abbreviation, name
237 2 FM Freshmeat
238 3 RF Rubyforge
239 4 OW ObjectWeb
240 5 FSF Free Software Foundation
241 10 SV Savannah
243 12 GC Google Code
244 13 TG Tigris

Still running...
245 14 LP Launchpad

Broken...
242 11 GH Github

Syndicate content