sourceforge

February 2010 Data Released

Lots of new data for you to peruse out on our FLOSSmole Data Downloads Page.

Here's what's out there, recently added:

Google Code, March 2010 (GC) - list of all GC projects donated by Audris Mockus (HUGE THANK YOU TO AUDRIS FOR THIS!!)
Freshmeat, February 2010 (FM)
Objectweb, February 2010 (OW)
Rubyforge, February 2010 (RF)
Github, February 2010 (GH)
Free Software Foundation, February 2010 (FSF)
Savannah, February 2010 (SV)
and Sourceforge from December 2009 (SF)

Data Resources: 

December Sourceforge Data released

After long delay, the December Sourceforge data has been released. You may recall that over summer 2009, SF redesigned their web site which broke many of our crawlers and all of our parsers.

We have re-written these, and with only a few exceptions, have pretty much the same data as we always had.

Here are some release notes:

Data Resources: 

As of June 2009, what are the top programming languages used by projects listed in Sourceforge?

Description

This chart shows the top programming languages used by projects in Sourceforge.

Visualization

Sourceforge Programming Language Chart

Data Resources: 

As of June 2009, what are the top operating systems used by projects listed in Sourceforge?

Description

This chart shows the top operating systems used by projects in Sourceforge.

Visualization

Sourceforge Operating System Chart

Data Resources: 

As of June 2009, how many projects of each team size are listed in Sourceforge?

Description

This chart shows the number of projects of each team size listed in Sourceforge.

Visualization

Projects listed as having NULL or 0 developers were disregarded (1432 and 1478 projects, respectively).

Sourceforge Developer Count Chart

Data Resources: 

As of June 2009, how has the use "Free" and "Open" in project names grown by year?

Description

This chart shows the number of new projects in each repository that use the words "Free" and "Open" in project names. (We ran the queries to make this chart in June. This means 2009 was not yet completed, so this explains the apparent drop-off for the 2009 numbers.)

Visualization

Freshmeat Free & Open Count Chart

Data Resources: 

As of June 2009, how many projects at each repository share URL's?

Description

This chart shows the number of projects at each repository that share URL's.

Visualization

Number of Projects at each Repository that List a Home Page at Another Repository

Data Resources: 

As of June 2009, how many projects at each repository share identical short project names?

Description

This chart shows the number of projects at each repository that share project names.

Visualization

Number of Projects at each Repository that Share an Identical Short Project Name

Data Resources: 

As of June 2009, how have projects in each repository grown by year?

Description

This chart shows the number of NEW projects added to each repository by year.

Visualization

Project Growth Chart

SQL Script

Data Resources: 

As of June 2009, how many projects are listed in each repository?

Description

This chart shows the number of projects that FLOSSmole most recently collected from each repository.

Visualization

Project Count Chart

SQL Script

SELECT count(DISTINCT proj_unixname) FROM projects
WHERE datasource_id= <current>;

Data Resources: 

Pages