megan's blog

August 2012 data released

Data files have been released for August 2012. Go check it out on our Google Code downloads page or sign up for direct database access.

Special release notes:
--The Google Code, Github, and Launchpad collections are not included this month.

Data Resources: 

New Visualizations, Summer 2012

The list of examples and visualizations has been updated.

New stuff INCLUDING....

July 2012 data released

Data files have been released for July 2012. Go check it out on our Google Code downloads page or sign up for direct database access.

Data Resources: 

May 2012 data releases

Data files have been released for May 2012. Go check it out on our Google Code downloads page.

Re-writes:
--Free Software Foundation has been re-written from scratch to match their new layout.
--Google Code collector has been re-written to fix a few bugs (still running)
--Launchpad has been finished and will be re-written for June to fix bugs
--Alioth is being re-written to fix bugs

Data Resources: 

Student work using FLOSSmole data

I often have my students tackle FLOSSmole data as a way of learning more about FLOSS, databases, data visualization, etc.

Here is an example of one of the graphs my students worked on last week, using Freecode data in FLOSSmole, R, and Illustrator.

January 2012 Freecode data set is available here.

Data Resources: 
Tags: 

February Github data released

February data has been released for Github.

Get the data here from our Google Code downloads page or request direct database access here.

Included with Github data are the following values:
project name
developer name
description
private yes/no
fork number
homepage
number of watchers
open issues
...and all the xml values that these fields are based on!

Have fun!

Data Resources: 
Tags: 

February Google Code data released

Google Code data has been released for January/February 2012.

Get the data here from our Google Code downloads page or request direct database access here.

Be aware that there is one open bug for Google Code collection that may affect your use of this data.

Data Resources: 
Tags: 

January 2012 releases

We're cruising ahead with January 2012 releases. Grab the data from Google Code site or from the teragrid.

Freecode - done (formerly known as Freshmeat)
Savannah - done
Tigris - done
Rubyforge - done
Objectweb - done
Launchpad - done

Data Resources: 

Google Code data available

Google Code is our longest data collection effort each month. We've collected everything for November and posted it for your data mining pleasure. Get the files or access it on the Teragrid with direct database access (datasource_id=285).

Data Resources: 

Freshmeat becomes Freecode, and how our data is affected

Three things happened recently to affect our Freshmeat collection

1. Freshmeat announced a name change to Freecode.
2. We have an issue (issue #43) that talks about how the trove definitions for Freshmeat are out of date.
3. Freshmeat replaced trove with tagging and we missed the memo

What I've done is as follows:

For issue #1 - decided not to rename our abbreviation for Freshmeat. It will remain "FM".

Pages