June Data: Google Code, Launchpad, Github

Summer is a beautiful thing. Moles, we've got a huge Google Code release for you (ds=271), and the re-vamped Launchpad (ds=272), and also Github (ds=273).

Get your FRESH June data on our Google Code Downloads Page or LIVE on the Teragrid.

Tigris is fixed and is running right now. We're also writing a new collector for Alioth! Lots of new stuff.

Data Resources: 

May 2011 Data Released

May 2011 data has been released to Google Code and uploaded into Data Central at Teragrid.

263 2011-Mar UDD bugfix replaces 262
264 2011-Mar UDD bugfix replaces 263
265 2011-May UDD May 2011 UDD donation
266 Rubyforge 2011-May Rubyforge 2011-May
267 Objectweb 2011-May Objectweb 2011-May
268 FSF 2011-May Free Software Foundation 2011-May
269 Savannah 2011-May Savannah 2011-May
270 2011-May FM May 2011 Freshmeat

Debian data, Ultimate Debian Database

Hello moles. A quick update on the Debian collections.

I told you earlier that we'd been collecting some Debian data and calculating software engineering metrics for each C/C++ package, and providing that data on both the raw data downloads page and in the database at Teragrid.


Bug on Google File Download page

Heads up Moles - If you've been searching our very, very long file list on the Google Code site, you might have noticed that "search" acts strangely over there. (Strange that Google Code would have search issues...but anyway...)

Today I turned in bug #5211, for some odd behavior in the way search results are returned for (in this case) the keyword "Debian".


Mozilla uses metrics

Here is an interesting post about how Mozilla is beginning to study itself using metrics gathered about contributors and contributions over time. They create charts and tables about patch rates and the like.


April 2011 releases

Here is the current list of files heading up to our downloads page.

ObjectWeb, datasource 257
Free Software Foundation, datasource 258
Freshmeat, 255
Rubyforge, 256
Savannah, 259
Debian Metrics (January), 254
Debian Metrics (March), 261

DOI will be generated following the release of all files.
Everything is backed up to Teragrid if you prefer direct DB access.

March data for Google Code posted

Here is the March 2011 data for Google Code projects, available on our own GC page. Upload to Teragrid is happening now if you prefer direct db access.

The datasource is 252. Available data includes:

--basic project info for each project on Google Code
--links for each project
--people on each project (some hashed)
--blogs for each project
--labels for each project
--groups for each project

2011 database schema update

I've updated the database schema to show some of the newer forges and their tables. To open the .mwb file, use MySQL Workbench. The other file is just a PNG of the same information.

New data for March 2011

Most of the March data has been released to our page on Google Code. Included forge collections are: Free Software Foundation, Freshmeat, Rubyforge, Objectweb, Savannah, Tigris. Google Code is still running. Github and Launchpad are not functional right now (waiting on a bug fixes).

Data Resources: 

Jan/Feb 2011 data uploaded to Teragrid

I've backed up the Jan/Feb data to Teragrid for your live queries. Be sure to log in there and use your database querying tool of choice to check out the data. (If you need an account, read these instructions for how to get yourself an account.)

The datasource_id information is as follows:

237 FM-Freshmeat
238 RF-Rubyforge
239 OW-ObjectWeb
240 FSF-FreeSoftwareFndtn
241 SV-Savannah
243 GC-GoogleCode
244 TG-Tigris
246 - Debian metrics

Data Resources: