Bug on Google File Download page

Heads up Moles - If you've been searching our very, very long file list on the Google Code site, you might have noticed that "search" acts strangely over there. (Strange that Google Code would have search issues...but anyway...)

Today I turned in bug #5211, for some odd behavior in the way search results are returned for (in this case) the keyword "Debian".

Hopefully they'll get back to me soon; I hope this is not due to our very large number of files. That has broken our page at Google Code before.

UPDATE: Fixed. It was something about re-indexing our list.

Mozilla uses metrics

Here is an interesting post about how Mozilla is beginning to study itself using metrics gathered about contributors and contributions over time. They create charts and tables about patch rates and the like.

mozilla project
image link goes to original article

April 2011 releases

Here is the current list of files heading up to our downloads page.

ObjectWeb, datasource 257
Free Software Foundation, datasource 258
Freshmeat, 255
Rubyforge, 256
Savannah, 259
Debian Metrics (January), 254
Debian Metrics (March), 261

DOI will be generated following the release of all files.
Everything is backed up to Teragrid if you prefer direct DB access.

March data for Google Code posted

Here is the March 2011 data for Google Code projects, available on our own GC page. Upload to Teragrid is happening now if you prefer direct db access.

The datasource is 252. Available data includes:

--basic project info for each project on Google Code
--links for each project
--people on each project (some hashed)
--blogs for each project
--labels for each project
--groups for each project

2011 database schema update

I've updated the database schema to show some of the newer forges and their tables. To open the .mwb file, use MySQL Workbench. The other file is just a PNG of the same information.

New data for March 2011

Most of the March data has been released to our page on Google Code. Included forge collections are: Free Software Foundation, Freshmeat, Rubyforge, Objectweb, Savannah, Tigris. Google Code is still running. Github and Launchpad are not functional right now (waiting on a bug fixes).

There are two ways to get the data:
You can download the data at our downloads page - the flat files are so marked, and the SQL files are marked "datamarts". Note that datamarts only contain the latest collection. If you want previous months' worth of data, you'll have to grab those datamarts too.

You can also log into our database on the Teragrid and live-query the data. Read these instructions on getting a login.

Have fun!

Jan/Feb 2011 data uploaded to Teragrid

I've backed up the Jan/Feb data to Teragrid for your live queries. Be sure to log in there and use your database querying tool of choice to check out the data. (If you need an account, read these instructions for how to get yourself an account.)

The datasource_id information is as follows:

237 FM-Freshmeat
238 RF-Rubyforge
239 OW-ObjectWeb
240 FSF-FreeSoftwareFndtn
241 SV-Savannah
243 GC-GoogleCode
244 TG-Tigris
246 - Debian metrics


Debian metrics data released

One of the undergraduate students working on this project, Carter Kozak, recently collected all of the Debian package data for January and parsed it for relevant software engineering metrics. He concentrated on C/C++ code. He also integrated some Debian metadata, such as popcon (popularity contest) and sources.gz. He has written up his findings in a paper (as yet unpublished, but just you wait!) and has donated his data to FLOSSmole. You can find the data on our FLOSSmole data downloads page on Google.

January file releases

Just released data files for the following forges. You can head over to the FLOSSmole data downloads page at Google Code to download any of these files, or wait for them to be released to the Teragrid for live querying (shortly!)

datasource_id, forge_id, abbreviation, name
237 2 FM Freshmeat
238 3 RF Rubyforge
239 4 OW ObjectWeb
240 5 FSF Free Software Foundation
241 10 SV Savannah
243 12 GC Google Code
244 13 TG Tigris

Still running...
245 14 LP Launchpad

242 11 GH Github

Adding 1000 data files to Google Code

I've got about 1000 files that were hosted on Sourceforge (still are) but I'm trying to move all our files into one place. I am running scripts all day to d/l these from SF, relabel them, and move them to Google Code.

If you see old files showing up at Google Code, that's why! Don't forget that you can use the search box there if you are looking for a specific file or topic. Also, send email to the mailing list if you can't find something you're looking for and I'll help you out.

UPDATE: this action apparently broke the Google Code files download page for our project. I've submitted a bug report.

Syndicate content