Blog

minor forges ready for December, inc. FSF

The December "minor" forges are ready for download from the FLOSSmole Google Code Project Page: FM, RF, OW. Enjoy!

Update: I've also released Free Software Foundation (FSF) for October and December. This parser has been a long time coming. But better late than never.

Problem with Public Areas file

We recently had an error in the Sourceforge Public Areas data dumps for October.

New files available for you to download. They are marked "2".

October developers

Better late than never on the October Developers and Project Developers for SourceForge:

Download these developer files here

Sorry about the .gz format instead of the .bz - for some reason I have problems using the Google Code auto-uploader with .bz files. It's a very strange problem, random and intermittent, hard to pin down why some .bz files will go and some will not. Anyway, I gzipped them and we are good to go.

October developers

Better late than never on the October Developers and Project Developers for SourceForge:

Download these developer files here

Sorry about the .gz format instead of the .bz - for some reason I have problems using the Google Code auto-uploader with .bz files. It's a very strange problem, random and intermittent, hard to pin down why some .bz files will go and some will not. Anyway, I gzipped them and we are good to go.

October data

The October data are available at the following locations:

Google Code Downloads Page

Notes:
  • SF Stats are unavailable this month because the server was not reliably reporting stats when we did our collection.
  • There were a few errors in the file uploads. These files can't be taken off of Google Code, so I've just marked the files "DO NOT DOWNLOAD". You can usually tell these files by their very tiny filesize.

Bad August 2008 file

The August 2008 SF file released to Google Code was corrupted, so anyone who downloaded this file only got 90k projects. Please download the new file here. It is called "Datamarts: SF Other, Aug 2008, New, v.1"

Direct DB access for FLOSSmole collection available

Hello moles,

I'm excited to give you all a heads up that the entire flossmole database is now available directly via a MySQL server.

We have transferred the database to the NSF TeraGrid Data Central hosting site [1] (based at the San Diego Supercomputing centre). It's a bigger machine and professionally administered, which was much better than we could offer ourselves. See below for access procedure.

The process of transferring the database also enabled us to prepare comprehensive datamarts for each datasource in the database. These are mysqldump files which can be used for local access to parts of the database; there are two for each datasource, one containing the raw html pages and one, substantially smaller, containing just the parsed data points. These will be available shortly and will be an option for those who want to install a local copy of the DB; although we'd be very interested in reasons people find to do that, we'd like to have people sharing useful transformations of the data and the Data Central database should be pretty quick.

So now we have three great options for accessing the FLOSSmole data:

1. The traditional monthly flat files
2. Direct MySQL access to the full database @ DC.
3. Comprehensive datamarts for local access

Database access further info

August data released

Hello moles!

Lots of exciting news this month:

1. All releases are up on Google Code instead of SF this month. Go to the FLOSSmole downloads page on Google Code to pick up this month's files.

2. Sourceforge stats server is down, so no stats this month.

3. Debian is back! The data is minimal, but check out what we do have and enjoy that.

OSS Watch in Oxford

I am at Oxford in the UK (staying at Hertford College Graduate Center) for a few days for the OSS Watch workshop on Profiling Open Source Communities.

OSS Watch is the National Advisory Service on open source for UK Further Education (FE) and Higher Education (HE). As such, it is part of our remit to help FE and HE institutions and projects who want to engage with open source development, and a key factor for that is the development of open source communities.


I'm giving The Standard FLOSSmole Talk today at 11:45GMT.

July data released

Well moles, we were hoping to move to Google Code over the summer to host our file releases, but unfortunately, they have some size (and other?) limitations which are messing us up. They have no problem storing certain files, but others produce errors when I try to upload them. They are rejecting more and more of our files. I've put in a ticket to see if they can bump up the quota over there, but in the meantime, I'm back to putting files up on Sourceforge Downloads page.

Pages