megan's blog

Bad August 2008 file

The August 2008 SF file released to Google Code was corrupted, so anyone who downloaded this file only got 90k projects. Please download the new file here. It is called "Datamarts: SF Other, Aug 2008, New, v.1"

Direct DB access for FLOSSmole collection available

Hello moles,

I'm excited to give you all a heads up that the entire flossmole database is now available directly via a MySQL server.

We have transferred the database to the NSF TeraGrid Data Central hosting site [1] (based at the San Diego Supercomputing centre). It's a bigger machine and professionally administered, which was much better than we could offer ourselves. See below for access procedure.

The process of transferring the database also enabled us to prepare comprehensive datamarts for each datasource in the database. These are mysqldump files which can be used for local access to parts of the database; there are two for each datasource, one containing the raw html pages and one, substantially smaller, containing just the parsed data points. These will be available shortly and will be an option for those who want to install a local copy of the DB; although we'd be very interested in reasons people find to do that, we'd like to have people sharing useful transformations of the data and the Data Central database should be pretty quick.

So now we have three great options for accessing the FLOSSmole data:

1. The traditional monthly flat files
2. Direct MySQL access to the full database @ DC.
3. Comprehensive datamarts for local access

Database access further info

August data released

Hello moles!

Lots of exciting news this month:

1. All releases are up on Google Code instead of SF this month. Go to the FLOSSmole downloads page on Google Code to pick up this month's files.

2. Sourceforge stats server is down, so no stats this month.

3. Debian is back! The data is minimal, but check out what we do have and enjoy that.

OSS Watch in Oxford

I am at Oxford in the UK (staying at Hertford College Graduate Center) for a few days for the OSS Watch workshop on Profiling Open Source Communities.

OSS Watch is the National Advisory Service on open source for UK Further Education (FE) and Higher Education (HE). As such, it is part of our remit to help FE and HE institutions and projects who want to engage with open source development, and a key factor for that is the development of open source communities.


I'm giving The Standard FLOSSmole Talk today at 11:45GMT.

July data released

Well moles, we were hoping to move to Google Code over the summer to host our file releases, but unfortunately, they have some size (and other?) limitations which are messing us up. They have no problem storing certain files, but others produce errors when I try to upload them. They are rejecting more and more of our files. I've put in a ticket to see if they can bump up the quota over there, but in the meantime, I'm back to putting files up on Sourceforge Downloads page.

June SF data posted

June Sourceforge data has been posted.

This time, I've released the files onto our Google Code project page. Let's see how we like using Google Code for this (psst, I can tell you as the person who does the file releases: it's a lot easier to use Google Code than Sourceforge for this particular part of the job!)

Enjoy!

UPDATE: Because of file size limits on Google Code, I've had to re-release our code onto Sourceforge.

May 2008 data released

Moles, the May 2008 data has been released. Find it all at our SF project page. Forges included this month are: FM, RF, OW.

April 2008 data released

Hi moles! April 2008 data has been released on our SF project page. Enjoy it! (Debian data included!)

UPDATE 08-JUL-2008: new files released to get past problem with data quality on these April files.

February data released

Go to our file release page on Sourceforge and pick up all the latest files.

Included this month: SF, FM, RF, OW flat files and data marts (sql statements).

(Debian is on the way! I'll update here as soon as it's ready.)

Enjoy!

February data released

Go to our file release page on Sourceforge and pick up all the latest files.

Included this month: SF, FM, RF, OW flat files and data marts (sql statements).

(Debian is on the way! I'll update here as soon as it's ready.)

Enjoy!

Pages