megan's blog

October 2009 data released

October 2009 data has been released. Here are the forges we have this month:
Free Software Foundation directory
Savannah (new)
GitHub (new)

FLOSSmole Downloads

Sourceforge is undergoing a re-write, still, but we will be collecting again from there soon. In the meantime, don't forget that the June 2009 data is available, and also there is the Notre Dame data if you find that helps at all.


September 2009 data released

Data has been released for FSF, FM, RF, OW. Go get it!! Have fun.

Google Code Downloads Page

That Freshmeat data looks fairly popular. Anyone want to tell us how you use this data?

Savannah data available

Savannah data has been released for July. See what you think! (Datasource_id = 182)

July 2009 data

Hello moles, our July 2009 data has been released: this month we have Objectweb, Freshmeat, Rubyforge, Free Software Foundation directory.

Go to our Google Code pages to download the data.

The most recent datasource_ids are:

June data sets released

Hi moles, the June 2009 data sets are released.


Datamarts (sql files) and flat (delimited) files are located on our Google Code downloads area.

oss2009 requests, etc

Just back from OSS 2009 in Skövde, Sweden. (Finally figured out how to make the ö character on a mac: hit option-u, then o). Here are the requests I heard from sitting in talks, either for new forges, for features that FLOSSmole could provide, or just things that people were using/needing that might intersect with our mission here:

debian popularity contest
KDE's "10 years of data in an xml logfile"
"git" everything
developer skills from sourceforge
a wiki for common analyses, charts, graphs, SQL commands
a taxonomy of forges

May data, and April and May datamarts released

Go grab the May data, April & May datamarts from our Google code web site.

I'm backing up to Teragrid now, so Teragrid users, you should have a nice new set of data to play with RSN (real soon now).

April 2009 data released

Hi moles, April data has been released over to the google code site (all forges). I'll get that up on the Teragrid site ASAP. We have a few small problems with the scripts over there, nothing that is major but it will slow me down in getting the data uploaded over there.

In other news, we're getting a new backup system for the Elon servers in our continuing quest to keep the data safe. And I am pleased to announce that over the summer I will have another research student. She is going to be working specifically on user interface tools. Her name is Jamie and she's great. Look forward to a re-vamped web site, database schemas, and all that good "end user" type of stuff that I've been slack on providing up until now.

FOSSology grants

Hello moles,

Here is an RFP I received this past week. I've copied it here.

Request for PROPOSALS

Hewlett Packard, major sponsor of the FOSSology project
(, is accepting short proposals from academic
institutions involved in research in areas of interest to the
FOSSology project. FOSSology is an open source project dedicated to
analysis and mining of open source data and using the results for the
betterment of the free and open source software (FOSS) and FOSS

Interesting areas include but are not limited to:

- vulnerability analysis
- vulnerability tracking
- dependency analysis
- code reuse detection

Currently, the FOSSology project concentrates on software license
detection, but would like to expand into other areas. Feel free to
suggest your own area or propose something around the above topics

Grant amounts are in the range of $5,000 - $20,000 USD. The number of
grants awarded is dependent on the number and quality of proposals

Proposals should short (1-2 pages).

Grant recipients will be expected to communicate with other FOSSology
developers through the public mailing list and/or IRC channel. If

Feb and March data released

Sorry for the delay on February, but as a bonus I'll throw in the March data too!

Go to our Google Code downloads page and have fun.
Syndicate content