Submitted by megan on July 23, 2007 - 3:15am
Just wanted to let you guys know about a bug fix on the status pages for the SF data. Each project has a "status" (i.e. beta, alpha, production, etc). We were under the impression that each project only had a single status (we assumed this represented the project's "current" status), but this turned out not to be the case.
Our code was therefore erroneously grabbing only the first of a possible list of status codes. Consequently, some projects that had multiple status codes were not shown correctly.
Submitted by megan on July 5, 2007 - 2:27am
I collected some debian package data and started parsing it to see what kind of stuff we might find in there.
I will probably need some help from the user community on this one, to know what sort of data you find interesting in these packages.
Here are the files I collected:
Obviously there is a lot of information there, and I only parsed some of it out for this initial run. Here are the items I parsed and released:
- package name, version, parent directory
Submitted by megan on July 3, 2007 - 4:05am
July data is out for Freshmeat, Rubyforge, Objectweb, Free Software Foundation.
Go get it!
Submitted by megan on June 5, 2007 - 2:46pm
June data has been released for all forges.
Head over to the project page on Sourceforge and gather all the data you need!
Submitted by megan on May 3, 2007 - 3:36am
May 2007 data is released for the small forges. (Reminder that Sourceforge data is next scheduled for a release in June.)
As usual, there are 3 ways to get FLOSSmole data:
(1)
Flat files (includes May 2007 data, plus historical data if you wish)
(2)
Get the data marts(3) Browse results of
common queries
Submitted by megan on April 12, 2007 - 6:11pm
By popular demand of our user base (and some hard work by our developers, especially ruphus_13), we now provide data marts for Sourceforge data.
The new package, called
DataMarts contains all the SQL create and insert statements for creating your own version of the FLOSSmole database - for multiple data sources (Sourceforge, Freshmeat, Rubyforge, ObjectWeb, FSF)
The marts are created following each of our data collections; we collect and parse the data as usual. We then load it into our database as usual, and create the raw flat-file data dumps as we have been doing since 2004. The new feature we are announcing today is that we also now provide the SQL data dumps so you can auto-load our data into your own local database for easier processing and more complex mining tasks.
So, there are now numerous ways to get our data:
--install the
data marts into your own mysql database
--download and analyze the
flat, delimited data files--play around with the
query tool
Submitted by megan on April 5, 2007 - 8:30pm
April 2007 data is released for all forges. Here is a summary of the data we have and where to get it:
- Sourceforge data
- General Forge Information(Get it)
- Project code names, project display names, developer counts, date project was registered, long project descriptions
- Developer Information(Get it)
- Developer login names, real names, developers-per-project and what role they have on that project, are they an admin?
- Data about Projects(Get it)
- Database type by project, number of downloads per project, rank of project, intended audience, topic of project, status of project, license(s), operating system(s), programming language(s), real URL of project, tracker data, donors to projects, user interfaces
- Freshmeat data (Get it)
Submitted by megan on March 27, 2007 - 4:53am
Check out the
New Query Tool for running common, pre-defined canned queries. (Thanks Gregg!)
The old query tool is still available. We'll be adding real-time graphing and some more bells and whistles to the new tool as time is available.
Submitted by megan on March 4, 2007 - 3:00am
March data is released for the following sources (forges/directories/repositories):
--Freshmeat
--Rubyforge
--ObjectWeb
--Free Software Foundation
--SourceKibitzer
Get the data from our Sourceforge file release pageEnjoy!
(The April release will include Sourceforge and the other 5 forges.)
Submitted by megan on March 1, 2007 - 4:25am
Great news moles, we have a new donation partner:
Source Kibitzer.
The facts:
--In our system, SourceKibitzer is forge #6, and has the abbreviation "SK".
--SK will be part of the monthly data cycle, so expect new SK files once per month (just like Freshmeat, Rubyforge, Objectweb, and Free Software Fndn.)
--SK files are available on
our file releases page on Sourceforge.
Pages