Submitted by megan on February 9, 2006 - 5:53pm
...while we get February's data parsed and loaded.
These graphs showing December 2005 trends were made by FLOSSmole's
newest developer.
Most connected developers - I really like this chart because it shows that the
most connected developer on Sourceforge (i.e. member of the most projects) is a graphic designer! How cool is that? This makes perfect sense when you think about it for a second, but it wouldn't have been MY first guess.
Here are some older charts, similar to things we have run before - these graphs show the kinds of reports you can run using
FLOSSmole data:
Database Environments
Submitted by megan on January 7, 2006 - 4:24am
We got a request for Sourceforge project descriptions. These are the little paragraphs that the project owners write to describe a given project. I've parsed out the descriptions and put them in this
file release. Also, I created a new table called project_description to hold this information if you're using the
query tool.
Submitted by megan on January 3, 2006 - 3:09am
December and January Freshmeat files have been added as datasource_ids 14(Dec) and 15(Jan). Use the
Query Tool to explore the fm_* tables (these are the tables that hold the freshmeat data).
Submitted by megan on December 23, 2005 - 4:47am
We've run December 2005 Sourceforge data; the raw html has been stored as datasource_id #13 if you're using
the query tool, otherwise, text files are
over here at sourceforge on our project page.
We've got the usual stuff, all the Sourceforge project names, all project data, developer counts, who is working on what projects, what programming languages are being used, operating system counts, all that good stuff. Have fun!
Submitted by megan on November 22, 2005 - 5:01pm
Submitted by megan on October 7, 2005 - 2:31am
The SF and Freshmeat (surprise!) data collections for October are DONE. We had a 10-machine grid working to collect this time. Very speedy! We plan to move to collections on a 60-day rotation, rather than 90-days from here on out. This will match up nicely with the 60-day sourceforge stats interval also.
Also, we have a working prototype of our live query tool -thanks Dawid!- we're just waiting for the production environment to be set up and that will be available for you all to use.
Here is the master file list on our SF project page:
Master List of FLOSSmole Files, but we also have quicker links to:
Submitted by megan on September 17, 2005 - 2:56am
Happy Birthday, FLOSSmole (nee OSSmole). It was 1 year ago today that we started FLOSSmole project on Sourceforge. What a joyous occasion.
Submitted by megan on September 16, 2005 - 3:59am
The July raw developer data has finally been released. I had actually forgotten about it (oops). We had a problem in the way our spider collected the datasource_id=5 developers, so I had put off the release until I fixed that problem, and then I promptly forgot about the whole thing.
In any event,
the files are posted, so enjoy!
Submitted by megan on September 9, 2005 - 3:43pm
Coming up:
1. we'll be doing our next SF "run" in October
2. we're making a web interface for "live" read-only queries - this will satisfy our occasional middle-of-the-night wild hypotheses... "gee, I wonder if developers who program in python also program in java or perl... and where are all these ruby hackers coming from anyway...?" You've got questions, we've got answers.
Submitted by megan on August 6, 2005 - 1:34am
Since I've been at
Oscon this week, I've been delayed in posting my summary analyses. Nevertheless, for those who are waiting, I've posted some basic summaries in the directory below
Here is
a directory of excel files showing October 2004, January 2005, April 2005, and July 2005 Summary Data for projects on Sourceforge.
Remember,
raw data is also available!
Pages