sourceforge |
Lots of new data for you to peruse on our FLOSSmole Data Downloads Page. Here's what was recently added: Google Code, March 2010 (GC) - list of all GC projects, donated by Audris Mockus (HUGE THANK YOU TO AUDRIS FOR THIS!!) We have another set of bugs to fix in this year's (2010) Sourceforge collection, but those fixes are forthcoming. I'm running a collection now. Hopefully the data will be good. We may even have stats this time. Hallelujah. Also, thanks to my phenomenal undergraduate superstar Steven Norris, Tigris is coming soon, and Debian after that. We are rocking the repository collection... |
|||
After a long delay, the December Sourceforge data has been released. You may recall that over summer 2009, SF redesigned their web site, which broke many of our crawlers and all of our parsers. We have rewritten these and, with only a few exceptions, have pretty much the same data as we always had. Here are some release notes: 1. The Datasource_id=206 Files are located at our Google Code page: http://code.google.com/p/flossmole/downloads/list For those of you with database access on the sdsc server, I'll get these files over there ASAP. |
|||
Note on the visualization: projects listed as having NULL or 0 developers were disregarded (1432 and 1478 projects, respectively). |
|||
Description Visualization Number of Projects at each Repository that List a Home Page at Another Repository. Matching projects by URL has two possibilities: projects listed on different forges might both display the same external URL, or a project on one forge might list its site on a competing forge as the home page of record. The diagram depicts each forge/directory in FLOSSmole and how many of its projects list another forge as the actual hosting home page. For example, the topmost arrow shows 11,229 projects on Freshmeat that have Sourceforge listed as the home page. The arrows show the direction of the relationship (e.g. 11,229 Freshmeat projects show a home page on Sourceforge, but only 10 Sourceforge projects list a Freshmeat home page). Pairs of forges with no URLs in common do not show an arrow. (No Rubyforge projects list ObjectWeb URLs, and vice versa.) For more information on matching project names and URLs, see: SQL Script RF-SF RF-FM RF-OW FM-RF FM-SF FM-OW SF-FM SF-OW SF-RF OW-SF OW-RF OW-FM |
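The directional counts described above can be sketched in a few lines of Python. This is only an illustration, not the FLOSSmole SQL: the forge domain names, the `hosting_forge` and `count_cross_forge_home_pages` helpers, and the sample rows are all assumptions made up for the example.

```python
from collections import Counter
from urllib.parse import urlparse

# Assumed domain for each forge (illustrative, not taken from FLOSSmole).
FORGE_DOMAINS = {
    "SF": "sourceforge.net",
    "FM": "freshmeat.net",
    "RF": "rubyforge.org",
    "OW": "objectweb.org",
}

def hosting_forge(url):
    """Return the forge code whose domain hosts `url`, or None."""
    host = (urlparse(url).hostname or "").lower()
    for code, domain in FORGE_DOMAINS.items():
        if host == domain or host.endswith("." + domain):
            return code
    return None

def count_cross_forge_home_pages(projects):
    """Count (listing forge, home-page forge) pairs.

    `projects` is an iterable of (forge_code, home_page_url) tuples.
    Projects whose home page is on their own forge are skipped.
    """
    counts = Counter()
    for forge, url in projects:
        target = hosting_forge(url)
        if target is not None and target != forge:
            counts[(forge, target)] += 1
    return counts

# Made-up sample rows: (forge where the project is listed, its home page).
sample = [
    ("FM", "http://foo.sourceforge.net/"),
    ("FM", "http://sourceforge.net/projects/bar"),
    ("SF", "http://freshmeat.net/projects/baz"),
    ("RF", "http://example.org/"),
    ("SF", "http://qux.sourceforge.net/"),
]
cross_counts = count_cross_forge_home_pages(sample)
for (src, dst), n in sorted(cross_counts.items()):
    print(f"{src}-{dst}: {n}")
```

Each key is an ordered pair, so FM-SF and SF-FM are counted separately, which is what the arrows in the diagram express.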
|||
Description Visualization Number of Projects at each Repository that Share an Identical Short Project Name. This graph shows the number of short project names shared between each pair of forges. For instance, starfish is a project listed on both Sourceforge and Rubyforge. On Rubyforge, it is described as a "tool to make programming ridiculously easy", but on Sourceforge the starfish project is described as a password management application. There are 1367 projects with shared names on Rubyforge and Sourceforge. For more information on matching project names and URLs, see: SQL Script RF-SF RF-FM FM-SF SF-OW RF-OW FM-OW |
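As a rough sketch of the name matching described above, the shared-name count for each pair of forges is just the size of a set intersection. The `shared_name_counts` helper and the sample names are hypothetical (only starfish comes from the text), and exact, case-sensitive matching is an assumption.

```python
from itertools import combinations

def shared_name_counts(names_by_forge):
    """Return {(forge_a, forge_b): count of identical short names}.

    Matching is exact and case-sensitive (an assumption for this sketch).
    """
    name_sets = {forge: set(names) for forge, names in names_by_forge.items()}
    return {
        (a, b): len(name_sets[a] & name_sets[b])
        for a, b in combinations(sorted(name_sets), 2)
    }

# Made-up short names per forge, for illustration only.
sample_names = {
    "RF": ["starfish", "alpha", "beta"],
    "SF": ["starfish", "alpha", "gamma"],
    "FM": ["gamma", "delta"],
}
shared = shared_name_counts(sample_names)
for (a, b), n in shared.items():
    print(f"{a}-{b}: {n}")
```

Unlike the home page counts, this relationship is symmetric, so one entry per unordered pair is enough. A shared name does not imply the same project, as the starfish example shows.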
|||










