Two new data sets

Hi moles! I've got two new datasets for you to play with. These aren't perfect, but they're a start of a new type of dataset for FLOSSmole!

(1) Apache Roles: This dataset stores information about people affiliated with all the subprojects of the Apache Software Foundation, their roles, and what project they're working on with that role. Data sources include: Apache web site pages, board meeting minutes, etc. (Pre-Print on FLOSShub describing collection, curation, storage, sample queries)

Sample data:

Column Sample Row
Svn_id jsmith
Real_name John Smith
Project_name Apache Axiom
Role_on_project Committer
Organization BigCorp
Email jsmith@bigcorp.com
Web_site http://www.apache.org/~jsmith
Datasource_id 367
Details Appendix T

(2) Apache Twitter Screen Names: This dataset stores the verified twitter screen names of people affiliated with the Apache Software Foundation projects. Useful for matching to emails or source code commits, or to be used in tandem with the Apache roles dataset above. (Pre-print on FLOSShub describing collection, curation, storage, sample queries)

Sample data:

Column Sample Row
Svn_id jsmith
Twitter_screen_name jsmith
Real_name John Smith
Project_name Apache Cayenne
Details

Get the MySQL dumps on our FLOSSmole downloads page on Google Code or via direct database access.

Got a cool FLOSS-oriented dataset you want to share? If you wish to donate data to FLOSSmole, we can host it.

Data Resources: