Two new data sets
Hi moles! I've got two new datasets for you to play with. These aren't perfect, but they're a start of a new type of dataset for FLOSSmole!
(1) Apache Roles: This dataset stores information about people affiliated with all the subprojects of the Apache Software Foundation, their roles, and what project they're working on with that role. Data sources include: Apache web site pages, board meeting minutes, etc. (Pre-Print on FLOSShub describing collection, curation, storage, sample queries)
Sample data:
Column | Sample Row |
Svn_id | jsmith |
Real_name | John Smith |
Project_name | Apache Axiom |
Role_on_project | Committer |
Organization | BigCorp |
jsmith@bigcorp.com | |
Web_site | http://www.apache.org/~jsmith |
Datasource_id | 367 |
Details | Appendix T |
(2) Apache Twitter Screen Names: This dataset stores the verified twitter screen names of people affiliated with the Apache Software Foundation projects. Useful for matching to emails or source code commits, or to be used in tandem with the Apache roles dataset above. (Pre-print on FLOSShub describing collection, curation, storage, sample queries)
Sample data:
Column | Sample Row |
Svn_id | jsmith |
Twitter_screen_name | jsmith |
Real_name | John Smith |
Project_name | Apache Cayenne |
Details |
Get the MySQL dumps on our FLOSSmole downloads page on Google Code or via direct database access.
Got a cool FLOSS-oriented dataset you want to share? If you wish to donate data to FLOSSmole, we can host it.
- megan's blog
- Log in to post comments