About

FLOSSmole aims to:

  • freely provide data about open source projects in multiple formats for anyone to download
  • integrate donated data from other research teams
  • provide some tools so that you can gather your own data
  • provide a community for researchers to discuss public data about open source software development

FLOSSmole contains:

  • 300 GB of data covering the period 2004-now, and growing
  • data sets from over 200 web spidering operations, and growing each month
  • data about more than 200,000 different open source projects and their developers

Citation & Conditions of Use
Here's how to cite FLOSSmole data:
Howison, J., Conklin, M., & Crowston, K. (2006). FLOSSmole: A collaborative repository for FLOSS research data and analyses. International Journal of Information Technology and Web Engineering, 1(3), 17–26.

All original data is copyright of its owners.

  1. If you use the data, please cite the source as shown above.
  2. Please be ethical in your use of this data. You may wish to consult the publication Ethical decision-making and Internet research: Recommendations from the AOIR ethics working committee.
  3. Please be aware that your use of this data for research may require approval by your company's or institution's IRB (Institutional Review Board).