This will be more of a talk of tons of little lightning talks.
Interrupt me in the middle please as I don't think there's going to be a lot of room for questions at the end.
For each area, I'm going to give:
Note
3 minutes
There are researchers, mostly academic, who are interested in how software gets produced.
Many work in business schools and people focused on information technology management.
Many study Debian.
Note
4 minutes
Jesús M. González-Barahona, et al at the Universidad Rey Juan Carlos de Madrid has put together a bunchf of research with papers including:
I won't spend too much on it because Javier talked about this in depth on last Sunday.
The major question is one of statistic gathering, and more importantly, about how big and economically important is free software.
David Wheeler wrote a paper looking at Red Hat where he concluded it was "a gigabuck" but this group at Universidad Rey Juan Carlos has been demonstrating the much larger community in Debian and also a growing amount of information about how software is produced.
Research is primarily descriptive.
The swelling size of Debian is good for something. :)
Note
7 minutes
Sampling in Open Source Software Development: The case for using the Debian GNU/Linux Distribution by Sebastian Spaeth, Matthias Stuermer, Stefan Haefliger, Georg von Krogh (ETH Zurich):w
Similar to the previous argument, Spaeth et al asked:
How can we do useful samples of the free software community?
The usual answer is SourceForge (70,000 projects) which has lots of problems:
Primarily interested in "reuse" of components and the way that things might work.
We argue that a GNU/Linux distribution, such as Debian, is better suited for the sampling of projects because it avoids biases and contains unique information only available in an integrated environment. Especially research on the reuse of components can build on dependency information inherent in the Debian GNU/Linux packaging system.
Debian is useful because it:
Note
10 minutes
Martin Michlmayr (former DPL) has recently finished a thesis that used Debian as one of seven case studies:
Quality Improvement in Volunteer Free and Open Source Software Projects: Exploring the Impact of Release Management
Major questions asked:
Can volunteer teams, with their high volatility regarding project collaborators ensure consistent levels of quality in their output?
The dissertation:
Methodologically, the dissertation focuses primarily on qualitative data (e.g., interviews, "following projects over two years", mailing lists, documents, direct observation).
Results and conclusions:
Argues strongly for time-based releases, not feature based releases:
This dissertation has shown that feature based release management in FOSS projects is often associated with lack of planning, which leads to problems, such as delays and low levels of quality. Especially important for volunteers:
Time based releases are associated with two factors that act as important coordination mechanisms:
- Regularity: the production of releases according to a specific interval allows projects to create regular reference points which show contributors what kind of changes other members of the project have made. Regularity also contributes to familiarity with the release process, and it leads to more disciplined processes.
- Schedules: by using time rather than features as the orientation for a release, planning becomes possible in voluntary projects. Time based projects can create schedules which describes important deadlines and which contains dependency information between different work items and actors.
Note
14 minutes
Quality and the Reliance on Individuals in Free Software Projects
Martin Michlmayr and Benjamin Mako Hill
What are the problems and solutions associated with voluntary labor on free software projects?
Problems with single maintainership: people get busy, etc. NMU becomes stigmatized.
Problems with group maintainerships: lots of commits but no uploads.
Solutions: Situations like the currently uploader situation.
Note
17 minutes
Quantitative analysis of volunteers working in the Debian project...
They asked a variety of questions, I'll handle them one-by-one:
Number of maintainers: Growing by ~35% each year.
Team maintainership: From 14 (1.3%) to 600 (7.4%) packages in 6.5 years. Also impressive even when not including the Debian QA team.
Tracking individual maintainers: There were 216 contributions to 2.0. In 2004, only 121 (55%) remained. A rough half-life of 7.5 years seems to persist. People still involved maintain as many or even more packages. Burnout seems to be an on or off thing.
Maintainers who leave: More than 60% of packages are adopted. If a package falls out of release, it will be very likely that it will not be added again. Maturity model and determining which packages might fall apart is not clear.
Experience and importance: more experienced maintainers tend to maintain software that more important. New maintainers tend to maintain less widely used software.
Other results:
Debian is not adding maintainers as fast as it adds packages. The ratio of maintainers to packages is increasing which presents a potential problem.
Note
24 minutes
O'Mahony, S., Ferraro, F. (2004). Managing the Boundary of an “Open” Project. (March 30, 2004). Harvard NOM Working Paper No. 03-60.
Analysis in depth of the creation of the Debian new maintainer process.
They looked a lot at the factors that increased likelihood of becoming an AM and the likelihood that someone would become a developer who started or expressed interest in starting the process.
The greatest predictor they found for someone becoming an AM was the number of connections within the network (having five signatures increased the likelihood by 65%). Tenure had a negative effect. New members were more interested in NM and AM related issues.
In other words, new well connected formed the NM committee and make well connected a new requirement of participation -- encouraging people to be at least as connected as they were. The cycle repeats.
The process became increasingly and increasingly strict.
Note
27 minutes
Background:
Note
34 minutes
Evan Prodromou wrote up a summary of a long discussion on CC.
They approached us and asked to be part of the process.
The result: we got almost everything we wanted except (4), (7), and (8) which we ultimately decided were not real freedom issues.
Note
37 minutes
Don, Branden, Greg Pomerantz,and others were brought explicitly in the committee process. Probably more than any single project (except GNU).
This is precisely because Debian has a reputation of building up a strong ability to engage with issues of licensing critique and a strong committement to software freedom.
The FSF might not always agree but they have a deep respect for the Debian approach to these issues.
Debian called the AGPLv2 non-free because it involved barriers to modification. Through this critique and other decisions, a new version has been released that does not include barriers to modification at all.
Note
40 minutes
Debian and SPI have work and thinking done on an open use trademark license. While we don't know what we're doing, we're respected.
Note
40 minutes
When people go to create a distribution, it is usually based on Debian. There will be a derivers workshop tomorrow that talks about these things.
Debian is seems an important as:
Note
45 minutes