forked from software-tools-books/js4ds
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathcollab.tex
251 lines (214 loc) · 11 KB
/
collab.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
\chapter{Collaborating}\label{s:collab}
A project can survive badly-organized code;
none will survive for long if people are confused,
pulling in different directions,
or hostile.
This appendix therefore talks about what projects can do to make newcomers feel welcome
and to make things run smoothly after that.
It may seem strange to include this material in a tutorial on JavaScript,
but as Freeman pointed out in \cite{Free1972},
every group has a power structure;
the only question is whether it is formal and accountable
or informal and unaccountable.
Thirty-five years after the free software movement took on its modern, self-aware form,
its successes and failures have shown that if a project doesn't clearly state
who has the right to do what,
it will wind up being run by whoever argues loudest and longest.
For a much deeper discussion of these issues,
see \cite{Foge2005}.
\section{Licensing Software}\label{s:collab-software}
If the law or a publication agreement prevents people from reading your work or using your software,
you're probably hurting your own career.
You may need to do this in order to respect personal or commercial confidentiality,
but the first and most important rule of inclusivity
is to be open by default.
That is easier said than done,
not least because the law hasn't kept up with everyday practice.
\cite{Mori2012} and \hreffoot{http://www.astrobetter.com/blog/2014/03/10/the-whys-and-hows-of-licensing-scientific-code/}{this blog post}
are good starting points from a scientist's point of view,
while \cite{Lind2008} is a deeper dive for those who want details.
In brief,
creative works are automatically eligible for intellectual property (and thus copyright) protection.
This means that every creative work has some sort of license:
the only question is whether authors and users know what it is.
Every project should therefore include an explicit license.
This license should be chosen early:
if you don't set it up right at the start,
then each collaborator will hold copyright on their work
and will need to be asked for approval when a license \emph{is} chosen.
By convention,
the license is usually put in a file called \texttt{LICENSE} or \texttt{LICENSE.txt} in the project's root directory.
This file should clearly state the license(s) under which the content is being made available;
the plural is used because code, data, and text may be covered by different licenses.
\begin{aside}{Don't Write Your Own License}
Not even if you are a lawyer:
legalese is a highly technical language,
and words don't mean what you think they do.
\end{aside}
To make license selection as easy as possible,
GitHub allows you to select one of the most common licenses when creating a repository.
The Open Source Initiative maintains \hreffoot{http://opensource.org/licenses}{a list of licenses},
and \hreffoot{http://choosealicense.com/}{choosealicense.com} will help you find a license that suits your needs.
Some of the things you will need to think about are:
\begin{enumerate}
\item
Do you want to license the code at all?
\item
Is the content you are licensing source code?
\item
Do you require people distributing derivative works to also distribute their code?
\item
Do you want to address patent rights?
\item
Is your license compatible with the licenses of the software you depend on?
For example, as we will discuss below,
you can use MIT-licensed code in a GPL-licensed project but not vice versa.
\end{enumerate}
The two most popular licenses for software are
the \gref{g:mit-license}{MIT license} and the \gref{g:gpl}{GNU Public License} (GPL).
The MIT license (and its close sibling the BSD license)
say that people can do whatever they want to with the software as long as they cite the original source,
and that the authors accept no responsibility if things go wrong.
The GPL gives people similar rights,
but requires them to share their own work on the same terms:
\begin{quote}
You may copy, distribute and modify the software as long as you track changes/dates in source files.
Any modifications to or software including (via compiler) GPL-licensed code must also be made available under the GPL
along with build \& install instructions.
--- \hreffoot{https://tldrlegal.com/license/gnu-general-public-license-v3-(gpl-3)}{tl;dr}
\end{quote}
We recommend the MIT license:
it places the fewest restrictions on future action,
it can be made stricter later on,
and the last thirty years show that it's good enough to keep work open.
\section{Licensing Data and Documentation}\label{s:collab-datadocs}
The MIT license and the GPL apply to software.
When it comes to data and reports,
the most widely used family of licenses are those produced by \hreffoot{https://creativecommons.org/}{Creative Commons},
which have been written and checked by lawyers and are well understood by the community.
The most liberal license is referred to as \gref{g:cc-0}{CC-0},
where the ``0'' stands for ``zero restrictions''.
CC-0 puts work in the public domain,
i.e.,
allows anyone who wants to use it to do so however they want with no restrictions.
This is usually the best choice for data,
since it simplifies aggregate analysis.
For example,
if you choose a license for data that requires people to cite their source,
then anyone who uses that data in an analysis must cite you;
so must anyone who cites \emph{their} results,
and so on,
which quickly becomes unwieldy.
The next most common license is the Creative Commons--Attribution license,
usually referred to as \gref{g:cc-by}{CC-BY}.
This allows people to do whatever they want to with the work
as long as they cite the original source.
This is the best license to use for manuscripts,
since you \emph{want} people to share them widely
but also want to get credit for your work.
Other Creative Commons licenses incorporate various restrictions on specific use cases:
\begin{itemize}
\item
ND (no derivative works) prevents people from creating modified versions of your work.
Unfortunately, this also inhibits translation and reformatting.
\item
NC (no commercial use) does \emph{not} mean that people cannot charge money for something that includes your work,
though some publishers still try to imply that in order to scare people away from open licensing.
Instead,
the NC clause means that people cannot charge for something that uses your work without your explicit permission,
which you can give under whatever terms you want.
\item
Finally,
SA (share-alike) requires people to share work that incorporates yours
on the same terms that you used.
Again,
this is fine in principle,
but in practice makes aggregation a headache.
\end{itemize}
\section{Code of Conduct}\label{s:collab-conduct}
You don't expect to have a fire,
but every large building or event should have a fire safety plan.
Similarly,
having a Code of Conduct like \appref{s:conduct} for your project
reduces the uncertainty that participants face about what is acceptable and unacceptable behavior.
You might think this is obvious,
but long experience shows that articulating it clearly and concisely reduces problems caused by having different expectations,
particularly when people from very different cultural backgrounds are trying to collaborate.
An explicit Code of Conduct is particularly helpful for newcomers,
so having one can help your project grow
and encourage people to give you feedback.
Having a Code of Conduct is particularly important for people from marginalized or under-represented groups,
who have probably experienced harassment or unwelcoming behavior before.
By adopting one,
you signal that your project is trying to be a better place than YouTube,
Twitter,
and other online cesspools.
Some people may push back claiming that it's unnecessary,
or that it infringes freedom of speech,
but in our experience,
what they often mean is that thinking about how they might have benefited from past inequity makes them feel uncomfortable,
or that they like to argue for the sake of arguing.
If having a Code of Conduct leads to them going elsewhere,
that will probably make your project run more smoothly.
Just as you shouldn't write your own license for a project,
you probably shouldn't write your own Code of Conduct.
We recommend using the \hreffoot{https://www.contributor-covenant.org}{Contributor Covenant} for development projects
and the \hreffoot{http://geekfeminism.wikia.com/wiki/Conference\_anti-harassment/Policy}{model code of conduct}
from the \hreffoot{http://geekfeminism.wikia.com/}{Geek Feminism Wiki} for in-person events.
Both have been thought through carefully and revised in the light of experience,
and both are now used widely enough that
many potential participants in your project will not need to have them explained.
Rules are meaningless if they aren't enforced.
If you adopt a Code of Conduct,
it is therefore important to be clear about how to report issues and who will handle them.
\cite{Auro2018} is a short, practical guide to handling incidents;
like the Contributor Covenant and the model code of conduct,
it's better to start with something that other people have thought through and refined
than to try to create something from scratch.
\section{Governance}{s:collab-governance}
If your project involves more than half a dozen people,
you should be explicit about the way decisions are made.
We recommend Martha's Rules~\cite{Mina1986}:
\begin{enumerate}
\item
Before each meeting, anyone who wishes may sponsor a proposal. Proposals must
be circulated at least 24 hours before a meeting in order to be considered at
that meeting, and must include:
\begin{itemize}
\item a one-line summary (the subject line of the issue);
\item the full text of the proposal;
\item any required background information;
\item pros and cons; and
\item possible alternatives.
\end{itemize}
\item
A quorum is established in a meeting if half or more of voting members are
present.
\item
Once a person has sponsored a proposal, they are responsible for it. The
group may not discuss or vote on the issue unless the sponsor or their
delegate is present. The sponsor is also responsible for presenting the item
to the group.
\item
After the sponsor presents the proposal, a \emph{sense vote} is cast for the
proposal prior to any discussion:
\begin{itemize}
\item Who likes the proposal?
\item Who can live with the proposal?
\item Who is uncomfortable with the proposal?
\end{itemize}
\item
If all or most of the group likes or can live with the proposal, it is
immediately moved to a formal vote with no further discussion.
\item
If most of the group is uncomfortable with the proposal, it is postponed for
further rework by the sponsor.
\item
If some members are uncomfortable they can briefly state their objections. A
timer is then set for a brief discussion moderated by the facilitator. After
10 minutes or when no one has anything further to add (whichever comes first),
the facilitator calls for a yes-or-no vote on the question: ``Should we
implement this decision over the stated objections?'' If a majority votes
"yes" the proposal is implemented. Otherwise, the proposal is returned to the
sponsor for further work.
\end{enumerate}