doc/documentation/chapters/philosophy.texi

   1 @cindex Philosophy
   2 @node Philosophy
   3 @chapter Philosophy
   4
   5 @c NOTE: We should probably re-use some of the images lynX created
   6 @c for secushare, showing some of the relations and functionalities
   7 @c of GNUnet.
   8 The foremost goal of the GNUnet project is to become a widely used,
   9 reliable, open, non-discriminating, egalitarian, unconstrained and
  10 censorship-resistant system of free information exchange.
  11 We value free speech above state secrets, law-enforcement or
  12 intellectual property.
  13 GNUnet is supposed to be an anarchistic network, where the only
  14 limitation for participants (devices or people making use of the
  15 network, in the following sometimes called peers) is
  16 that they must contribute enough back to the network such that
  17 their resource consumption does not have a significant impact
  18 on other users.
  19 GNUnet should be more than just another file-sharing network.
  20 The plan is to offer many other services and in particular
  21 to serve as a development platform for the next generation of
  22 Internet Protocols.
  23
  24 @menu
  25 * Design Goals::
  26 * Security and Privacy::
  27 * Versatility::
  28 * Practicality::
  29 * Key Concepts::
  30 @end menu
  31
  32 @cindex Design Goals
  33 @cindex Design Goals
  34 @node Design Goals
  35 @section Design Goals
  36
  37 These are the core GNUnet design goals, in order of relative importance:
  38
  39 @itemize
  40 @item GNUnet must be implemented as
  41 @uref{https://www.gnu.org/philosophy/free-sw.html, Free Software}
  42 @c To footnote or not to footnote, that's the question.
  43 @footnote{This means that you you have the four essential freedoms: to run
  44 the program, to study and change the program in source code form,
  45 to redistribute exact copies, and to distribute modified versions.
  46 Refer to @uref{https://www.gnu.org/philosophy/free-sw.html, https://www.gnu.org/philosophy/free-sw.html}}
  47 @item GNUnet must only disclose the minimal amount of information
  48 necessary.
  49 @c TODO: Explain 'fully' in the terminology section.
  50 @item GNUnet must be fully distributed and survive
  51 @uref{https://en.wikipedia.org/wiki/Byzantine_fault_tolerance, Byzantine failures}
  52 @footnote{@uref{https://en.wikipedia.org/wiki/Byzantine_fault_tolerance, https://en.wikipedia.org/wiki/Byzantine_fault_tolerance}}
  53 at any position in the network.
  54 @item GNUnet must make it explicit to the user which entities are
  55 considered to be trustworthy when establishing secured communications.
  56 @item GNUnet must use compartmentalization to protect sensitive
  57 information.
  58 @item GNUnet must be open and permit new peers to join.
  59 @item GNUnet must be self-organizing and not depend on administrators.
  60 @item GNUnet must support a diverse range of applications and devices.
  61 @item The GNUnet architecture must be cost effective.
  62 @item GNUnet must provide incentives for peers to contribute more
  63 resources than they consume.
  64 @end itemize
  65
  66
  67 @cindex Security and Privacy
  68 @node Security and Privacy
  69 @section Security and Privacy
  70
  71 GNUnet's primary design goals are to protect the privacy of its users and
  72 to guard itself against attacks or abuse.
  73 GNUnet does not have any mechanisms to control, track or censor users.
  74 Instead, the GNUnet protocols aim to make it as hard as possible to
  75 find out what is happening on the network or to disrupt operations.
  76
  77 @cindex Versatility
  78 @node Versatility
  79 @section Versatility
  80
  81 We call GNUnet a peer-to-peer framework because we want to support many
  82 different forms of peer-to-peer applications. GNUnet uses a plugin
  83 architecture to make the system extensible and to encourage code reuse.
  84 While the first versions of the system only supported anonymous
  85 file-sharing, other applications are being worked on and more will
  86 hopefully follow in the future.
  87 A powerful synergy regarding anonymity services is created by a large
  88 community utilizing many diverse applications over the same software
  89 infrastructure. The reason is that link encryption hides the specifics
  90 of the traffic for non-participating observers. This way, anonymity can
  91 get stronger with additional (GNUnet) traffic, even if the additional
  92 traffic is not related to anonymous communication. Increasing anonymity
  93 is the primary reason why GNUnet is developed to become a peer-to-peer
  94 framework where many applications share the lower layers of an
  95 increasingly complex protocol stack.
  96 If merging traffic to hinder traffic analysis was not important,
  97 we could have just developed a dozen stand-alone applications
  98 and a few shared libraries.
  99
 100 @cindex Practicality
 101 @node Practicality
 102 @section Practicality
 103
 104 GNUnet allows participants to trade various amounts of security in
 105 exchange for increased efficiency. However, it is not possible for any
 106 user's security and efficiency requirements to compromise the security
 107 and efficiency of any other user.
 108
 109 For GNUnet, efficiency is not paramount. If there were a more secure and
 110 still practical approach, we would choose to take the more secure
 111 alternative. @command{telnet} is more efficient than @command{ssh}, yet
 112 it is obsolete.
 113 Hardware gets faster, and code can be optimized. Fixing security issues
 114 as an afterthought is much harder.
 115
 116 While security is paramount, practicability is still a requirement.
 117 The most secure system is always the one that nobody can use.
 118 Similarly, any anonymous system that is extremely inefficient will only
 119 find few users.
 120 However, good anonymity requires a large and diverse user base. Since
 121 individual security requirements may vary, the only good solution here is
 122 to allow individuals to trade-off security and efficiency.
 123 The primary challenge in allowing this is to ensure that the economic
 124 incentives work properly.
 125 In particular, this means that it must be impossible for a user to gain
 126 security at the expense of other users. Many designs (e.g. anonymity via
 127 broadcast) fail to give users an incentive to choose a less secure but
 128 more efficient mode of operation.
 129 GNUnet should avoid where ever possible to rely on protocols that will
 130 only work if the participants are benevolent.
 131 While some designs have had widespread success while relying on parties
 132 to observe a protocol that may be sub-optimal for the individuals (e.g.
 133 TCP Nagle), a protocol that ensures that individual goals never conflict
 134 with the goals of the group is always preferable.
 135
 136 @cindex Key Concepts
 137 @node Key Concepts
 138 @section Key Concepts
 139
 140 In this section, the fundamental concepts of GNUnet are explained.
 141 @c FIXME: Use @uref{https://docs.gnunet.org/bib/, research papers}
 142 @c once we have the new bibliography + subdomain setup.
 143 Most of them are also described in our research papers.
 144 First, some of the concepts used in the GNUnet framework are detailed.
 145 The second part describes concepts specific to anonymous file-sharing.
 146
 147 @menu
 148 * Authentication::
 149 * Accounting to Encourage Resource Sharing::
 150 * Confidentiality::
 151 * Anonymity::
 152 * Deniability::
 153 * Peer Identities::
 154 * Zones in the GNU Name System (GNS Zones)::
 155 * Egos::
 156 @end menu
 157
 158 @cindex Authentication
 159 @node Authentication
 160 @subsection Authentication
 161
 162 Almost all peer-to-peer communications in GNUnet are between mutually
 163 authenticated peers. The authentication works by using ECDHE, that is a
 164 DH (Diffie---Hellman) key exchange using ephemeral eliptic curve
 165 cryptography. The ephemeral ECC (Eliptic Curve Cryptography) keys are
 166 signed using ECDSA (@uref{http://en.wikipedia.org/wiki/ECDSA, ECDSA}).
 167 The shared secret from ECDHE is used to create a pair of session keys
 168 @c FIXME: LOng word for HKDF. More FIXMEs: Explain MITM etc.
 169 (using HKDF) which are then used to encrypt the communication between the
 170 two peers using both 256-bit AES (Advanced Encryption Standard)
 171 and 256-bit Twofish (with independently derived secret keys).
 172 As only the two participating hosts know the shared secret, this
 173 authenticates each packet
 174 without requiring signatures each time. GNUnet uses SHA-512
 175 (Secure Hash Algorithm) hash codes to verify the integrity of messages.
 176
 177 In GNUnet, the identity of a host is its public key. For that reason,
 178 man-in-the-middle attacks will not break the authentication or accounting
 179 goals. Essentially, for GNUnet, the IP of the host has nothing to do with
 180 the identity of the host. As the public key is the only thing that truly
 181 matters, faking an IP, a port or any other property of the underlying
 182 transport protocol is irrelevant. In fact, GNUnet peers can use
 183 multiple IPs (IPv4 and IPv6) on multiple ports --- or even not use the
 184 IP protocol at all (by running directly on layer 2).
 185
 186 @c NOTE: For consistency we will use @code{HELLO}s throughout this Manual.
 187 GNUnet uses a special type of message to communicate a binding between
 188 public (ECC) keys to their current network address. These messages are
 189 commonly called @code{HELLO}s or peer advertisements.
 190 They contain the public key of the peer and its current network
 191 addresses for various transport services.
 192 A transport service is a special kind of shared library that
 193 provides (possibly unreliable, out-of-order) message delivery between
 194 peers.
 195 For the UDP and TCP transport services, a network address is an IP and a
 196 port.
 197 GNUnet can also use other transports (HTTP, HTTPS, WLAN, etc.) which use
 198 various other forms of addresses. Note that any node can have many
 199 different active transport services at the same time,
 200 and each of these can have a different addresses.
 201 Binding messages expire after at most a week (the timeout can be
 202 shorter if the user configures the node appropriately).
 203 This expiration ensures that the network will eventually get rid of
 204 outdated advertisements.
 205 @footnote{Ronaldo A. Ferreira, Christian Grothoff, and Paul Ruth.
 206 A Transport Layer Abstraction for Peer-to-Peer Networks
 207 Proceedings of the 3rd International Symposium on Cluster Computing
 208 and the Grid (GRID 2003), 2003.
 209 (@uref{https://gnunet.org/git/bibliography.git/plain/docs/transport.pdf, https://gnunet.org/git/bibliography.git/plain/docs/transport.pdf})}
 210
 211 @cindex Accounting to Encourage Resource Sharing
 212 @node Accounting to Encourage Resource Sharing
 213 @subsection Accounting to Encourage Resource Sharing
 214
 215 Most distributed P2P networks suffer from a lack of defenses or
 216 precautions against attacks in the form of freeloading.
 217 While the intentions of an attacker and a freeloader are different, their
 218 effect on the network is the same; they both render it useless.
 219 Most simple attacks on networks such as @command{Gnutella}
 220 involve flooding the network with traffic, particularly
 221 with queries that are, in the worst case, multiplied by the network.
 222
 223 In order to ensure that freeloaders or attackers have a minimal impact on
 224 the network, GNUnet's file-sharing implementation tries to distinguish
 225 good (contributing) nodes from malicious (freeloading) nodes. In GNUnet,
 226 every file-sharing node keeps track of the behavior of every other node it
 227 has been in contact with. Many requests (depending on the application)
 228 are transmitted with a priority (or importance) level.
 229 That priority is used to establish how important the sender believes
 230 this request is. If a peer responds to an important request, the
 231 recipient will increase its trust in the responder:
 232 the responder contributed resources.
 233 If a peer is too busy to answer all requests, it needs to prioritize.
 234 For that, peers do not take the priorities of the requests received at
 235 face value.
 236 First, they check how much they trust the sender, and depending on that
 237 amount of trust they assign the request a (possibly lower) effective
 238 priority. Then, they drop the requests with the lowest effective priority
 239 to satisfy their resource constraints. This way, GNUnet's economic model
 240 ensures that nodes that are not currently considered to have a surplus in
 241 contributions will not be served if the network load is high.
 242 @footnote{Christian Grothoff. An Excess-Based Economic Model for Resource
 243 Allocation in Peer-to-Peer Networks. Wirtschaftsinformatik, June 2003.
 244 (@uref{https://gnunet.org/git/bibliography.git/plain/docs/ebe.pdf, https://gnunet.org/git/bibliography.git/plain/docs/ebe.pdf})}
 245 @c 2009?
 246
 247 @cindex Confidentiality
 248 @node Confidentiality
 249 @subsection Confidentiality
 250
 251 Adversaries outside of GNUnet are not supposed to know what kind of
 252 actions a peer is involved in. Only the specific neighbor of a peer that
 253 is the corresponding sender or recipient of a message may know its
 254 contents, and even then application protocols may place further
 255 restrictions on that knowledge.
 256 In order to ensure confidentiality, GNUnet uses link encryption, that is
 257 each message exchanged between two peers is encrypted using a pair of
 258 keys only known to these two peers.
 259 Encrypting traffic like this makes any kind of traffic analysis much
 260 harder. Naturally, for some applications, it may still be desirable if
 261 even neighbors cannot determine the concrete contents of a message.
 262 In GNUnet, this problem is addressed by the specific application-level
 263 protocols (see for example, deniability and anonymity in anonymous file
 264 sharing).
 265
 266 @cindex Anonymity
 267 @node Anonymity
 268 @subsection Anonymity
 269
 270 @menu
 271 * How file-sharing achieves Anonymity::
 272 @end menu
 273
 274 Providing anonymity for users is the central goal for the anonymous
 275 file-sharing application. Many other design decisions follow in the
 276 footsteps of this requirement.
 277 Anonymity is never absolute. While there are various
 278 scientific metrics@footnote{Claudia Díaz, Stefaan Seys, Joris Claessens,
 279 and Bart Preneel. Towards measuring anonymity.
 280 2002.
 281 (@uref{https://gnunet.org/git/bibliography.git/plain/docs/article-89.pdf, https://gnunet.org/git/bibliography.git/plain/docs/article-89.pdf})}
 282 that can help quantify the level of anonymity that a given mechanism
 283 provides, there is no such thing as complete anonymity.
 284 GNUnet's file-sharing implementation allows users to select for each
 285 operation (publish, search, download) the desired level of anonymity.
 286 The metric used is the amount of cover traffic available to hide the
 287 request.
 288 While this metric is not as good as, for example, the theoretical metric
 289 given in scientific metrics@footnote{likewise},
 290 it is probably the best metric available to a peer with a purely local
 291 view of the world that does not rely on unreliable external information.
 292 The default anonymity level is 1, which uses anonymous routing but
 293 imposes no minimal requirements on cover traffic. It is possible
 294 to forego anonymity when this is not required. The anonymity level of 0
 295 allows GNUnet to use more efficient, non-anonymous routing.
 296
 297 @cindex How file-sharing achieves Anonymity
 298 @node How file-sharing achieves Anonymity
 299 @subsubsection How file-sharing achieves Anonymity
 300
 301 Contrary to other designs, we do not believe that users achieve strong
 302 anonymity just because their requests are obfuscated by a couple of
 303 indirections. This is not sufficient if the adversary uses traffic
 304 analysis.
 305 The threat model used for anonymous file sharing in GNUnet assumes that
 306 the adversary is quite powerful.
 307 In particular, we assume that the adversary can see all the traffic on
 308 the Internet. And while we assume that the adversary
 309 can not break our encryption, we assume that the adversary has many
 310 participating nodes in the network and that it can thus see many of the
 311 node-to-node interactions since it controls some of the nodes.
 312
 313 The system tries to achieve anonymity based on the idea that users can be
 314 anonymous if they can hide their actions in the traffic created by other
 315 users.
 316 Hiding actions in the traffic of other users requires participating in the
 317 traffic, bringing back the traditional technique of using indirection and
 318 source rewriting. Source rewriting is required to gain anonymity since
 319 otherwise an adversary could tell if a message originated from a host by
 320 looking at the source address. If all packets look like they originate
 321 from one node, the adversary can not tell which ones originate from that
 322 node and which ones were routed.
 323 Note that in this mindset, any node can decide to break the
 324 source-rewriting paradigm without violating the protocol, as this
 325 only reduces the amount of traffic that a node can hide its own traffic
 326 in.
 327
 328 If we want to hide our actions in the traffic of other nodes, we must make
 329 our traffic indistinguishable from the traffic that we route for others.
 330 As our queries must have us as the receiver of the reply
 331 (otherwise they would be useless), we must put ourselves as the receiver
 332 of replies that actually go to other hosts; in other words, we must
 333 indirect replies.
 334 Unlike other systems, in anonymous file-sharing as implemented on top of
 335 GNUnet we do not have to indirect the replies if we don't think we need
 336 more traffic to hide our own actions.
 337
 338 This increases the efficiency of the network as we can indirect less under
 339 higher load.@footnote{Krista Bennett and Christian Grothoff.
 340 GAP --- practical anonymous networking. In Proceedings of
 341 Designing Privacy Enhancing Technologies, 2003.
 342 (@uref{https://gnunet.org/git/bibliography.git/plain/docs/aff.pdf, https://gnunet.org/git/bibliography.git/plain/docs/aff.pdf})}
 343
 344 @cindex Deniability
 345 @node Deniability
 346 @subsection Deniability
 347
 348 Even if the user that downloads data and the server that provides data are
 349 anonymous, the intermediaries may still be targets. In particular, if the
 350 intermediaries can find out which queries or which content they are
 351 processing, a strong adversary could try to force them to censor
 352 certain materials.
 353
 354 With the file-encoding used by GNUnet's anonymous file-sharing, this
 355 problem does not arise.
 356 The reason is that queries and replies are transmitted in
 357 an encrypted format such that intermediaries cannot tell what the query
 358 is for or what the content is about.  Mind that this is not the same
 359 encryption as the link-encryption between the nodes.  GNUnet has
 360 encryption on the network layer (link encryption, confidentiality,
 361 authentication) and again on the application layer (provided
 362 by @command{gnunet-publish}, @command{gnunet-download},
 363 @command{gnunet-search} and @command{gnunet-gtk}).
 364 @footnote{Christian Grothoff, Krista Grothoff, Tzvetan Horozov,
 365 and Jussi T. Lindgren.
 366 An Encoding for Censorship-Resistant Sharing.
 367 2009.
 368 (@uref{https://gnunet.org/git/bibliography.git/plain/docs/ecrs.pdf, https://gnunet.org/git/bibliography.git/plain/docs/ecrs.pdf})}
 369
 370 @cindex Peer Identities
 371 @node Peer Identities
 372 @subsection Peer Identities
 373
 374 Peer identities are used to identify peers in the network and are unique
 375 for each peer. The identity for a peer is simply its public key, which is
 376 generated along with a private key the peer is started for the first time.
 377 While the identity is binary data, it is often expressed as ASCII string.
 378 For example, the following is a peer identity as you might see it in
 379 various places:
 380
 381 @example
 382 UAT1S6PMPITLBKSJ2DGV341JI6KF7B66AC4JVCN9811NNEGQLUN0
 383 @end example
 384
 385 @noindent
 386 You can find your peer identity by running @command{gnunet-peerinfo -s}.
 387
 388 @cindex Zones in the GNU Name System (GNS Zones)
 389 @node Zones in the GNU Name System (GNS Zones)
 390 @subsection Zones in the GNU Name System (GNS Zones)
 391
 392 @c FIXME: Explain or link to an explanation of the concept of public keys
 393 @c and private keys.
 394 GNS@footnote{Matthias Wachs, Martin Schanzenbach, and Christian Grothoff.
 395 A Censorship-Resistant, Privacy-Enhancing and Fully Decentralized Name
 396 System. In proceedings of 13th International Conference on Cryptology and
 397 Network Security (CANS 2014). 2014.
 398 @uref{https://gnunet.org/git/bibliography.git/plain/docs/gns2014wachs.pdf, https://gnunet.org/git/bibliography.git/plain/docs/gns2014wachs.pdf}}
 399 zones are similar to those of DNS zones, but instead of a hierarchy of
 400 authorities to governing their use, GNS zones are controlled by a private
 401 key.
 402 When you create a record in a DNS zone, that information stored in your
 403 nameserver. Anyone trying to resolve your domain then gets pointed
 404 (hopefully) by the centralised authority to your nameserver.
 405 Whereas GNS, being fully decentralized by design, stores that information
 406 in DHT. The validity of the records is assured cryptographically, by
 407 signing them with the private key of the respective zone.
 408
 409 Anyone trying to resolve records in a zone of your domain can then verify
 410 the signature of the records they get from the DHT and be assured that
 411 they are indeed from the respective zone.
 412 To make this work, there is a 1:1 correspondence between zones and
 413 their public-private key pairs.
 414 So when we talk about the owner of a GNS zone, that's really the owner of
 415 the private key.
 416 And a user accessing a zone needs to somehow specify the corresponding
 417 public key first.
 418
 419 @cindex Egos
 420 @node Egos
 421 @subsection Egos
 422
 423 @c what is the difference between peer identity and egos? It seems
 424 @c like both are linked to public-private key pair.
 425 Egos are your "identities" in GNUnet. Any user can assume multiple
 426 identities, for example to separate their activities online. Egos can
 427 correspond to pseudonyms or real-world identities. Technically, an
 428 ego is first of all a public-private key pair.